Quantization for GenAI Models
Unlock the power of model optimization! Learn how to apply quantization and make your GenAI models efficient with Python
4.55 (30 reviews)

3,272
students
2.5 hours
content
Feb 2025
last update
$54.99
regular price
What you will learn
Understand model optimization techniques: Pruning, Distillation, and Quantization
Learn the basics of data types like FP32, FP16, BFloat16, and INT8
Master downcasting from FP32 to BF16 and FP32 to INT8
Learn the difference between symmetric and asymmetric quantization
Implement quantization techniques in Python with real examples
Apply quantization to make models more efficient and deployment-ready
Gain practical skills to optimize models for edge devices and resource-constrained environments
Screenshots




6252973
udemy ID
10/24/2024
course created date
11/7/2024
course indexed date
Bot
course submited by