Quantization for GenAI Models

Unlock the power of model optimization! Learn how to apply quantization and make your GenAI models efficient with Python
4.55 (30 reviews)
Udemy
platform
English
language
Data Science
category
Quantization for GenAI Models
3,272
students
2.5 hours
content
Feb 2025
last update
$54.99
regular price

What you will learn

Understand model optimization techniques: Pruning, Distillation, and Quantization

Learn the basics of data types like FP32, FP16, BFloat16, and INT8

Master downcasting from FP32 to BF16 and FP32 to INT8

Learn the difference between symmetric and asymmetric quantization

Implement quantization techniques in Python with real examples

Apply quantization to make models more efficient and deployment-ready

Gain practical skills to optimize models for edge devices and resource-constrained environments

Screenshots

Quantization for GenAI Models - Screenshot_01Quantization for GenAI Models - Screenshot_02Quantization for GenAI Models - Screenshot_03Quantization for GenAI Models - Screenshot_04
6252973
udemy ID
10/24/2024
course created date
11/7/2024
course indexed date
Bot
course submited by