Quantizing LLMs with PyTorch and Hugging Face
Optimize Memory and Speed for Large Language Models with Advanced Quantization Techniques
5.00 (3 reviews)

737
students
2 hours
content
Nov 2024
last update
$54.99
regular price
What you will learn
Gain an intuitive understanding of linear quantization
Learn different linear quantization techniques
Learn from a high-level how 2 & 4-bit quantization works
Learn how to quantize LLMs from Hugging Face
Screenshots




6287745
udemy ID
11/14/2024
course created date
11/18/2024
course indexed date
Bot
course submited by