Attention in Transformers: Concepts and Code in PyTorchUnderstand and implement the attention mechanism, a key element of transformer-based LLMs, using PyTorch.StatQuest
How Diffusion Models WorkLearn and build diffusion models from the ground up, understanding each step. Learn about diffusion models in use today and implement algorithms to speed up sampling.
Introduction to on-device AIDeploy AI for edge devices and smartphones. Learn model conversion, quantization, and how to modify for deployment on diverse devices. Qualcomm
How Transformer LLMs WorkUnderstand the transformer architecture that powers LLMs to use them more effectively.Jay Alammar, Maarten Grootendorst