LLM Concepts
From transformer architecture to cutting-edge research — each concept explained with intuition, math, and connections to the bigger picture.
Start Module 01Curriculum
A structured path through the course content.
Foundational Architecture
Core transformer components — self-attention, multi-head attention, feed-forward networks, residual connections, and architectural variants like MoE and sparse attention.
Input Representation
Tokenization, positional encoding, embeddings, and how text becomes numbers.
Training Fundamentals
Optimization, loss functions, scaling laws, and training data.
Distributed Training
Parallelism strategies and distributed systems for large-scale training.
Alignment & Post-Training
RLHF, DPO, reward modeling, and preference learning.
Parameter-Efficient Fine-Tuning
LoRA, adapters, and methods for efficient model adaptation.
Inference & Deployment
Serving, decoding strategies, caching, and quantization.
Practical Applications
RAG, agents, tool use, and prompt engineering.
Safety & Alignment
Attacks, defenses, alignment failures, and guardrails.
Evaluation
Benchmarks, metrics, and evaluation methodology.
Advanced & Emerging
Cutting-edge research and emerging techniques.