LLM From Scratch

A complete technical textbook covering everything from mathematical foundations to frontier model architectures, reinforcement learning, and beyond.

30 chapters, 1.8 MB of content, ~2000 pages

Part I: Foundations

Mathematical prerequisites, neural network fundamentals, and the path to Transformers

1

Mathematical Foundations

Linear algebra, calculus, probability, and information theory for deep learning

58 KB

2

Neural Networks Deep Dive

Perceptrons to deep networks, backpropagation, activation functions, optimization

73 KB

3

Sequence Modeling

RNNs, LSTMs, GRUs, seq2seq, and the attention revolution

32 KB

4

The Transformer Architecture

Self-attention, multi-head attention, positional encoding, encoder-decoder

80 KB

Part II: Language Model Training

From raw text to pretrained language models

Companion Guide

A standalone walkthrough that keeps every major RL and RLHF quantity tied to one toy LLM answer tree

Ex

Worked LLM RL Example

One prompt traced through return, value, advantage, GAE, PPO, KL control, reward shaping, critics, and GRPO

81 KB

Part III: Reinforcement Learning & Alignment

From RL fundamentals to RLHF, DPO, and modern alignment techniques

Part IV: Model Families

The evolution of frontier LLMs from GPT to DeepSeek

Part V: Efficiency & Optimization

Making large models practical: attention, quantization, sparsity, and adaptation

Part VI: Advanced Capabilities

Multimodal understanding, reasoning, agents, and tool use

Part VII: Safety, Interpretability & the Future

Understanding, aligning, and evolving language models

Part VIII: Practice & Production

Data pipelines, inference systems, evaluation, prompt engineering, and synthetic data

Essays by the Author

Opinionated perspectives on the future of AI

E1

Continual Learning as the Path to AGI

Why compression into concept space — not external memory — is the key to recursive self-improvement

~15 min

Research Notes

Deep dives into open questions and emerging ideas

R1

Why Neural Networks Forget — And How They Might Stop

Catastrophic forgetting mechanisms, brain-inspired solutions, self-directed learning, continuous learning systems

~40 min