← Learning Portal

LLM From Scratch

Clearer Part III rewrites using the same book renderer, but a slower and more first-principles teaching style.

4
Chapters
57 KB
Content
Review
Separate Path

This path is only for review. The original chapters remain unchanged in the main book.

All four Part III rewrite chapters are available here, along with a standalone worked-example guide. Once the content and rendering are right, these can replace the existing Part III chapters.

Companion Guide

One standalone walkthrough that keeps every RL quantity tied to the same toy LLM answer

Part III: Reinforcement Learning & Alignment

Alternative chapters for RL fundamentals, policy optimization, RLHF, and DPO/alignment methods