English · Español

Study any chapter¶

The ideal path is sequential — the dependency chain in LYNX_CORTEX.md §6 exists for a reason. But the curriculum also works as a reference work: every row links to the chapter, lists its direct conceptual prerequisites (informational, never locking), the phases it unlocks, and the concepts it teaches. Use site search or this table to jump straight to the topic you need.

Dependency graph¶

graph LR
  P00["00 Project Foundations & Learning Methodology"]
  P01["01 Hardware & Computing Substrate"]
  P02["02 Numerical Representation"]
  P03["03 Linear Algebra from First Principles"]
  P04["04 Calculus & Optimization for AI"]
  P05["05 Probability & Information Theory"]
  P06["06 Python for AI Engineering"]
  P07["07 Scalar Autograd from Scratch"]
  P08["08 Tensor Autograd from Scratch"]
  P09["09 Tiny MLP & Module Abstraction"]
  P10["10 Initialization, Normalization, Residuals"]
  P11["11 Tokenization Theory + BPE Implementation"]
  P12["12 The Corpus"]
  P13["13 Embeddings & Representation Spaces"]
  P14["14 Pre-Transformer Sequence Models"]
  P15["15 Attention from Scratch"]
  P16["16 Positional Encodings"]
  P17["17 Tiny Transformer Block & Mini-GPT"]
  P18["18 Training Loop, Mixed Precision Preview, Checkpointing"]
  P19["19 Training Dynamics & Debugging"]
  P20["20 Evaluation Harness"]
  P21["21 Inference Internals & Sampling"]
  P22["22 KV Cache"]
  P23["23 GPU Architecture Fundamentals"]
  P24["24 CUDA & Triton Hands-On"]
  P25["25 PyTorch Internals"]
  P26["26 Quantization Deep Dive"]
  P27["27 Modern Attention Optimizations"]
  P28["28 Fine-Tuning, LoRA, QLoRA"]
  P29["29 Retrieval-Augmented Generation"]
  P30["30 Structured Generation & Constrained Decoding"]
  P31["31 Tool Use & the Model Context Protocol"]
  P32["32 Agents"]
  P33["33 Inference Serving"]
  P34["34 Observability, Cost & Capacity"]
  P35["35 Distributed Training & Inference"]
  P36["36 Frontier Architectures"]
  P37["37 Security & Safety of AI Systems"]
  P38["38 Cost, Capacity, Operations, MLOps"]
  P39["39 Capstone"]
  P40["40 Hardening, Postmortem, 'What's Next'"]
  P41["41 Learner Portal"]
  PX1["X1 Pretraining at Scale"]
  PX2["X2 Multi-Modal Models"]
  PX3["X3 RLHF / DPO / RLAIF"]
  PX4["X4 Hardware Deep-Dive"]
  PX5["X5 Interview Prep"]
  P00 --> P01
  P01 --> P02
  P02 --> P03
  P03 --> P04
  P04 --> P05
  P05 --> P06
  P04 --> P07
  P06 --> P07
  P07 --> P08
  P08 --> P09
  P09 --> P10
  P10 --> P11
  P11 --> P12
  P11 --> P13
  P12 --> P13
  P13 --> P14
  P13 --> P15
  P14 --> P15
  P15 --> P16
  P10 --> P17
  P15 --> P17
  P16 --> P17
  P12 --> P18
  P17 --> P18
  P18 --> P19
  P19 --> P20
  P20 --> P21
  P15 --> P22
  P21 --> P22
  P01 --> P23
  P22 --> P23
  P23 --> P24
  P08 --> P25
  P24 --> P25
  P02 --> P26
  P25 --> P26
  P15 --> P27
  P22 --> P27
  P26 --> P27
  P26 --> P28
  P27 --> P28
  P13 --> P29
  P28 --> P29
  P21 --> P30
  P29 --> P30
  P30 --> P31
  P29 --> P32
  P31 --> P32
  P22 --> P33
  P32 --> P33
  P33 --> P34
  P18 --> P35
  P34 --> P35
  P35 --> P36
  P29 --> P37
  P31 --> P37
  P36 --> P37
  P37 --> P38
  P38 --> P39
  P39 --> P40
  P40 --> P41
  P35 --> PX1
  P15 --> PX2
  P17 --> PX2
  P28 --> PX2
  P05 --> PX3
  P19 --> PX3
  P28 --> PX3
  P23 --> PX4
  P35 --> PX4
  P40 --> PX5

Phase index¶

Phase	Chapter	Requires	Unlocks	Teaches
00	Project Foundations & Learning Methodology	—	01	`reproducibility` `seeding` `lockfile` `pre-commit` `ci` `mypy`
01	Hardware & Computing Substrate	00	02 23	`memory-hierarchy` `roofline` `arithmetic-intensity` `cache` `latency` `bandwidth`
02	Numerical Representation	01	03 26	`ieee-754` `floating-point` `softmax-stability` `log-sum-exp` `precision` `denormals`
03	Linear Algebra from First Principles	02	04	`tensors` `matmul` `einsum` `svd` `rank` `norms`
04	Calculus & Optimization for AI	03	05 07	`gradients` `chain-rule` `backprop` `sgd` `momentum` `adam`
05	Probability & Information Theory	04	06 X3	`probability` `entropy` `kl-divergence` `cross-entropy` `mle` `perplexity`
06	Python for AI Engineering	05	07	`numpy` `strides` `broadcasting` `views` `vectorization` `profiling`
07	Scalar Autograd from Scratch (`minigrad`)	04 06	08	`autograd` `computation-graph` `reverse-mode` `topological-sort` `dag`
08	Tensor Autograd from Scratch	07	09 25	`tensor-autograd` `broadcasting-backward` `gradcheck` `matmul-grad` `softmax-grad`
09	Tiny MLP & Module Abstraction (`minitorch`)	08	10	`module-abstraction` `parameter-registration` `linear` `sequential` `optimizers`
10	Initialization, Normalization, Residuals	09	11 17	`initialization` `xavier` `kaiming` `layer-norm` `rms-norm` `residuals` `pre-ln`
11	Tokenization Theory + BPE Implementation	10	12 13	`tokenization` `bpe` `byte-level` `subwords` `vocabulary` `zipf`
12	The Corpus: Designing the Microscopic Dataset	11	13 18	`corpus-design` `enumeration` `stratified-split` `data-manifest` `reproducibility`
13	Embeddings & Representation Spaces	11 12	14 15 29	`embeddings` `cbow` `cosine-similarity` `dimensionality` `representation-geometry`
14	Pre-Transformer Sequence Models	13	15	`n-gram` `rnn` `lstm` `vanishing-gradient` `perplexity-baselines`
15	Attention from Scratch	13 14	16 17 22 27 X2	`attention` `scaled-dot-product` `multi-head` `causal-mask` `query-key-value`
16	Positional Encodings	15	17	`positional-encoding` `rope` `sinusoidal` `extrapolation`
17	Tiny Transformer Block & Mini-GPT	10 15 16	18 X2	`transformer-block` `pre-ln` `ffn` `gelu` `tied-embeddings` `lm-head`
18	Training Loop, Mixed Precision Preview, Checkpointing	12 17	19 35	`training-loop` `batching` `adamw` `warmup` `cosine-decay` `checkpointing`
19	Training Dynamics & Debugging	18	20 X3	`instrumentation` `hooks` `gradient-norms` `loss-curves` `debugging`
20	Evaluation Harness	19	21	`evaluation` `accuracy-probes` `calibration` `adversarial-eval` `bootstrap-ci`
21	Inference Internals & Sampling	20	22 30	`sampling` `temperature` `top-k` `top-p` `decode-cost`
22	KV Cache: From Math to Memory	15 21	23 27 33	`kv-cache` `prefill` `decode` `memory-bound` `arithmetic-intensity`
23	GPU Architecture Fundamentals	01 22	24 X4	`gpu-architecture` `sm` `warps` `occupancy` `hbm` `coalescing`
24	CUDA & Triton Hands-On	23	25	`cuda` `triton` `kernels` `shared-memory` `tiling`
25	PyTorch Internals	08 24	26	`dispatcher` `autograd-graph` `torch-compile` `custom-ops`
26	Quantization Deep Dive	02 25	27 28	`quantization` `int8` `nf4` `gptq` `gguf` `calibration`
27	Modern Attention Optimizations	15 22 26	28	`flash-attention` `online-softmax` `paged-attention` `gqa` `mqa`
28	Fine-Tuning, LoRA, QLoRA	26 27	29 X2 X3	`fine-tuning` `sft` `lora` `qlora` `catastrophic-forgetting`
29	Retrieval-Augmented Generation (RAG)	13 28	30 32 37	`rag` `chunking` `bm25` `hnsw` `hybrid-search` `reranking`
30	Structured Generation & Constrained Decoding	21 29	31	`structured-generation` `json-mode` `logit-masking` `grammar-constrained-decoding`
31	Tool Use & the Model Context Protocol (MCP)	30	32 37	`tool-use` `function-calling` `mcp` `json-schema` `json-rpc`
32	Agents: Planning, Memory, Sandboxing (Grammar Tutor)	29 31	33	`agents` `react-loop` `planning` `agent-memory` `sandboxing`
33	Inference Serving: From FastAPI to Continuous Batching	22 32	34	`serving` `continuous-batching` `scheduling` `littles-law` `load-testing`
34	Observability, Cost & Capacity	33	35	`observability` `red-metrics` `opentelemetry` `prometheus` `grafana` `cost-accounting`
35	Distributed Training & Inference	18 34	36 X1 X4	`distributed-training` `ddp` `fsdp` `tensor-parallel` `allreduce` `nccl`
36	Frontier Architectures	35	37	`mixture-of-experts` `mamba` `state-space-models` `speculative-decoding`
37	Security & Safety of AI Systems	29 31 36	38	`prompt-injection` `jailbreaks` `threat-modeling` `supply-chain-security` `red-teaming`
38	Cost, Capacity, Operations, MLOps	37	39	`mlops` `model-registry` `ab-testing` `canary-deploys` `drift-detection`
39	Capstone: The Miniature Production System	38	40	`integration` `end-to-end-demo` `architecture-diagrams` `cost-reporting`
40	Hardening, Postmortem, "What's Next"	39	41 X5	`postmortem` `hardening` `performance-audit` `synthesis`
41	Learner Portal: Delivering the Curriculum	40	—	`web-portal` `argon2id` `csrf` `spaced-repetition` `multi-tenancy`
X1	Pretraining at Scale	35	—	`scaling-laws` `chinchilla` `mfu` `pretraining` `cluster-economics`
X2	Multi-Modal Models	15 17 28	—	`vit` `clip` `whisper` `contrastive-learning` `multimodal-fusion`
X3	RLHF / DPO / RLAIF	05 19 28	—	`rlhf` `reward-modeling` `ppo` `dpo` `constitutional-ai`
X4	Hardware Deep-Dive	23 35	—	`h100` `nvlink` `allreduce` `interconnects` `datacenter-economics`
X5	Interview Prep	40	—	`whiteboarding` `systems-design` `paper-reading` `storytelling`