English · Español
Study any chapter¶
The ideal path is sequential — the dependency chain in LYNX_CORTEX.md §6 exists for a reason. But the curriculum also works as a reference work: every row links to the chapter, lists its direct conceptual prerequisites (informational, never locking), the phases it unlocks, and the concepts it teaches. Use site search or this table to jump straight to the topic you need.
Dependency graph¶
graph LR
P00["00 Project Foundations & Learning Methodology"]
P01["01 Hardware & Computing Substrate"]
P02["02 Numerical Representation"]
P03["03 Linear Algebra from First Principles"]
P04["04 Calculus & Optimization for AI"]
P05["05 Probability & Information Theory"]
P06["06 Python for AI Engineering"]
P07["07 Scalar Autograd from Scratch"]
P08["08 Tensor Autograd from Scratch"]
P09["09 Tiny MLP & Module Abstraction"]
P10["10 Initialization, Normalization, Residuals"]
P11["11 Tokenization Theory + BPE Implementation"]
P12["12 The Corpus"]
P13["13 Embeddings & Representation Spaces"]
P14["14 Pre-Transformer Sequence Models"]
P15["15 Attention from Scratch"]
P16["16 Positional Encodings"]
P17["17 Tiny Transformer Block & Mini-GPT"]
P18["18 Training Loop, Mixed Precision Preview, Checkpointing"]
P19["19 Training Dynamics & Debugging"]
P20["20 Evaluation Harness"]
P21["21 Inference Internals & Sampling"]
P22["22 KV Cache"]
P23["23 GPU Architecture Fundamentals"]
P24["24 CUDA & Triton Hands-On"]
P25["25 PyTorch Internals"]
P26["26 Quantization Deep Dive"]
P27["27 Modern Attention Optimizations"]
P28["28 Fine-Tuning, LoRA, QLoRA"]
P29["29 Retrieval-Augmented Generation"]
P30["30 Structured Generation & Constrained Decoding"]
P31["31 Tool Use & the Model Context Protocol"]
P32["32 Agents"]
P33["33 Inference Serving"]
P34["34 Observability, Cost & Capacity"]
P35["35 Distributed Training & Inference"]
P36["36 Frontier Architectures"]
P37["37 Security & Safety of AI Systems"]
P38["38 Cost, Capacity, Operations, MLOps"]
P39["39 Capstone"]
P40["40 Hardening, Postmortem, 'What's Next'"]
P41["41 Learner Portal"]
PX1["X1 Pretraining at Scale"]
PX2["X2 Multi-Modal Models"]
PX3["X3 RLHF / DPO / RLAIF"]
PX4["X4 Hardware Deep-Dive"]
PX5["X5 Interview Prep"]
P00 --> P01
P01 --> P02
P02 --> P03
P03 --> P04
P04 --> P05
P05 --> P06
P04 --> P07
P06 --> P07
P07 --> P08
P08 --> P09
P09 --> P10
P10 --> P11
P11 --> P12
P11 --> P13
P12 --> P13
P13 --> P14
P13 --> P15
P14 --> P15
P15 --> P16
P10 --> P17
P15 --> P17
P16 --> P17
P12 --> P18
P17 --> P18
P18 --> P19
P19 --> P20
P20 --> P21
P15 --> P22
P21 --> P22
P01 --> P23
P22 --> P23
P23 --> P24
P08 --> P25
P24 --> P25
P02 --> P26
P25 --> P26
P15 --> P27
P22 --> P27
P26 --> P27
P26 --> P28
P27 --> P28
P13 --> P29
P28 --> P29
P21 --> P30
P29 --> P30
P30 --> P31
P29 --> P32
P31 --> P32
P22 --> P33
P32 --> P33
P33 --> P34
P18 --> P35
P34 --> P35
P35 --> P36
P29 --> P37
P31 --> P37
P36 --> P37
P37 --> P38
P38 --> P39
P39 --> P40
P40 --> P41
P35 --> PX1
P15 --> PX2
P17 --> PX2
P28 --> PX2
P05 --> PX3
P19 --> PX3
P28 --> PX3
P23 --> PX4
P35 --> PX4
P40 --> PX5
Phase index¶
| Phase | Chapter | Requires | Unlocks | Teaches |
|---|---|---|---|---|
| 00 | Project Foundations & Learning Methodology | — | 01 | reproducibility seeding lockfile pre-commit ci mypy |
| 01 | Hardware & Computing Substrate | 00 | 02 23 | memory-hierarchy roofline arithmetic-intensity cache latency bandwidth |
| 02 | Numerical Representation | 01 | 03 26 | ieee-754 floating-point softmax-stability log-sum-exp precision denormals |
| 03 | Linear Algebra from First Principles | 02 | 04 | tensors matmul einsum svd rank norms |
| 04 | Calculus & Optimization for AI | 03 | 05 07 | gradients chain-rule backprop sgd momentum adam |
| 05 | Probability & Information Theory | 04 | 06 X3 | probability entropy kl-divergence cross-entropy mle perplexity |
| 06 | Python for AI Engineering | 05 | 07 | numpy strides broadcasting views vectorization profiling |
| 07 | Scalar Autograd from Scratch (minigrad) |
04 06 | 08 | autograd computation-graph reverse-mode topological-sort dag |
| 08 | Tensor Autograd from Scratch | 07 | 09 25 | tensor-autograd broadcasting-backward gradcheck matmul-grad softmax-grad |
| 09 | Tiny MLP & Module Abstraction (minitorch) |
08 | 10 | module-abstraction parameter-registration linear sequential optimizers |
| 10 | Initialization, Normalization, Residuals | 09 | 11 17 | initialization xavier kaiming layer-norm rms-norm residuals pre-ln |
| 11 | Tokenization Theory + BPE Implementation | 10 | 12 13 | tokenization bpe byte-level subwords vocabulary zipf |
| 12 | The Corpus: Designing the Microscopic Dataset | 11 | 13 18 | corpus-design enumeration stratified-split data-manifest reproducibility |
| 13 | Embeddings & Representation Spaces | 11 12 | 14 15 29 | embeddings cbow cosine-similarity dimensionality representation-geometry |
| 14 | Pre-Transformer Sequence Models | 13 | 15 | n-gram rnn lstm vanishing-gradient perplexity-baselines |
| 15 | Attention from Scratch | 13 14 | 16 17 22 27 X2 | attention scaled-dot-product multi-head causal-mask query-key-value |
| 16 | Positional Encodings | 15 | 17 | positional-encoding rope sinusoidal extrapolation |
| 17 | Tiny Transformer Block & Mini-GPT | 10 15 16 | 18 X2 | transformer-block pre-ln ffn gelu tied-embeddings lm-head |
| 18 | Training Loop, Mixed Precision Preview, Checkpointing | 12 17 | 19 35 | training-loop batching adamw warmup cosine-decay checkpointing |
| 19 | Training Dynamics & Debugging | 18 | 20 X3 | instrumentation hooks gradient-norms loss-curves debugging |
| 20 | Evaluation Harness | 19 | 21 | evaluation accuracy-probes calibration adversarial-eval bootstrap-ci |
| 21 | Inference Internals & Sampling | 20 | 22 30 | sampling temperature top-k top-p decode-cost |
| 22 | KV Cache: From Math to Memory | 15 21 | 23 27 33 | kv-cache prefill decode memory-bound arithmetic-intensity |
| 23 | GPU Architecture Fundamentals | 01 22 | 24 X4 | gpu-architecture sm warps occupancy hbm coalescing |
| 24 | CUDA & Triton Hands-On | 23 | 25 | cuda triton kernels shared-memory tiling |
| 25 | PyTorch Internals | 08 24 | 26 | dispatcher autograd-graph torch-compile custom-ops |
| 26 | Quantization Deep Dive | 02 25 | 27 28 | quantization int8 nf4 gptq gguf calibration |
| 27 | Modern Attention Optimizations | 15 22 26 | 28 | flash-attention online-softmax paged-attention gqa mqa |
| 28 | Fine-Tuning, LoRA, QLoRA | 26 27 | 29 X2 X3 | fine-tuning sft lora qlora catastrophic-forgetting |
| 29 | Retrieval-Augmented Generation (RAG) | 13 28 | 30 32 37 | rag chunking bm25 hnsw hybrid-search reranking |
| 30 | Structured Generation & Constrained Decoding | 21 29 | 31 | structured-generation json-mode logit-masking grammar-constrained-decoding |
| 31 | Tool Use & the Model Context Protocol (MCP) | 30 | 32 37 | tool-use function-calling mcp json-schema json-rpc |
| 32 | Agents: Planning, Memory, Sandboxing (Grammar Tutor) | 29 31 | 33 | agents react-loop planning agent-memory sandboxing |
| 33 | Inference Serving: From FastAPI to Continuous Batching | 22 32 | 34 | serving continuous-batching scheduling littles-law load-testing |
| 34 | Observability, Cost & Capacity | 33 | 35 | observability red-metrics opentelemetry prometheus grafana cost-accounting |
| 35 | Distributed Training & Inference | 18 34 | 36 X1 X4 | distributed-training ddp fsdp tensor-parallel allreduce nccl |
| 36 | Frontier Architectures | 35 | 37 | mixture-of-experts mamba state-space-models speculative-decoding |
| 37 | Security & Safety of AI Systems | 29 31 36 | 38 | prompt-injection jailbreaks threat-modeling supply-chain-security red-teaming |
| 38 | Cost, Capacity, Operations, MLOps | 37 | 39 | mlops model-registry ab-testing canary-deploys drift-detection |
| 39 | Capstone: The Miniature Production System | 38 | 40 | integration end-to-end-demo architecture-diagrams cost-reporting |
| 40 | Hardening, Postmortem, "What's Next" | 39 | 41 X5 | postmortem hardening performance-audit synthesis |
| 41 | Learner Portal: Delivering the Curriculum | 40 | — | web-portal argon2id csrf spaced-repetition multi-tenancy |
| X1 | Pretraining at Scale | 35 | — | scaling-laws chinchilla mfu pretraining cluster-economics |
| X2 | Multi-Modal Models | 15 17 28 | — | vit clip whisper contrastive-learning multimodal-fusion |
| X3 | RLHF / DPO / RLAIF | 05 19 28 | — | rlhf reward-modeling ppo dpo constitutional-ai |
| X4 | Hardware Deep-Dive | 23 35 | — | h100 nvlink allreduce interconnects datacenter-economics |
| X5 | Interview Prep | 40 | — | whiteboarding systems-design paper-reading storytelling |