Table of Contents
Also see the outline of the entire book as planned, including draft chapters that are not yet completed.
-
Build DeepSeek from Scratch: a comprehensive hands-on guide covering every major DeepSeek innovation, from KV Cache and Multi-Head Latent Attention to Mixture-of-Experts, Multi-Token Prediction, FP8 Quantization, Sparse Attention, Manifold-Constrained Hyper-Connections, Conditional Memory (Engram), and the million-token DeepSeek-V4 architecture.
-
-
Chapter 19. Dissecting DeepSeek-V4
-
Chapter 20. DeepSeek-V4 Beyond the Basics
-
Chapter 21. DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models
-
Chapter 22. Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
-
Chapter 23. mHC: Manifold-Constrained Hyper-Connections
-
Chapter 24. DeepSeek-V4
-
-
References