Part I: Foundations

Beginner
Ch. 0
Statistics Supplement
Beginner
Ch. 1
Finding & Reading Data
Beginner
Ch. 2
Data Preprocessing
Beginner
Ch. 3
Resampling Methods
Beginner

Key Concepts

  • Mean, median, mode, variance, standard deviation, hypothesis testing
  • CSV parsing, string-to-float conversion, data I/O
  • Min-max normalization, z-score standardization
  • Train/test split, k-fold cross-validation

Part II: Evaluation Metrics

Beginner
Ch. 4
Evaluating Accuracy
Beginner
Ch. 5
Confusion Matrix
Beginner
Ch. 6
MAE and RMSE
Beginner
Ch. 7
Baseline Models
Beginner

Key Concepts

  • Accuracy, Precision, Recall, F1-Score, ROC curve, AUC
  • Multi-class confusion matrix interpretation
  • Mean Absolute Error, Root Mean Squared Error, R-squared
  • Random prediction baselines, ZeroR algorithm

Part III: Linear Models

Intermediate
Ch. 8
Linear Regression
Intermediate
Ch. 9
Stochastic Gradient Descent
Intermediate
Ch. 10
Logistic Regression
Intermediate

Key Concepts

  • OLS, covariance, correlation, Ridge/Lasso regularization
  • SGD algorithm, learning rate, convergence
  • Sigmoid function, maximum likelihood, binary classification

Part IV: Classic ML Algorithms

Intermediate
Ch. 11
Perceptron
Intermediate
Ch. 12
Decision Trees (CART)
Intermediate
Ch. 13
Naive Bayes
Intermediate
Ch. 14
K-Nearest Neighbors
Intermediate
Ch. 15
Learning Vector Quantization
Intermediate

Part V: Neural Networks & Unsupervised

Ch. 16
ANN & Backpropagation
Intermediate
Ch. 17
K-Means Clustering
Intermediate
Ch. 18
PCA
Intermediate
Ch. 19
Support Vector Machine
Intermediate

Part VI: Ensemble Methods

Bonus
Ensemble Algorithms
Intermediate

Bootstrap, bagging, random forests, boosting concepts - understanding how combining weak learners creates strong predictors.

Part VII: Advanced ML (Industry Track)

Advanced
Ch. 20
Feature Engineering
Advanced
Ch. 21
Hyperparameter Optimization
Advanced
Ch. 22
Gradient Boosting
Advanced
Ch. 23
Time Series Forecasting
Advanced
Ch. 24
Recommender Systems
Advanced
Ch. 25
ML Engineering & MLOps
Advanced

Part VIII: Deep Learning Track

Advanced
Ch. 26
Deep Learning Fundamentals
Advanced
Ch. 27
CNN & Computer Vision
Advanced
Ch. 28
Sequence Models & Transformers
Advanced
Ch. 29
DL Operations
Advanced

Key Concepts

  • Convolution layers, transfer learning, data augmentation
  • Self-attention, transformer blocks, fine-tuning workflow
  • AdamW, learning rate schedules, checkpointing, reproducibility

Part IX: Advanced Algorithms (Expert Track)

Expert
Ch. 30
Random Forest Advanced
Advanced
Ch. 31
AdaBoost Classification
Advanced
Ch. 32
GMM & EM Algorithm
Advanced
Ch. 33
DBSCAN & Hierarchical
Advanced
Ch. 34
ARIMA & Smoothing
Advanced
Ch. 35
Hidden Markov Models
Expert
Ch. 36
Autoencoders from Scratch
Advanced
Ch. 37
Q-Learning (RL)
Advanced

Part X: HuggingFace + Kaggle Master Track

Advanced
Ch. 38
Tokenization & BPE
Advanced
Ch. 39
Attention & Transformer Internals
Expert
Ch. 40
PEFT & LoRA Fine-Tuning
Expert
Ch. 41
RAG & Vector Search
Advanced
Ch. 42
Kaggle EDA Playbook
Advanced
Ch. 43
Feature Engineering Advanced
Expert
Ch. 44
Ensembling & Stacking
Expert
Ch. 45
Model Monitoring & Drift
Advanced

Part XI: Frontier Algorithms Track

Expert
Ch. 46
SARSA & DQN Concepts
Expert
Ch. 47
BM25 & Ranking
Expert
Ch. 48
HNSW-Style ANN Search
Expert
Ch. 49
Bayesian Optimization
Expert
Ch. 50
Causal Uplift Modeling
Expert

Parts XII-XIII: Research-to-Production

Expert
Ch. 51
PPO & Policy Optimization
Expert
Ch. 52
Contrastive Learning (SimCLR)
Expert
Ch. 53
Graph Neural Networks
Expert
Ch. 54
Calibration & Uncertainty
Expert
Ch. 55
A/B Testing & Experiments
Expert
Ch. 56-60
Production Systems
Expert

Distributed training, feature stores, online learning, recommender ranking losses, and causal bandits with Thompson sampling.

Parts XIV-XV: Evaluation & Governance

Ch. 61
Retrieval Eval (MRR, NDCG)
Expert
Ch. 62
Knowledge Distillation
Expert
Ch. 63
Probabilistic Forecasting
Expert
Ch. 64-65
Multi-Objective Ranking & MLOps
Expert
Ch. 66
Advanced Anomaly Detection
Expert
Ch. 67
Survival Analysis
Expert
Ch. 68-70
Graph RecSys & Governance
Expert

Part XVI: Causal & Rollout Intelligence

Ch. 71
Causal Discovery
Expert
Ch. 72
Robust Optimization
Expert
Ch. 73
Synthetic Data Generation
Expert
Ch. 74
Advanced Experiment Design
Expert
Ch. 75
Rollback Orchestration
Expert

Part XVII: Advanced Boosting & Generative Systems

Expert
Ch. 76
XGBoost-Style 2nd Order Boosting
Expert
Ch. 77
CatBoost-Style Encoding
Expert
Ch. 78
HuggingFace-Style Trainer
Expert
Ch. 79
Adversarial Validation
Expert
Ch. 80
Diffusion Models (DDPM)
Expert

Key Concepts

  • Second-order split gain, Hessian weighting, leaf regularization
  • Ordered target encoding, high-cardinality categorical handling
  • AdamW internals, warmup-cosine scheduling, gradient clipping
  • Train-test domain classification, drift feature ranking
  • Forward noising schedule, reverse denoising, sampling diagnostics