Learning Curriculum - AI Algo Hub

Part I: Foundations

Beginner

Ch. 0

Statistics Supplement

Beginner

Ch. 1

Finding & Reading Data

Beginner

Ch. 2

Data Preprocessing

Beginner

Ch. 3

Resampling Methods

Beginner

Key Concepts

Mean, median, mode, variance, standard deviation, hypothesis testing
CSV parsing, string-to-float conversion, data I/O
Min-max normalization, z-score standardization
Train/test split, k-fold cross-validation

Part II: Evaluation Metrics

Beginner

Ch. 4

Evaluating Accuracy

Beginner

Ch. 5

Confusion Matrix

Beginner

Ch. 6

MAE and RMSE

Beginner

Ch. 7

Baseline Models

Beginner

Key Concepts

Accuracy, Precision, Recall, F1-Score, ROC curve, AUC
Multi-class confusion matrix interpretation
Mean Absolute Error, Root Mean Squared Error, R-squared
Random prediction baselines, ZeroR algorithm

Part III: Linear Models

Intermediate

Ch. 8

Linear Regression

Intermediate

Ch. 9

Stochastic Gradient Descent

Intermediate

Ch. 10

Logistic Regression

Intermediate

Key Concepts

OLS, covariance, correlation, Ridge/Lasso regularization
SGD algorithm, learning rate, convergence
Sigmoid function, maximum likelihood, binary classification

Part IV: Classic ML Algorithms

Intermediate

Ch. 11

Perceptron

Intermediate

Ch. 12

Decision Trees (CART)

Intermediate

Ch. 13

Naive Bayes

Intermediate

Ch. 14

K-Nearest Neighbors

Intermediate

Ch. 15

Learning Vector Quantization

Intermediate

Part V: Neural Networks & Unsupervised

Ch. 16

ANN & Backpropagation

Intermediate

Ch. 17

K-Means Clustering

Intermediate

Ch. 18

PCA

Intermediate

Ch. 19

Support Vector Machine

Intermediate

Part VI: Ensemble Methods

Bonus

Ensemble Algorithms

Intermediate

Bootstrap, bagging, random forests, boosting concepts - understanding how combining weak learners creates strong predictors.

Part VII: Advanced ML (Industry Track)

Advanced

Ch. 20

Feature Engineering

Advanced

Ch. 21

Hyperparameter Optimization

Advanced

Ch. 22

Gradient Boosting

Advanced

Ch. 23

Time Series Forecasting

Advanced

Ch. 24

Recommender Systems

Advanced

Ch. 25

ML Engineering & MLOps

Advanced

Part VIII: Deep Learning Track

Advanced

Ch. 26

Deep Learning Fundamentals

Advanced

Ch. 27

CNN & Computer Vision

Advanced

Ch. 28

Sequence Models & Transformers

Advanced

Ch. 29

DL Operations

Advanced

Key Concepts

Convolution layers, transfer learning, data augmentation
Self-attention, transformer blocks, fine-tuning workflow
AdamW, learning rate schedules, checkpointing, reproducibility

Part IX: Advanced Algorithms (Expert Track)

Expert

Ch. 30

Random Forest Advanced

Advanced

Ch. 31

AdaBoost Classification

Advanced

Ch. 32

GMM & EM Algorithm

Advanced

Ch. 33

DBSCAN & Hierarchical

Advanced

Ch. 34

ARIMA & Smoothing

Advanced

Ch. 35

Hidden Markov Models

Expert

Ch. 36

Autoencoders from Scratch

Advanced

Ch. 37

Q-Learning (RL)

Advanced

Part X: HuggingFace + Kaggle Master Track

Advanced

Ch. 38

Tokenization & BPE

Advanced

Ch. 39

Attention & Transformer Internals

Expert

Ch. 40

PEFT & LoRA Fine-Tuning

Expert

Ch. 41

RAG & Vector Search

Advanced

Ch. 42

Kaggle EDA Playbook

Advanced

Ch. 43

Feature Engineering Advanced

Expert

Ch. 44

Ensembling & Stacking

Expert

Ch. 45

Model Monitoring & Drift

Advanced

Part XI: Frontier Algorithms Track

Expert

Ch. 46

SARSA & DQN Concepts

Expert

Ch. 47

BM25 & Ranking

Expert

Ch. 48

HNSW-Style ANN Search

Expert

Ch. 49

Bayesian Optimization

Expert

Ch. 50

Causal Uplift Modeling

Expert

Parts XII-XIII: Research-to-Production

Expert

Ch. 51

PPO & Policy Optimization

Expert

Ch. 52

Contrastive Learning (SimCLR)

Expert

Ch. 53

Graph Neural Networks

Expert

Ch. 54

Calibration & Uncertainty

Expert

Ch. 55

A/B Testing & Experiments

Expert

Ch. 56-60

Production Systems

Expert

Distributed training, feature stores, online learning, recommender ranking losses, and causal bandits with Thompson sampling.

Parts XIV-XV: Evaluation & Governance

Ch. 61

Retrieval Eval (MRR, NDCG)

Expert

Ch. 62

Knowledge Distillation

Expert

Ch. 63

Probabilistic Forecasting

Expert

Ch. 64-65

Multi-Objective Ranking & MLOps

Expert

Ch. 66

Advanced Anomaly Detection

Expert

Ch. 67

Survival Analysis

Expert

Ch. 68-70

Graph RecSys & Governance

Expert

Part XVI: Causal & Rollout Intelligence

Ch. 71

Causal Discovery

Expert

Ch. 72

Robust Optimization

Expert

Ch. 73

Synthetic Data Generation

Expert

Ch. 74

Advanced Experiment Design

Expert

Ch. 75

Rollback Orchestration

Expert

Part XVII: Advanced Boosting & Generative Systems

Expert

Ch. 76

XGBoost-Style 2nd Order Boosting

Expert

Ch. 77

CatBoost-Style Encoding

Expert

Ch. 78

HuggingFace-Style Trainer

Expert

Ch. 79

Adversarial Validation

Expert

Ch. 80

Diffusion Models (DDPM)

Expert

Key Concepts

Second-order split gain, Hessian weighting, leaf regularization
Ordered target encoding, high-cardinality categorical handling
AdamW internals, warmup-cosine scheduling, gradient clipping
Train-test domain classification, drift feature ranking
Forward noising schedule, reverse denoising, sampling diagnostics