A structured, curated roadmap for university students and self-taught developers who want to understand the math behind Machine Learning β not just call
.fit()and hope for the best.
Most ML roadmaps hand you a library list and call it a day. This one is different. Every resource here either shows the math running, visualises what the equations mean geometrically, or lets you work through calculations manually β because the only way to truly understand an algorithm is to execute it yourself, by hand, at least once.
This roadmap is structured as a learning path, not a link dump. Work through it top to bottom and you will go from shaky algebra to confidently implementing and explaining ML algorithms from scratch.
- How to Use This Roadmap
- Stage 1 β Linear Algebra
- Stage 2 β Calculus & Optimisation
- Stage 3 β Probability & Statistics
- Stage 4 β Core ML Algorithms
- Interactive Intuition Tools
- Visual Algorithm Explainers
- Practice Datasets
- Cheatsheets & Quick References
- Contributing
This is not a "collect them all" resource list. It is a sequential curriculum. Each stage unlocks the next.
Linear Algebra β Calculus β Probability β Algorithms β Build Things
ββββββββββ Loop back when something breaks βββββββββββ
If you are a complete beginner: Start at Stage 1. Do not skip it. The reason most developers hit a wall with ML is not the code β it is that they never built a working mental model of matrix operations and vector spaces.
If you have some background: Use the Table of Contents to jump to your weakest stage. Be honest with yourself about which stage that is.
For exam preparation specifically: Jump directly to the Interactive Intuition Tools section β those resources are built for manually executing algorithms the way a university exam requires.
The language of ML. If you cannot multiply matrices in your head, you cannot reason about what a neural network is doing.
What you need to be able to do before moving on: Perform matrix multiplication by hand. Understand what a dot product measures geometrically. Explain eigenvectors without using the word "eigenvectors." Apply these to understand why PCA works.
-
3Blue1Brown: Essence of Linear Algebra β The single best starting point for anyone who has "done" linear algebra but never understood it. Each chapter reframes an abstract operation as a geometric transformation. Watch chapters 1β9 before touching any other resource. Free, 15 hours total, no prerequisites.
-
Immersive Math β Interactive Linear Algebra β A peer-reviewed, fully interactive linear algebra textbook. Every figure is a manipulable simulation. When 3B1B explains the concept, this is where you go to poke at it. Covers dot products, matrix transformations, eigenvalues, SVD.
-
MIT 18.06 β Gilbert Strang's Linear Algebra β The canonical university course, free on MIT OpenCourseWare. Strang's lectures are unusually clear for a pure mathematics course. Use this alongside 3B1B once you want formal rigour alongside intuition. Problem sets are included.
-
Khan Academy: Linear Algebra β The safest on-ramp if MIT 18.06 feels too steep initially. Covers vectors, matrix operations, transformations, and inverses in a self-paced, fully free format.
-
Computational Linear Algebra β fast.ai β A practical, code-first companion to the theoretical material above. Implements the same concepts in Python/NumPy. Use after the theoretical resources, not before.
You do not need a full calculus course. You need derivatives, the chain rule, and gradient descent β understood well enough to derive backpropagation on a whiteboard.
What you need to be able to do before moving on: Compute partial derivatives. Explain what a gradient is and why we move in the opposite direction. Derive the update rule for linear regression using gradient descent from scratch.
-
3Blue1Brown: Essence of Calculus β Eleven videos. Builds the conceptual foundation for derivatives, integrals, and the chain rule entirely from geometric intuition. If you have only one day to spend on calculus before diving into ML, this is how to spend it.
-
Khan Academy: Multivariable Calculus β The relevant sections for ML are partial derivatives and gradient vectors. Covers these at exactly the right depth β rigorous enough to apply, accessible enough to not require a mathematics degree.
-
Andrej Karpathy: micrograd β A 150-line implementation of backpropagation from scratch. The most efficient way to learn how automatic differentiation and the chain rule connect to real neural network training. Reading this code carefully is worth more than a week of calculus lectures.
-
The Matrix Calculus You Need For Deep Learning β A focused, 30-page paper by Parr and Howard. Covers exactly the matrix calculus operators that appear in backpropagation derivations and nothing else. Bookmark this as a permanent reference.
Bayesian thinking is the core of half of ML. The other half is optimisation over probability distributions. This stage cannot be skipped.
What you need to be able to do before moving on: Derive Bayes' theorem from first principles. Explain the difference between a probability and a likelihood. Compute entropy and information gain by hand for a small dataset.
-
StatQuest with Josh Starmer β The single most-cited resource for probability and statistics in ML. Start with the Probability and Statistics Fundamentals playlist, then the Machine Learning playlist in order.
-
Seeing Theory β Brown University β An interactive, visual introduction to probability theory. Covers Bayesian inference, probability distributions, and frequentist vs. Bayesian statistics as manipulable simulations.
-
Think Bayes β Allen Downey β A free, computational introduction to Bayesian statistics using Python. Solves real problems with Bayesian methods, making the abstraction immediately concrete. No paywall.
-
Information Theory, Inference, and Learning Algorithms β MacKay β The graduate-level reference for entropy, KL divergence, and the information-theoretic foundations of ML algorithms. Chapter 2 alone is worth a full read. Free PDF from the author.
These are the algorithms that appear in every ML course, every technical interview, and every exam. Understanding them mathematically β not just through an API call β is what separates engineers who can debug and extend models from those who cannot.
Coverage: K-Nearest Neighbours, NaΓ―ve Bayes, Decision Trees (ID3), Linear & Multiple Regression, K-Means Clustering, Minimax, Alpha-Beta Pruning, Genetic Algorithms.
-
Distill.pub β Peer-reviewed ML research with interactive diagrams. The articles on attention and feature visualisation are widely cited in academic literature.
-
Stanford CS229 β Machine Learning β Andrew Ng's original Stanford ML course. The lecture notes derive every algorithm from mathematical first principles, including full proofs for NaΓ―ve Bayes and SVM. Free.
-
Berkeley CS188 β Artificial Intelligence β The standard reference for Minimax, Alpha-Beta Pruning, and probabilistic reasoning. Past exams are publicly available.
-
Andrej Karpathy: Neural Networks Zero to Hero β Builds a neural network from an empty Python file, deriving every component from calculus and probability.
Small, clean datasets for practicing manual calculations β not the massive real-world sets used for training models.
-
UCI ML Repository β The definitive source for small, clean classification and regression datasets ideal for working through algorithms by hand (Iris, Car Evaluation, etc.).
-
Kaggle Datasets β Filter by file size < 1MB to find datasets appropriate for manual practice.
-
Stanford CS229 Cheatsheets β Shervine Amidi's legendary summary sheets for the Stanford ML course. Dense, accurate, and used by students worldwide.
-
aml-cheat-sheet β A visual, printable cheatsheet for Applied Machine Learning covering the most-tested algorithms in university courses.
Contributions are welcome and encouraged.
This list has a strict quality bar: a resource must be free, primarily visual or interactive, and genuinely help a student understand how an algorithm works β not just that it works.
Please read CONTRIBUTING.md before opening a PR. Check that your suggestion is not already listed or pending in open issues.
Maintained by the open-source community. Not affiliated with any of the listed projects. Inspired by the Awesome project.