🧠 ML Math Roadmap: From Fundamentals to Algorithms

A structured, curated roadmap for university students and self-taught developers who want to understand the math behind Machine Learning — not just call .fit() and hope for the best.

Most ML roadmaps hand you a library list and call it a day. This one is different. Every resource here either shows the math running, visualises what the equations mean geometrically, or lets you work through calculations manually — because the only way to truly understand an algorithm is to execute it yourself, by hand, at least once.

This roadmap is structured as a learning path, not a link dump. Work through it top to bottom and you will go from shaky algebra to confidently implementing and explaining ML algorithms from scratch.

🗺 How to Use This Roadmap

This is not a "collect them all" resource list. It is a sequential curriculum. Each stage unlocks the next.

Linear Algebra → Calculus → Probability → Algorithms → Build Things

└───────── Loop back when something breaks ──────────┘

If you are a complete beginner: Start at Stage 1. Do not skip it. The reason most developers hit a wall with ML is not the code — it is that they never built a working mental model of matrix operations and vector spaces.

If you have some background: Use the Table of Contents to jump to your weakest stage. Be honest with yourself about which stage that is.

For exam preparation specifically: Jump directly to the Interactive Intuition Tools section — those resources are built for manually executing algorithms the way a university exam requires.

📐 Stage 1 — Linear Algebra

The language of ML. If you cannot multiply matrices in your head, you cannot reason about what a neural network is doing.

What you need to be able to do before moving on: Perform matrix multiplication by hand. Understand what a dot product measures geometrically. Explain eigenvectors without using the word "eigenvectors." Apply these to understand why PCA works.

Core Resources

3Blue1Brown: Essence of Linear Algebra — The single best starting point for anyone who has "done" linear algebra but never understood it. Each chapter reframes an abstract operation as a geometric transformation. Watch chapters 1–9 before touching any other resource. Free, 15 hours total, no prerequisites.
Immersive Math — Interactive Linear Algebra — A peer-reviewed, fully interactive linear algebra textbook. Every figure is a manipulable simulation. When 3B1B explains the concept, this is where you go to poke at it. Covers dot products, matrix transformations, eigenvalues, SVD.
MIT 18.06 — Gilbert Strang's Linear Algebra — The canonical university course, free on MIT OpenCourseWare. Strang's lectures are unusually clear for a pure mathematics course. Use this alongside 3B1B once you want formal rigour alongside intuition. Problem sets are included.
Khan Academy: Linear Algebra — The safest on-ramp if MIT 18.06 feels too steep initially. Covers vectors, matrix operations, transformations, and inverses in a self-paced, fully free format.
Computational Linear Algebra — fast.ai — A practical, code-first companion to the theoretical material above. Implements the same concepts in Python/NumPy. Use after the theoretical resources, not before.

📉 Stage 2 — Calculus & Optimisation

You do not need a full calculus course. You need derivatives, the chain rule, and gradient descent — understood well enough to derive backpropagation on a whiteboard.

What you need to be able to do before moving on: Compute partial derivatives. Explain what a gradient is and why we move in the opposite direction. Derive the update rule for linear regression using gradient descent from scratch.

Core Resources

3Blue1Brown: Essence of Calculus — Eleven videos. Builds the conceptual foundation for derivatives, integrals, and the chain rule entirely from geometric intuition. If you have only one day to spend on calculus before diving into ML, this is how to spend it.
Khan Academy: Multivariable Calculus — The relevant sections for ML are partial derivatives and gradient vectors. Covers these at exactly the right depth — rigorous enough to apply, accessible enough to not require a mathematics degree.
Andrej Karpathy: micrograd — A 150-line implementation of backpropagation from scratch. The most efficient way to learn how automatic differentiation and the chain rule connect to real neural network training. Reading this code carefully is worth more than a week of calculus lectures.
The Matrix Calculus You Need For Deep Learning — A focused, 30-page paper by Parr and Howard. Covers exactly the matrix calculus operators that appear in backpropagation derivations and nothing else. Bookmark this as a permanent reference.

🎲 Stage 3 — Probability & Statistics

Bayesian thinking is the core of half of ML. The other half is optimisation over probability distributions. This stage cannot be skipped.

What you need to be able to do before moving on: Derive Bayes' theorem from first principles. Explain the difference between a probability and a likelihood. Compute entropy and information gain by hand for a small dataset.

Core Resources

StatQuest with Josh Starmer — The single most-cited resource for probability and statistics in ML. Start with the Probability and Statistics Fundamentals playlist, then the Machine Learning playlist in order.
Seeing Theory — Brown University — An interactive, visual introduction to probability theory. Covers Bayesian inference, probability distributions, and frequentist vs. Bayesian statistics as manipulable simulations.
Think Bayes — Allen Downey — A free, computational introduction to Bayesian statistics using Python. Solves real problems with Bayesian methods, making the abstraction immediately concrete. No paywall.
Information Theory, Inference, and Learning Algorithms — MacKay — The graduate-level reference for entropy, KL divergence, and the information-theoretic foundations of ML algorithms. Chapter 2 alone is worth a full read. Free PDF from the author.

🤖 Stage 4 — Core ML Algorithms

These are the algorithms that appear in every ML course, every technical interview, and every exam. Understanding them mathematically — not just through an API call — is what separates engineers who can debug and extend models from those who cannot.

Coverage: K-Nearest Neighbours, Naïve Bayes, Decision Trees (ID3), Linear & Multiple Regression, K-Means Clustering, Minimax, Alpha-Beta Pruning, Genetic Algorithms.

Conceptual Foundation

Distill.pub — Peer-reviewed ML research with interactive diagrams. The articles on attention and feature visualisation are widely cited in academic literature.
Stanford CS229 — Machine Learning — Andrew Ng's original Stanford ML course. The lecture notes derive every algorithm from mathematical first principles, including full proofs for Naïve Bayes and SVM. Free.
Berkeley CS188 — Artificial Intelligence — The standard reference for Minimax, Alpha-Beta Pruning, and probabilistic reasoning. Past exams are publicly available.
Andrej Karpathy: Neural Networks Zero to Hero — Builds a neural network from an empty Python file, deriving every component from calculus and probability.

📦 Practice Datasets

Small, clean datasets for practicing manual calculations — not the massive real-world sets used for training models.

UCI ML Repository — The definitive source for small, clean classification and regression datasets ideal for working through algorithms by hand (Iris, Car Evaluation, etc.).
Kaggle Datasets — Filter by file size < 1MB to find datasets appropriate for manual practice.

📝 Cheatsheets & References

Stanford CS229 Cheatsheets — Shervine Amidi's legendary summary sheets for the Stanford ML course. Dense, accurate, and used by students worldwide.
aml-cheat-sheet — A visual, printable cheatsheet for Applied Machine Learning covering the most-tested algorithms in university courses.

Contributing

Contributions are welcome and encouraged.

This list has a strict quality bar: a resource must be free, primarily visual or interactive, and genuinely help a student understand how an algorithm works — not just that it works.

Please read CONTRIBUTING.md before opening a PR. Check that your suggestion is not already listed or pending in open issues.

_{Maintained by the open-source community.
Not affiliated with any of the listed projects.
Inspired by the Awesome project.}

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 ML Math Roadmap: From Fundamentals to Algorithms

📋 Table of Contents

🗺 How to Use This Roadmap

📐 Stage 1 — Linear Algebra

Core Resources

📉 Stage 2 — Calculus & Optimisation

Core Resources

🎲 Stage 3 — Probability & Statistics

Core Resources

🤖 Stage 4 — Core ML Algorithms

Conceptual Foundation

📦 Practice Datasets

📝 Cheatsheets & References

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

🧠 ML Math Roadmap: From Fundamentals to Algorithms

📋 Table of Contents

🗺 How to Use This Roadmap

📐 Stage 1 — Linear Algebra

Core Resources

📉 Stage 2 — Calculus & Optimisation

Core Resources

🎲 Stage 3 — Probability & Statistics

Core Resources

🤖 Stage 4 — Core ML Algorithms

Conceptual Foundation

📦 Practice Datasets

📝 Cheatsheets & References

Contributing

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages