AI / ML Theory Roadmap
roadmap, machine learning theory, learning theory, optimization, high-dimensional statistics
1 Purpose
This roadmap is for readers who do not just want to use ML tools, but want to read theory-facing papers, understand assumptions, and know why the main guarantees look the way they do.
It is not a universal ML curriculum. It is a dependency-aware path through the mathematics that most often supports ML theory.
2 Who This Is For
Use this roadmap if your goal is any of the following:
- read papers with proofs, bounds, asymptotics, or concentration arguments
- understand why optimization and generalization claims are stated the way they are
- move toward learning theory, high-dimensional statistics, or deep learning theory
- connect the current site’s math modules to ML research directions
If your goal is only to get a working model pipeline quickly, this is probably too theory-heavy as a first route.
3 Main Sequence
Use this as the default order.
- Proofs
- Logic
- Linear Algebra
- Probability
- Statistics
- Single-Variable Calculus
- Multivariable Calculus
- Optimization
- Real Analysis
- Learning Theory
- Matrix Analysis
- High-Dimensional Probability
- High-Dimensional Statistics
All thirteen stages are now live on the site, from Proofs through High-Dimensional Statistics, with Matrix Analysis and High-Dimensional Probability feeding directly into a complete first-pass High-Dimensional Statistics module. The site also has a full Numerical Methods module for the computation side of that stack, which is enough to begin reading the cleaner end of ML-facing theory pages.
4 Why This Order Works
4.1 Proofs and Logic First
These pages train the habits that later theory papers assume without apology:
- parse assumptions carefully
- expose hidden quantifiers
- translate between prose and symbolic structure
- negate a statement correctly before trying to prove or refute it (a worked example follows this list)
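As a worked example of the last two habits (standard material, not tied to any one page), here is pointwise convergence of a sequence \(f_n\) to \(f\) at a point \(x\), and the negation obtained by flipping each quantifier and negating the inner inequality:
\[
\text{convergence: } \forall \varepsilon > 0\ \exists N\ \forall n \ge N:\ |f_n(x) - f(x)| < \varepsilon
\]
\[
\text{negation: } \exists \varepsilon > 0\ \forall N\ \exists n \ge N:\ |f_n(x) - f(x)| \ge \varepsilon
\]
Nothing here is clever; the point is that the negation is computed, not guessed.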
4.2 Linear Algebra Before Most ML
ML is full of vectors, projections, low-rank structure, eigenmodes, and learned linear maps.
Without linear algebra, model descriptions become memorized recipes. With it, many architectures reduce to variations on a small number of reusable objects.
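As a minimal sketch of that claim, here is an illustrative NumPy fragment (not code from any site module) showing that projecting onto the top principal directions and forming the best rank-\(k\) approximation are the same object, one SVD away:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.normal(size=(100, 20))   # data matrix: 100 samples, 20 features

# Thin SVD: A = U @ diag(s) @ Vt, singular values sorted descending.
U, s, Vt = np.linalg.svd(A, full_matrices=False)

# Best rank-k approximation: keep the k largest singular values.
k = 5
A_k = U[:, :k] * s[:k] @ Vt[:k, :]

# Orthogonal projector onto the span of the top-k right singular vectors.
P = Vt[:k, :].T @ Vt[:k, :]

# Projecting every row of A gives exactly the rank-k approximation:
# low-rank approximation IS a projection.
assert np.allclose(A @ P, A_k)
```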
4.3 Probability Before Statistics
Theory-facing ML uses probability to talk about randomness, sampling, concentration, conditioning, and asymptotics.
Statistics then turns that language into estimators, validation, uncertainty, and generalization-facing ideas.
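One canonical piece of that shared language, quoted here as a fixed reference point, is Hoeffding's inequality for i.i.d. variables bounded in \([a, b]\):
\[
\Pr\left( \left| \frac{1}{n}\sum_{i=1}^{n} X_i - \mathbb{E}[X_1] \right| \ge t \right) \le 2\exp\left( -\frac{2nt^2}{(b-a)^2} \right) \quad \text{for every } t > 0.
\]
Most concentration arguments in theory-facing papers elaborate on this shape: an empirical average, a deviation level, and an exponentially small failure probability.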
5 Branch Points
After the shared core, most readers should branch instead of forcing one long linear path.
5.1 Optimization Branch
Use this branch if you care about:
- gradient descent and SGD (a minimal sketch follows this list)
- objective design
- regularization
- training dynamics
- convex and nonconvex viewpoints
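Here is a minimal sketch of the first item, assuming nothing beyond NumPy; the step sizes and iteration counts are illustrative choices, not tuned recommendations:

```python
import numpy as np

rng = np.random.default_rng(0)
n, d = 200, 5
X = rng.normal(size=(n, d))
w_true = rng.normal(size=d)
y = X @ w_true + 0.1 * rng.normal(size=n)

def grad(w, Xb, yb):
    # Gradient of the mean squared error 0.5 * mean((Xb @ w - yb)**2).
    return Xb.T @ (Xb @ w - yb) / len(yb)

# Full-batch gradient descent: one exact gradient per step.
w_gd = np.zeros(d)
for _ in range(500):
    w_gd -= 0.1 * grad(w_gd, X, y)

# SGD: one noisy gradient per step, from a single random example.
w_sgd = np.zeros(d)
for _ in range(5000):
    i = rng.integers(n)
    w_sgd -= 0.01 * grad(w_sgd, X[i:i + 1], y[i:i + 1])

print(np.linalg.norm(w_gd - w_true), np.linalg.norm(w_sgd - w_true))
```

The convex/nonconvex distinction in the last item is about when loops like these provably reach a global minimizer, as they do on this convex least-squares objective.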
5.2 Learning Theory Branch
Use this branch if you care about:
- generalization guarantees
- bias-variance structure
- stability and capacity ideas
- VC / Rademacher style reasoning (a representative bound follows this list)
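For orientation, here is the shape of a standard Rademacher-complexity bound for a loss class \(\mathcal{H}\) with values in \([0,1]\) and an i.i.d. sample of size \(n\); exact constants vary across textbooks:
\[
\text{with probability at least } 1-\delta:\qquad R(h) \;\le\; \widehat{R}_n(h) + 2\,\mathfrak{R}_n(\mathcal{H}) + \sqrt{\frac{\ln(1/\delta)}{2n}} \quad \text{for all } h \in \mathcal{H},
\]
where \(R\) is the population risk, \(\widehat{R}_n\) the empirical risk, and \(\mathfrak{R}_n(\mathcal{H})\) the Rademacher complexity of the class.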
5.3 High-Dimensional Statistics Branch
Use this branch if you care about:
- many-features regimes
- overparameterization
- sparsity and shrinkage
- inference and estimation when \(p\) is large relative to \(n\) (a small sketch follows this list)
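A tiny sketch of the last item, assuming only NumPy: when \(p > n\), the least-squares normal equations are rank-deficient, and ridge shrinkage is one standard way to restore a unique, stable estimate.

```python
import numpy as np

rng = np.random.default_rng(0)
n, p = 50, 200                       # many more features than samples
X = rng.normal(size=(n, p))
beta = np.zeros(p)
beta[:5] = 1.0                       # sparse ground truth
y = X @ beta + 0.1 * rng.normal(size=n)

# X.T @ X is p x p but has rank at most n < p, so ordinary least
# squares has infinitely many interpolating solutions.
print(np.linalg.matrix_rank(X.T @ X))        # 50, not 200

# Ridge: adding lam * I makes the system positive definite and
# shrinks coefficients toward zero.
lam = 1.0
beta_ridge = np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)
print(np.linalg.norm(beta_ridge - beta))     # estimation error
```

Replacing the squared penalty with an \(\ell_1\) penalty gives the lasso, which adds the sparsity behavior in the third item.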
5.4 Deep Learning Theory Branch
Use this branch if you care about:
- backpropagation and computation graphs (a hand-worked sketch follows this list)
- implicit bias of optimization
- representation learning
- scaling and overparameterized training
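A minimal hand-worked sketch of the first item, assuming only NumPy: a two-node computation graph (linear map, then squared loss) differentiated in reverse order of the forward pass, with a finite-difference check.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=3)        # input
W = rng.normal(size=(2, 3))   # parameters of a linear layer
y = rng.normal(size=2)        # target

# Forward pass: record one intermediate per graph node.
z = W @ x                            # node 1: linear map
loss = 0.5 * np.sum((z - y) ** 2)    # node 2: squared loss

# Backward pass: chain rule, visiting nodes in reverse.
dz = z - y            # d loss / d z
dW = np.outer(dz, x)  # d loss / d W, since z = W @ x
dx = W.T @ dz         # d loss / d x

# Finite-difference check on one parameter entry.
eps = 1e-6
W2 = W.copy()
W2[0, 0] += eps
loss2 = 0.5 * np.sum((W2 @ x - y) ** 2)
assert abs((loss2 - loss) / eps - dW[0, 0]) < 1e-4
```

Everything a framework's autodiff does is this pattern applied node by node over a larger graph.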
Useful bridge pages for this branch are:
- Regularization, Implicit Bias, and Model Complexity
- Attention, Softmax, and Weighted Mixtures
- Learned Linear Projections in Transformers
- Representation Learning and Geometry of Embeddings
- Linear Probes and Representation Diagnostics
5.5 Graph and Structured Learning Branch
Use this branch if you care about:
- graph diffusion and neighborhood averaging (a one-step sketch follows this list)
- message-passing neural networks
- molecular, relational, or recommendation graphs
- spectral versus spatial graph design choices
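A one-step sketch of the first item, assuming only NumPy: symmetric-normalized neighborhood averaging, the linear core of many message-passing layers.

```python
import numpy as np

# Small undirected path-like graph on 4 nodes, as an adjacency
# matrix with self-loops so each node also keeps its own features.
A = np.array([[1, 1, 0, 0],
              [1, 1, 1, 0],
              [0, 1, 1, 1],
              [0, 0, 1, 1]], dtype=float)

deg = A.sum(axis=1)
D_inv_sqrt = np.diag(deg ** -0.5)
A_hat = D_inv_sqrt @ A @ D_inv_sqrt   # symmetric normalization

X = np.eye(4)      # one indicator feature per node, for illustration
H = A_hat @ X      # one propagation step: each node mixes its neighborhood
print(H)           # stacking many such steps is what drives oversmoothing
```

The spectral-versus-spatial question in the last item is largely about whether you reason about \(\hat{A}\) through its eigenvalues or through this local averaging picture.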
Useful bridge pages for this branch are:
- Graph Diffusion and Message Passing
- Oversmoothing, Depth, and Graph Sampling
- Graph Rewiring, Homophily, and Heterophily
- Long-Range Dependence and Oversquashing in Graphs
5.6 Kernel and Bayesian Nonparametrics Branch
Use this branch if you care about:
- similarity-based nonlinear prediction
- kernel ridge regression (a minimal sketch follows this list)
- Gaussian-process regression
- uncertainty-aware function prediction
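A minimal sketch of kernel ridge regression, assuming only NumPy; the RBF bandwidth and regularizer here are illustrative, not recommended defaults.

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.sort(rng.uniform(-3, 3, size=40))[:, None]   # 1-D training inputs
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=40)

def rbf(A, B, length=0.5):
    # Gaussian (RBF) kernel matrix between the rows of A and of B.
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / (2 * length ** 2))

lam = 0.1
K = rbf(X, X)
alpha = np.linalg.solve(K + lam * np.eye(len(X)), y)  # dual coefficients

X_test = np.linspace(-3, 3, 5)[:, None]
y_pred = rbf(X_test, X) @ alpha   # predict by weighted similarity to training points
print(y_pred)
```

Read with \(\lambda\) as a noise variance, the same linear system gives the Gaussian-process posterior mean, which is why these two topics share a branch.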
Useful bridge pages for this branch are:
- Kernel Methods and Similarity Geometry
- Kernel Ridge and Gaussian-Process Intuition
- Bayesian Optimization and Surrogate Modeling
- Uncertainty Calibration and Predictive Confidence
5.7 Generative Modeling Branch
Use this branch if you care about:
- denoising as a learning objective
- iterative sample generation
- diffusion probabilistic models (the standard training objective is sketched after this list)
- score-based or stochastic-process views of generation
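For orientation, here is the standard DDPM-style pairing of a forward noising step with a denoising training objective, in the notation most papers share (individual pages may present it differently):
\[
x_t = \sqrt{\bar{\alpha}_t}\, x_0 + \sqrt{1 - \bar{\alpha}_t}\, \varepsilon, \qquad \varepsilon \sim \mathcal{N}(0, I),
\]
\[
\mathcal{L}(\theta) = \mathbb{E}_{t,\, x_0,\, \varepsilon}\left\| \varepsilon - \varepsilon_\theta(x_t, t) \right\|^2 .
\]
Training the denoiser \(\varepsilon_\theta\) this way is, up to weighting, learning the score of the noised data distribution, which is the bridge to the score-based and stochastic-process views in the last item.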
Useful bridge pages for this branch are:
- Diffusion Models and Denoising
- Score Matching and the SDE View of Diffusion
- Flow Matching and Transport Views of Generation
6 Pages On The Site That Already Support This Roadmap
6.1 Strongest Current Math Support
The full main sequence above is live, from Proofs through High-Dimensional Statistics, along with Numerical Methods and Control and Dynamics.
6.2 Strongest Current ML Bridge Pages
- Machine Learning applications hub
- Regularization, Implicit Bias, and Model Complexity
- Attention, Softmax, and Weighted Mixtures
- Kernel Methods and Similarity Geometry
- Kernel Ridge and Gaussian-Process Intuition
- Bayesian Optimization and Surrogate Modeling
- Uncertainty Calibration and Predictive Confidence
- Graph Diffusion and Message Passing
- Oversmoothing, Depth, and Graph Sampling
- Graph Rewiring, Homophily, and Heterophily
- Long-Range Dependence and Oversquashing in Graphs
- Representation Learning and Geometry of Embeddings
- Linear Probes and Representation Diagnostics
- Diffusion Models and Denoising
- Score Matching and the SDE View of Diffusion
- Flow Matching and Transport Views of Generation
- In-Context Learning and Linearization
- Linear Regression Through Projection
- PCA Through SVD
- Learned Linear Projections in Transformers
- Vector Mixtures in Embeddings and Attention
7 Paper Reading Overlay
Do not wait until the entire roadmap is complete before touching papers.
Run a light paper-reading overlay in parallel:
- How to Read a Paper
- one ML-facing application page
- one matching paper-lab page
A good current sequence is exactly that order: How to Read a Paper first, then one ML-facing application page, then its matching paper-lab page.
8 Common Next Theory Directions
With the current foundations, the calculus bridge, the optimization path, the learning-theory spine, Matrix Analysis, High-Dimensional Probability, High-Dimensional Statistics, Numerical Methods, and Control and Dynamics all in place, a strong adjacent theory module now live is:
9 Sources and Further Reading
- CS229: Machine Learning. First pass: official current ML course hub with a broad math-aware overview of the field. Checked 2026-04-24.
- CS 189 Syllabus. First pass: official Berkeley syllabus showing a modern intro-ML course with clear prerequisite expectations. Checked 2026-04-24.
- EE364a: Convex Optimization I. Second pass: official optimization course that naturally supports the next major branch in the roadmap. Checked 2026-04-24.
- Mathematics for Machine Learning. Second pass: strong math bridge for readers moving from foundations toward ML language. Checked 2026-04-24.
- CS229T / Statistical Learning Theory. Paper bridge: a compact entry into theory-heavy ML beyond introductory courses. Checked 2026-04-24.