Recommended Reading List (Textbooks)

Many people have asked me in person about pointers to good books for ramp-up getting into the field. I’ve casually passed around lists quite often, but I thought I’d share it here.

Below I will begin compiling a list of books (though some may simply be manuscripts from professors) that are well known, read, and/or cited for Ph.D. students to grip the noteworthy theories and practices. I will update this list frequently so please feel free to come back often. (Here I’ve temporarily stripped the correct citation format to be edited later.)

“Birds-eye-view” textbooks:

Pattern Recognition and Machine Learning. Christopher Bishop.

Machine Learning: A Probabilistic Perspective. Kevin P. Murphy.

Advanced Data Analysis from an Elementary Point of View. Cosma Rohilla Shalizi.

Subject-focused textbooks / manuscripts:

Graphical Models

Graphical Models, Exponential Families, and Variational Inference. Martin J. Wainwright, Michael I. Jordan.

An Introduction to Conditional Random Fields. Charles Sutton, Andrew McCallum.

Discrete Models

Categorical Data Analysis. Alan Agresti.


Introductory Lectures on Convex Optimization. Yurii Nesterov.

Convex Optimization. Stephen Boyd, Lieven Vandenberghe.

Deep Learning

Deep Learning. Ian Goodfellow, Yoshua Bengio, Aaron Courville.

Probability Theory / Measure Theory

Introduction to Probability Models. Sheldon M. Ross.

Measure Theory and Fine Properties of Functions. Lawrence Craig Evans, Ronald F. Gariepy

Probability Essentials. Jean Jacod, Philip Protter.

Probabilistic Symmetries and Invariance Principles. Olav Kallenberg.

Stochastic Process / Stochastic Differential Equations

Poisson Processes. J. F. C. Kingman.

Stochastic Methods. Crispin Gardiner.

An Introduction to Stochastic Differential Equations. Lawrence Craig Evans.

Stochastic Differential Equations: An Introduction with Applications.Bernt Øksendal.

Determinantal point processes for machine learning. Alex Kulesza, Ben Taskar.

Gaussian Processes for Machine Learning. Carl Edward Rasmussen, Christopher K. I. Williams.

Optimal Transport

Computational Optimal Transport. Gabriel Peyré, Marco Cuturi.

Linear Algebra


Real Analysis


Complex Analysis


Functional Analysis

Ordinary / Partial Differential Equations

Partial Differential Equations. Lawrence Craig Evans.

Differential Geometry

A Comprehensive Introduction to Differential Geometry. Volume One, Two and Three. Michael Spivak.

Statistical Inference

Statistical Inference. George Casella, Roger L. Berger.

Testing Statistical Hypotheses. Erich L. Lehmann, Joseph P. Romano.

Semiparametric Theory and Missing Data. Anastasios A. Tsiatis.

Computer Age Statistical Inference: Algorithms, Evidence and Data Science. Bradley Efron, Trevor Hastie.

The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Trevor Hastie, Robert Tibshirani, Jerome Friedman.

Bayesian Statistics

Bayesian Data Analysis. Andrew Gelman, John B. Carlin, Hal S. Stern, David B. Dunson, Aki Vehtari, Donald B. Rubin.

Bayesian Approximate Inference

Handbook of Markov Chain Monte Carlo. Steve Brooks, Andrew Gelman, Galin L. Jones, Xiao-Li Meng.

Reinforcement Learning

Reinforcement Learning: An Introduction. Richard S. Sutton, Andrew G. Barto.

Causal Inference

Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. Guido W. Imbens, Donald B. Rubin.

Causality: Models, Reasoning and Inference. Judea Pearl.

Counterfactuals and Causal Inference: Methods and Principles for Social Research. Christopher Morgan, Stephen Winship.

Causal Inference. Miguel A. Hernán, James M. Robins.

Elements of Causal Inference: Foundations and Learning Algorithms. Jonas Peters, Dominik Janzing, Bernhard Schölkopf.

Information Retrieval

Information Retrieval. Christopher Manning, Prabhakar Raghavan, Hinrich Schütze.

Data Mining



Mostly Harmless Econometrics: An Empiricist’s Companion. Joshua D. Angrist, Jörn-Steffen Pischke.

Mathematical & Computational Finance


Quantum Physics / Chemistry


Algebraic & Computational Game Theory