Some good readings

Each comes with a brief overview, written by me.

Foundations
Research Papers
- Artificial Intelligence
- Distributed Systems

Also take a look at the MIT Probabilistic Computing Project's Reading List

Foundations

Mathematics

Introduction to Mathematical Logic: by Alonzo Church
Set Theory and Its Logic: by Willard Van Orman Quine
The Calculi of Lambda-Conversion: by Alonzo Church
Introduction to Probability: by Dimitri P. Bertsekas & John N. Tsitsiklis
Markov Chains and Mixing Times [pdf]: by David A. Levin & Yuval Peres

Computation

Introduction to the Theory of Computation: by Michael Sipser
Introduction to Algorithms: by Thomas H. Cormen, Charles E. Leiserson, Ronald L. Rivest, & Clifford Stein
Structure and Interpretation of Computer Programs: by Harold Abelson, Gerald Jay Sussman, & Julie Sussman
Principles of Computer System Design: by Jerome H. Saltzer & M. Frans Kaashoek
Artificial Intelligence: A Modern Approach: by Stuart Russell & Peter Norvig
The Elements of Statistical Learning: Data Mining, Inference, and Prediction: by Trevor Hastie, Robert Tibshirani, & Jerome Friedman
Machine Learning: A Probabilistic Perspective: by Kevin P. Murphy
Reinforcement Learning: An Introduction [pdf]: by Richard S. Sutton & Andrew G. Barto
Deep Learning in Neural Networks: An Overview [arXiv]: by Jürgen Schmidhuber

Natural Intelligence

The Society of Mind: by Marvin Minsky
Gödel, Escher, Bach: An Eternal Golden Braid: by Douglas R. Hofstadter
The Origin of Concepts: by Susan Carey

Research Papers

Artificial Intelligence

Combining Q-Learning and Search with Amortized Value Estimates [arXiv]: (2019) Hamrick, J.B., Bapst, V., Sanchez-Gonzalez, A., Pfaff, T., Weber, T., Buesing, L., & Battaglia, P.W.
Write, Execute, Assess: Program Synthesis with a REPL [arXiv]: (2019) Ellis, K., Nye, M.I., Pu, Y., Sosa, F., Tenenbaum, J.B., & Solar-Lezama, A.
Synthetic Datasets for Neural Program Synthesis [pdf]: (2019) Shin, R., Kant, N., Gupta, K., Bender, C., Trabucco, B., Singh, R., & Song, D.X.
Relational inductive biases, deep learning, and graph networks [arXiv]: (2018) Battaglia, P.W., Hamrick, J.B., Bapst, V., Sanchez-Gonzalez, A., Zambaldi, V.F., Malinowski, M., Tacchetti, A., Raposo, D., Santoro, A., Faulkner, R., Gülçehre, Ç., Song, H.F., Ballard, A.J., Gilmer, J., Dahl, G.E., Vaswani, A., Allen, K.R., Nash, C., Langston, V., Dyer, C., Heess, N.M., Wierstra, D., Kohli, P., Botvinick, M.M., Vinyals, O., Li, Y., & Pascanu, R.
Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis [arXiv]: (2018) Bunel, R., Hausknecht, M.J., Devlin, J., Singh, R., & Kohli, P.
Tree-to-tree Neural Networks for Program Translation [arXiv]: (2018) Chen, X., Liu, C., & Song, D.
Learning Explanatory Rules from Noisy Data [arXiv]: (2017) Evans, R., & Grefenstette, E.
RobustFill: Neural Program Learning under Noisy I/O [arXiv]: (2017) Devlin, J., Uesato, J., Bhupatiraju, S., Singh, R., Mohamed, A. R., & Kohli, P.
DeepCoder: Learning to Write Programs [arXiv]: (2017) Balog, M., Gaunt, A.L., Brockschmidt, M., Nowozin, S., & Tarlow, D.
Mastering the game of Go with deep neural networks and tree search [link]: (2016) Silver, D., Huang, A., Maddison, C.J., Guez, A., Sifre, L., van den Driessche, G., Schrittwieser, J., Antonoglou, I., Panneershelvam, V., Lanctot, M., Dieleman, S., Grewe, D., Nham, J., Kalchbrenner, N., Sutskever, I., Lillicrap, T.P., Leach, M., Kavukcuoglu, K., Graepel, T., & Hassabis, D.
Example-Directed Synthesis: A Type-Theoretic Interpretation [pdf]: (2016) Frankle, J., Osera, P., Walker, D., & Zdancewic, S.
The computational origin of representation and conceptual change [pdf]: (2016) Piantadosi, S.T.
Neuro-Symbolic Program Synthesis [arXiv]: (2016) Parisotto, E., Mohamed, A., Singh, R., Li, L., Zhou, D., & Kohli, P.
Probabilistic data analysis with probabilistic programming [arXiv]: (2016) Saad, F., & Mansinghka, V.
Building Machines that Learn and Think Like People [arXiv]: (2016) Lake, B. M., Ullman, T. D., Tenenbaum, J. B., & Gershman, S. J.
Do you see what I mean? Visual resolution of linguistic ambiguities [arXiv]: (2016) Berzak, Y., Barbu, A., Harari, D., Katz, B., & Ullman, S.
Understanding visual concepts with continuation learning [arXiv]: (2016) Whitney, W. F., Chang, M., Kulkarni, T., & Tenenbaum, J. B.
Neural Programmer-Interpreters [arXiv]: (2015) Reed, S. E., & de Freitas, N.
Human-level concept learning through probabilistic program induction [pdf]: (2015) Lake, B. M., Salakhutdinov, R., & Tenenbaum, J. B.
Deep convolutional inverse graphics network [arXiv]: (2015) Kulkarni, T. D., Whitney, W. F., Kohli, P., & Tenenbaum, J. B.
Concepts in a probabilistic language of thought [pdf]: (2014) Goodman, N. D., Tenenbaum, J. B., & Gerstenberg, T.
Bootstrap learning via modular concept discovery [pdf]: (2013) Dechter, E., Malmaud, J., Adams, R. P., & Tenenbaum, J. B.
Structure Discovery in Nonparametric Regression through Compositional Kernel Search [pdf]: (2013) Duvenaud, D. K., Lloyd, J. R., Grosse, R. B., Tenenbaum, J. B., & Ghahramani, Z.
Bootstrapping in a language of thought: A formal model of numerical concept learning [link]: (2012) Piantadosi, S. T., Goodman, N. D., & Tenenbaum, J. B.
Theory learning as stochastic search in a language of thought [pdf]: (2012) Ullman, T. D., Goodman, N. D., & Tenenbaum, J. B.
Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis [pdf]: (2011) McDermott, J. H., & Simoncelli, E. P.
Church: a language for generative models [pdf]: (2008) Goodman, N. D., Mansinghka, V. K., Roy, D. M., Bonawitz, K., & Tenenbaum, J. B.
A rational analysis of rule-based concept learning [pdf]: (2008) Goodman, N. D., Tenenbaum, J. B., Feldman, J., & Griffiths, T. L.
The rational basis of representativeness [pdf]: (2001) Tenenbaum, J. B., & Griffiths, T. L.

Distributed Systems

No compromises: distributed transactions with consistency, availability, and performance [pdf]: (2015) Dragojević, A., Narayanan, D., Nightingale, E. B., Renzelmann, M., ... & Castro, M.
Large-scale cluster management at Google with Borg [pdf]: (2015) Verma, A., Pedrosa, L., Korupolu, M., Oppenheimer, D., Tune, E., & Wilkes, J.
Wormhole: reliable pub-sub to support geo-replicated internet services [pdf]: (2015) Sharma, Y., Ajoux, P., Ang, P., Callies, D., Choudhary, A., ... & Kumar, S.
In search of an understandable consensus algorithm [pdf]: (2014) Ongaro, D., & Ousterhout, J.
The tail at scale [pdf]: (2013) Dean, J., & Barroso, L. A.
Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing [pdf]: (2012) Zaharia, M., Chowdhury, M., Das, T., Dave, A., Ma, J., ... & Stoica, I.
ZooKeeper: wait-free coordination for Internet-scale systems [pdf]: (2010) Hunt, P., Konar, M., Junqueira, F. P., & Reed, B.
PNUTS: Yahoo!'s hosted data serving platform [pdf]: (2008) Cooper, B. F., Ramakrishnan, R., Srivastava, U., Silberstein, A., Bohannon, P., ... & Yerneni, R.
Dynamo: Amazon's highly available key-value store [pdf]: (2007) DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, G., Lakshman, A., ... & Vogels, W.
Chord: A scalable peer-to-peer lookup service for internet applications [pdf]: (2001) Stoica, I., Morris, R., Karger, D., Kaashoek, M. F., & Balakrishnan, H.