27 Amazing Data Science Books Every Data Scientist Should Read

  Books on Probability Probability: For the Enthusiastic Beginner Author: David Morin Ideal book for beginners.

It is written for college students so all of you looking to learn probability from scratch will appreciate the way this is written.

All the basics are covered – combinatorics, the rules of probability, Bayes’ theorem, expectation value, variance, probability density, common distributions, the law of large numbers, the central limit theorem, correlation, and regression.

  Introduction to Probability Authors: J.

Laurie Snell and Charles Miller Grinstead Another introductory book covering basic probability concepts.

Like the book above, this one is a comprehensive text written with college graduate students in mind.

Why do I keep repeating that, you might be wondering.

It’s because I want to emphasize that if there’s a place to start learning from scratch, it’s a book that’s written for students who haven’t ever ventured into this field before.

  An Introduction to Probability Theory and its Applications Author: William Feller As the book’s description states, it’s a complete guide to the theory and practical applications of probability theory.

I recommend reading this if you really want to deep dive into the world of probability.

It’s a VERY comprehensive text and might not be to a beginner’s taste.

If you’re learning probability just to get into data science, you can get away with reading either of the two probability books mentioned above.

  Books on Machine Learning The Hundred-Page Machine Learning Book Author: Andriy Burkov I love this book.

Having read a ton of books trying to teach machine learning from various angles and perspectives, I struggled to find one that could succinctly summarize difficult topics and equations.

Until Andriy Burkov managed to do it in some 100-odd pages.

It is beautifully written, is easy to understand and has been endorsed by thought leaders like Peter Norvig.

Need I say more?.Beginner or established, every data scientist should get their hands on this book.

  Machine Learning Author: Tom Mitchell Before all the hype came about, Tom Mitchell’s book on machine learning was the go-to text to understand the math behind various techniques and algorithms.

I would suggest brushing up on your math before taking this up.

But you don’t need any background in AI or statistics to understand these concepts.

It was the first-ever book I read on ML!.It’s modestly priced so it’s definitely worth adding to your collection.

  Elements of Statistical Learning Authors: Trevor Hastie, Robert Tibshirani and Jerome Friedman And we’re back with another classic by Hastie and Tibsharani!.It’s the natural successor to the ‘Introduction to Statistical Learning’ book we covered earlier.

While there are a few overlaps with that book, this one takes a more advanced look at what we call machine learning algorithms.

Topics like neural networks, matrix factorization, spectral clustering are covered apart from the common ML techniques.

  Books on Deep Learning Deep Learning Authors: Ian Goodfellow, Yoshua Bengio and Aaron Courville What a list of rockstar authors!.The ‘Deep Learning’ book is widely regarded as the best resource for beginners.

It’s divided into three sections: Applied Math and Machine Learning Basics, Modern Practical Deep Learning Frameworks, and Deep Learning Research.

It is to-date the most cited book in the deep learning community.

Keep it by your bedside, worship it and reference it often – this will be your companion whenever you start your deep learning journey.

  Deep Learning with Python Author: Francois Chollet A really cool way of learning deep learning (or machine learning for that matter) is by programming side-by-side with the theory.

And that’s the approach Francois Chollet follows in the ‘Deep Learning with Python’ book.

Concepts are taught using the popular Keras library.

Francois is the creator of Keras so who better to teach you this topic?.I also recommend following Francois on Twitter – there is a lot we can learn from him.

  Neural Networks and Deep Learning Author: Michael Nielsen This is a free online book to learn about the core component that powers deep learning – neural networks.

I quite like the way this book has been written.

It takes a practical approach to teaching and looks at deep learning topics from the lens of a beginner.

You will not learn any programming language in this book – it’s a good old fashioned text book on the underlying insights behind neural networks.

  Books on Natural Language Processing (NLP) Natural Language Processing with Python Authors: Steven Bird, Ewan Klein and Edward Loper Another book in this collection which sticks to the learn by doing policy.

You’ll pick up Python concepts you otherwise wouldn’t have and will navigate the world of NLP using the NLTK library (Natural Language Toolkit).

While this shouldn’t be the only resource you refer to for learning NLP (it’s far too complex a field for that), it offers a pretty decent introduction to the topic.

  Foundations of Statistical Natural Language Processing Authors: Christopher Manning and Hinrich Schutze Published almost two decades ago, this text still serves as an excellent introduction to natural languages processing.

It’s a very comprehensive guide to the broader sub-topics in NLP, like Text Categorization, Parts-of-Speech Tagging, Probabilistic Parsing, among various other things.

The authors have provided a rigorous coverage of mathematical and linguistic foundations.

Again, the book is quite detailed so keep that in mind.

  Speech and Language Processing Authors: Daniel Jurafsky and James H.

Martin The emphasis of this book is on practical applications and scientific evaluation in the scope of natural language and speech.

I included this book to expand our horizons beyond text – to look at speech recognition as well.

And why not?.It’s an area of research that is thriving nowadays with a plethora of applications coming out everyday.

Jurafsky and Martin have written an in-depth book on NLP and computational linguistics.

This one is from the masters themselves.

  Books on Computer Vision Computer Vision: Algorithms and Applications Author: Richard Szeliski Explore a variety of common computer vision techniques in this book, especially ones used for analyzing and interpreting images.

While this was published almost 9 years ago, the examples and methodology illustrated by Richard Szeliski are applicable today as well.

It’s a comprehensive text that takes a scientific approach to solving basic vision challenges.

The website I have linked to above contains a free PDF copy of the book   Programming Computer Vision with Python Author: Jan Erik Solem Before you dive into this awesome book, go to the website I’ve linked above and download the datasets, the code notebooks and clone the GitHub repository mentioned there.

They are excellent companions in this REALLY hands-on introduction to the world of computer vision.

As the author states, “You’ll learn techniques for object recognition, 3D reconstruction, stereo imaging, augmented reality, and other computer vision applications as you follow clear examples written in Python.

”   Computer Vision: Models, Learning, and Inference Author: Dr.

Simon J.


Prince The book starts off from scratch by introducing us to the concepts of probability and quickly picks up pace from there.

While some of the frameworks introduced here have seen more advanced versions come out, this book is nonetheless relevant in the current context.

More than 70 algorithms have been introduced and the text is beautifully complemented by over 350 illustrations.

The website also contains PowerPoint slides, if that’s the kind of learning you prefer.

  Books on Artificial Intelligence Artificial Intelligence: A Modern Approach Authors: Stuart Russell and Peter Norvig A book written by Stuart Russell and Peter Norvig?.I am sold.

It is the leading book in Artificial Intelligence.

More than 1300 universities in over 100 countries reference/cite this book in their curriculum.

Given who the authors are, it isn’t surprising to see the book length – 1100 pages.

Covering the length and breadth of AI components – speech recognition, autonomous vehicles, machine translation, and computer vision among other things, this can be considered the Bible of AI.

  Artificial Intelligence for Humans Author: Jeff Heaton What are the foundational algorithms underneath artificial intelligence?.This book packs a lot of technical know-how into just 222 pages.

This is volume 1 of a series of books on the techniques behind AI (dimensionality, distance metrics, clustering, error calculation, hill climbing, Nelder Mead, and linear regression).

There is an accompanying site as well which contains examples cited in the book + a GitHub repository containing the code.

  The Master Algorithm Author: Pedro Domingos If you’re looking for a technical book on AI, this isn’t it.

What it is, however, is a masterful text on how machine learning is remaking business, politics, science and war.

It is a thoughtful and thought-provoking book on where AI is right now, and where it might end up taking the human race.

Will we ever find a single algorithm (or ‘The Master Algorithm’) that is capable of driving all knowledge from data?.Join Pedro Domingos in his quest to find out.

  Books on Python Fluent Python: Clear, Concise, and Effective Programming Author: Luciano Ramalho There are way too many resources out there to learn Python but nothing teaches you programming like a good old-fashioned book.

As you might expect from a coding book, it’s a hands-on guide to help you understand how Python works and how to write awesome and effective Python code.

Luciano Ramalho also covers a few popular libraries you’ll find yourself regularly using in data science projects.

With a length of 794 pages, this book is worth the spend.

  Programming Python: Powerful Object-Oriented Programming Author: Mark Lutz Wait, another Python book?!.If you thought the above book taught you everything you need to know about Python, think again.

This is a vast programming language with a lot more left to cover.

Once you’ve mastered the fundamentals from the above book by Luciano Ramalho, take a gander on this one by Mark Lutz.

There are in-depth tutorials on a wide variety of topics: databases, networking, text processing, GUIs, etc.

Tons and tons of examples are included.

A must-read for programming geeks.

  Mastering Python for Data Science Author: Samir Madhavan The two books we have covered so far for learning Python looked at the language from a programming perspective.

Now it’s time to learn it from the data science angle.

Which data science libraries are commonly used and how?.How can you create data visualizations and mine for patterns in Python?. More details

Leave a Reply