Data Science Books
Last updated: September 05, 2023
Data science covers a variety of disciplines and we have expert book recommendations that cover it all. Statistics, data analytics, data vizualisation and the computer language Python.
We spoke to Roger D. Peng, Professor of Biostatistics at Johns Hopkins University to get an overview of data science: "Data science is a pretty big tent. It encompasses a lot of people, and that’s kind of the point. One of the reasons this new concept of ‘data science’ has appeared in recent years is that it covers a wide range of activities that many people have been doing all along."
-
1
Factfulness: Ten Reasons We're Wrong About The World — And Why Things Are Better Than You Think
by Hans Rosling -
2
The Signal and the Noise
by Nate Silver -
3
Superforecasting: The Art and Science of Prediction
by Dan Gardner & Philip E Tetlock -
4
Thinking in Bets: Making Smarter Decisions When You Don't Have All the Facts
by Annie Duke -
5
Hello World: How to Be Human in the Age of the Machine
by Hannah Fry
The best books on Using Data to Understand the World, recommended by Edouard Mathieu
The best books on Using Data to Understand the World, recommended by Edouard Mathieu
Even as more and more data becomes available, many of us have a view of the world that doesn’t correspond to reality. On probabilities in particular, people tend to be completely clueless. Here Edouard Mathieu, Head of Data at Oxford-based research group Our World in Data, recommends books to help readers not only use data to better understand the world, but also make better decisions in daily life.
-
1
Statistical Evidence: A Likelihood Paradigm
by Richard Royall -
2
Visualize This: The FlowingData Guide to Design, Visualization, and Statistics
by Nathan Yau -
3
Storytelling with Data: A Data Visualization Guide for Business Professionals
by Cole Nussbaumer Knaflic -
4
An Introduction to Statistical Learning: with Applications in R
by Daniela Witten, Gareth James, Robert Tibshirani & Trevor Hastie -
5
Design Thinking: Understanding How Designers Think and Work
by Nigel Cross
The best books on Data Science, recommended by Roger D. Peng
The best books on Data Science, recommended by Roger D. Peng
From complex techniques only used by academic statisticians, data science has risen to extreme popularity in only a few years. Roger D. Peng, Professor of Biostatistics at Johns Hopkins University and founder of one of the largest data science online courses, helps us understand this discipline and recommends the five best books to delve into it.
-
1
Learn Python the Hard Way
by Zed A. Shaw -
2
Coders at Work: Reflections on the Craft of Programming
by Peter Seibel -
3
Big Data: Principles and Best Practices of Scalable Realtime Data Systems
Nathan Marz (with James Warren) -
4
How To Lie With Statistics
by Darrell Huff -
5
Computer Organization and Design MIPS Edition: The Hardware/Software Interface
by David A. Patterson & John L. Hennessy
The best books on Learning Python and Data Science, recommended by Vicki Boykis
The best books on Learning Python and Data Science, recommended by Vicki Boykis
What do we mean when we talk about ‘big data’, and how can be become better critical consumers of it? Data scientist Vicki Boykis recommends the best books for learning Python—a language, she says, as versatile as a Swiss Army knife—and shows that it’s possible to teach yourself coding and data science.
-
1
Structure and Interpretation of Computer Programs
by Gerald Jay Sussman, Harold Abelson & Julie Sussman -
2
The Algorithm Design Manual
by Steven S. Skiena -
3
The Pragmatic Programmer: From Journeyman to Master
by Andrew Hunt & David Thomas -
4
The Art of Readable Code
by Dustin Boswell & Trevor Foucher -
5
Style: Lessons in Clarity and Grace
by Joseph Bizup & Joseph M. Williams
The best books on Computer Science for Data Scientists, recommended by Hadley Wickham
The best books on Computer Science for Data Scientists, recommended by Hadley Wickham
Data science is often said to be built on three pillars: domain expertise, statistics, and programming. Hadley Wickham, Chief Scientist at RStudio and creator of many packages for the R programming language, chooses the best books to help aspiring data scientists build solid computer science fundamentals.