Top 7 Data Science Books To Transform: Novice to Intermediate

Gargeya Last Updated : 01 Jul, 2021
6 min read

This article was published as a part of the Data Science Blogathon.

What you can’t find in someone’s voice, you might find in someone’s writing.

I was always more inclined to following and referring video tutorials/lectures whenever it comes down to studying something on my own from the web. I found it easier ( just like some of you ) to understand and not go through the pain of reading available books. Most of the time, I felt the same unless recently I discovered those writers or publishers who eliminated the element of ‘bore’ from subject books and made them so… much interesting.

This started when one of my really smart friends told me to start reading books because they contain more content and adds to a really important skill for any person, that is reading and understanding. Initially, I was not keen on doing that unless he also mentioned a publisher whose books are really fun to read and interactive. This got me thinking: “does something like this really exists?” so to confirm my doubt I gave it a try and unraveled this whole new dimension of amazing books that I could read for hours.

Today, I am making myself that well-wishers of yours and sharing with you those books, those publishers whose books will make you think twice before turning your face away from books.

My favorite Publishers:

1. Manning Publications

They are an American-based publishing company founded in 1990, they mostly publish books on computer technology topics and a lot of them are worldwide famous and loved by millions of readers. The most amazing ones are mentioned below along with all my other top data science books.

2. O ‘Reilly Media Inc.

This is another absolutely fantastic American learning organization set up by Tim O’Reilly that distributes books, produces tech gatherings, and gives a web-based learning stage. Their particular image includes a woodcut of a creature on a significant number of their book covers. Their books contain lots and lots of content on the latest and high-tech topic so that their readers can become profound in those areas and excel at their work.

My Favourite Data Science Books:

1. Python Data Science Handbook by Jake VanderPlas published by O ‘Reilly.

data science books python

This book is best for those who just started doing Data Analysis or Data Science and need a go-to book to refer to all the techniques and library functionalities and strengthen their grip on python for data science and letting it work for you. The book covers these topics in great detail and depth: { IPython (Interactive Python), Numpy, Data Manipulation with Pandas, Visualization with matplotlib, Supervised and some Unsupervised Machine learning algorithms with scikit-learn }. The amount and quality of content available on these topics will significantly contribute to harnessing your skills for the first few steps in any data science project cycle.

 

2. Practical Statistics for Data Scientists by Peter Bruce, Andrew Bruce & Peter Gedeck published by O ‘Reilly.

data science books statistics

The Second edition of this book is already released and personally speaking, even if you are a starter or a practitioner, reading this book will be beneficial to you, because there are a lot of skills that I gained from this book, few were those that I have forgotten over the period of time and some that I didn’t know already. After reading this book I started feeling more confident and I can say that it was worth the read.

It includes such topics: {EDA, Data and Sampling Distribution, Statistical Experiments and Significance Testing, Regression, Classification, Statistical ML and Unsupervised Learning}. So, if you are a beginner, I would recommend you to read the 1st book and after that directly jump to this book and make yourself familiar with a lot of new skills in data science.

 

3. Introducing Data Science by Davy Cielen et.al published by Manning Publications

data science books DS

I like this book for a special reason and that is, the books contain not only the topics of data science that we see everywhere, it also includes other aspects of Data Science as a field, such as { NoSQL databases, Text mining, Text analysis, First step in Big data, and especially handling large data on a single computer.} Understanding and working with the integration of databases in your data science project is a really helpful and sought-out skill. I highly recommend you to give this a read and more or less get yourself familiar with above mentioned extra skills in your arsenal.

 

4.The Art of Statistics Learning from Data by David Spiegelhalter published by pelican publications

The Art of Statistics Learning

This book was specially recommended to me by my instructor while I was pursuing my course of Applied Data Science on Coursera by the University of Michigan. They significantly drove us into realizing the importance of skills (to be more accurate, Art) of visualization so that your visualization doesn’t say what they are not supposed to and feels self-explanatory to the reader. I highly recommend this book to those who wish to understand the depth of data visualization and master the skill.

 

5. Data Science from Scratch by Joel Grus published by O’Reilly

Data Science from Scratch

The second edition of this book is already released and it has been a popular book due to the fact that it encounters various fundamentals altogether in this single book. Starting from Crash course on Python, Visualizing Data, Linear Algebra and Statistics, Probability, Hypothesis and Inference, Getting and working with data and many more topics relating to data along with Machine learning, Neural Nets, Recommender systems, Network Analysis and whatnot involved as well. It’s a full package deal and you should definitely consider giving it a read.

6. R for Data Science by Hadley Wickham & Garrett Grolemund published by O’ Reilly

R for Data Science

Okay, so I admit that I love working with python and have only mentioned the books based on Python for Data Science. But this book is for people who like or what you give ‘R’ programming language a shot. I have tried this language and it’s nice but most of the work is related to python so I never really considered shifting my programming language to R. This book breaks that bias, I really enjoyed reading and implementing this book while I was learning ‘R’. You should definitely read this book if you are thinking of doing something fun or new in data science like learning a new language for Data science tasks. The books will tell you all about it. Definitely worth checking out.

7. Think Stats by Allen B. Downey published by O’ Reilley

Think Stats by Allen B. Downey

Think Stats is a prologue to Probability and Statistics for Python software engineers and Data Scientists (if you already are not familiar with these topics in detail).

Think Stats underlines straightforward methods that you can use to explore real data sets and answer intriguing problems. The book presents a contextual analysis utilizing data from the National Institutes of Health.

If you have essential skills in Python, you can utilize them to learn ideas in probability and statistics. Think Stats depends on a Python library for probability distributions. Many exercises included uses short programs to run various experiments and help readers develop a strong understanding.
Most of the books don’t cover Bayesian statistics yet Think Stats depends on the possibility that Bayesian techniques are too critical to even consider delaying. By exploiting the PMF and CDF libraries (used for probability distribution), it is feasible for amateurs to gain proficiency with the ideas and take care of testing issues.

That’s it for this article, I hope that these books bring more shine to your skillset. Keep Growing, Keep Reading, Keep Flourishing.

Gargeya Sharma

B.Tech 3rd Year Student
Specialized in Deep Learning and Data Science

For getting more info check out my Github Homepage

LinkedIn           GitHub

Photo by Annelies Geneyn on Unsplash

The media shown in this article are not owned by Analytics Vidhya and is used at the Author’s discretion. 

Responses From Readers

Clear

We use cookies essential for this site to function well. Please click to help us improve its usefulness with additional cookies. Learn about our use of cookies in our Privacy Policy & Cookies Policy.

Show details