The scope of professional sports has changed over the years. I remember watching every minute of the 2003 Cricket World Cup and spending every waking minute tracking statistics, like the total runs scored, highest run-scorer, highest run rate, and so on.
It was fairly rudimentary stuff but enough to keep me glued to the screen. How times have changed since then!
Sports analytics is quickly becoming mainstream. Media outlets and leading sports websites regularly curate statistics, produce deep technical insights, and add a whole new level of analysis we haven’t seen before.
We can now answer questions like the below ones with a high degree of confidence:
And so on. Honestly, the sky is the limit when it comes to sports analytics use cases. I’m a sports lover and I’m always looking out for applications where I can apply my analytics and machine learning knowledge to improve the team strategy as well as fan experience.
I’ll introduce you to the awe-inspiring world of sports analytics in this article. We will look at the different types of sports analytics, why this field is important, and we’ll also work on a use case of sports analytics – analyzing cricket commentary to generate insights.
Sports Analytics is all about analyzing and extracting useful insights from sports data.
I would broadly dive Sports Analytics into 2 categories:
Let us discuss each category here.
Descriptive Sports Analytics is about summarizing the sports data in the form of numbers. In other words, to come up with important statistics. This might sound like a simple concept but it’s a very powerful one.
The thought behind descriptive sports analytics plays a crucial role in team tactics.
Let’s take cricket for example. Here, we can analyze how frequently a batsman gets out to a specific bowler. This number will decide the bowling strategy of a team.
Here is an awesome video that analyzes the dismissals of Virat Kohli against Adam Zampa:
This is the reason why Adam Zampa was brought back into the attack whenever Virat Kohli was at the crease during Australia’s tour of India in 2020. In this series, Virat Kohli lost his wicket to Zampa in two out of three matches!
Another interesting use case in cricket is to analyze the team’s probability of winning a match while batting first as well as second. This influences the captain who wins the toss and has to make a decision – bat or bowl first.
Predictive Sports Analytics is about making predictions using sports data. One such use case in cricket is to predict the number of runs a batsman scores against an opponent in a particular match. This would help the team management and captain select the best team for every match.
In a sport like football, predictive sports analytics helps to understand the chances of scoring a goal from any location on the pitch.
You can think of similar use cases for your favorite sport and let me know in the comments section below the article.
Sports Analytics is a game-changer – there’s no other way to put it. Using analytics in sports directly impacts the decision making of a team and can alter the future of the franchise or club (or country). It can easily change the outcome of the match.
Sports Analytics can be a Game Changer.
There is a lot of scope for analytics in sports. In this section, I am going to discuss a few use cases of analytics in different sports, like cricket, football, and tennis.
In cricket, we can analyze the strong and weak zone of a player. This would help the opponent and player understand the strengths and weaknesses of how he plays.
Here is an awesome video that showcases the weak zone of Virat Kohli:
The footballing world has been slow to adopt analytics but it’s quickly gathering pace now. We’re seeing the mainstream media using analytics numbers, such as expected goals and expected assists to analyze players and matches.
You should definitely keep an eye out on the Expected Goals (xG) metric. xG basically tells us the probability of a shot converting into a goal. This varies from player to player and from what position the shot is being taken. It’s quite a fascinating concept and you can read more about it here.
Another example of analytics in football is analyzing team formation while the match is going on. This would help the opponent to understand the team strategy and play according to it.
In tennis, we can identify the combination of shots a player usually plays to win a point. This can be of great use to prepare a strategy against the opponent as well.
I’m sure you must have seen the statistics that come up on screen after the end of each set at a tennis Grand Slam. Features like the number of first serves returned, the placement of the serve, the bounce of the serve and where the opponent picked it up – these are all examples of sports analytics in tennis.
Let’s take up a real-world case study now to understand how sports analytics works. I am going to delve into my personal passion, cricket, for this case study.
I’m an avid follower of text commentary in cricket. An insightful commentator describes the events happening on the ground in good detail, right? There is a lot of online cricket commentary available on many sports websites like CricBuzz, ESPN Cricinfo, etc. This is a gold mine that can reveal many interesting and valuable insights into a team and player.
About the Dataset for Sports Analytics
I have collected the commentary of the last 4 years of the T20 matches played by India. Download a sample dataset from here. It’s time to analyze the commentary and find some appealing insights. Let’s do it!
Implementation
Let us first read the dataset and understand the different columns in the dataset:
After this section, you will be able to answer the below questions:
Ready? Let’s get our hands dirty now!
The total number of T20s India played in the last 4 years:
Output: Total no. of T20s India played in last 4 years:54
No. of T20s India played each year:
Team Average Score (Batting First & Second):
Output: Batting First Team Average :180.0 Batting Second Team Average:156.0
Team Average Innings wise over the years:
Inferences:
Overall Winning % (Batting First & Second):
Output: Over all Winning % : 66.66 Batting First Winning % : 59.0 Batting Second Winning %: 76.0
Winning % against different teams:
Inferences:
Batting First Winning Score:
Winning % over the years:
Inferences:
In this section, I will focus on the batting performance of team India in terms of the strike rate. We’ll also discuss how India’s performance has evolved over a period of time.
Strike rate can be defined as the average number of runs scored per 100 balls. The higher the strike rate, the better the batsman is.
Let’s find out the phases where team India can improve its batting.
Overall batting strike rate of Indian team:
Output: Strike rate of Indian team is 138.66
Team batting strike rate over the years:
Inference:
Team batting strike rate across different phases of a match:
Inferences:
Team batting strike rate across different phases of a match over the years:
Inferences:
In this section, let’s unleash the bowling performance of team India in terms of Economy rate, Bowling Strike rate, and Bowling Average. And also how the performance has evolved over time.
Its time to analyze the bowling performance of the Indian team.
Team India Economy rate in different phases of a match:
Inferences:
Team India bowling performance across different phases of a match:
Inferences:
Team bowling strike rate across different phases of a match over the years:
Inferences:
In this section, we will be analyzing the average number of balls conceded by team India to score a boundary and also its evolution over the years.
Output: Avg no. of balls to hit 4: 9 Avg no. of balls to hit 6: 19
Avg number of balls to score boundary over the years:
Inferences:
Avg number of balls to score boundary across different phases of the match:
Inferences:
Avg number of balls to score 4 across different phases over the years:
Inferences:
Avg number of balls to score 6 across different phases over the years:
Inferences:
Unquestionably, Descriptive Sports Analytics has a far-reaching role to play in a team winning strategy compared to Predictive Sports Analytics. In this article, you have learned the importance of Sports Analytics and how analytics can impact different sports. We also analyzed team India’s performance over the past 4 years in T20 cricket.
Kindly leave your queries/feedback in the comments sections, I will reach out to you. Have fun implementing these ideas for your favorite sport!
Aravind, First of kudos on this great article. As an avid cricket fan it is great to learn on what analytics can do for this great sport. With going back to 1500 as its starting days the extent of tech and analytics in cricket dwarfs when compared to baseball and football. So, this definitely is valuable the questions that you have posed to be answered by this analysis - do i really need commentary data? Cant i just use the scorecard of every game? That analysis may be straight forward as opposed to mining the commentary data.
Thanks. A commentary can give us a lot of interesting insights other than runs and wickets. That's the reason for choosing commentary data for the analysis.
Hi, I am Shubham Kulkarni and just like you, I am a big cricket fan. I also analyze Cricket and make videos on Youtube. I am deeply interested in Cricket Analytics and have done a few things with stats. I studied some of the best Test and ODI players and came up with a formula to decide who are the better ones. I have used some really basic stuff and would like to use some more analytics to do the same. I have some plans in place using all this to create some amazing stuff. Can we work together and try producing some amazing stuff? Is it possible?
Great work bro ! love it . How can I get the whole 4 years dataset ?