In an era where data science and machine learning are reshaping our world, Joshua Starmer stands out as a leading educator and innovator. With a unique background in computer science and a passion for biology, he has carved a path that merges these fields seamlessly. Through his journey, he identified a niche in data analytics and machine learning, integrating his computational skills with biological research. Starmer’s story and insights offer a fascinating glimpse into the world of education, adaptation, and the power of merging diverse skill sets.
You can listen to this episode of Leading with Data on popular platforms like Spotify, Google Podcasts, and Apple. Pick your favorite to enjoy the insightful content!
Let’s look into the details of our conversation with Joshua Starmer!
My journey into data science and machine learning began with a fascination for computers and programming that I nurtured since childhood. However, it wasn’t until much later, after earning a degree in computer science and working in a hospital on database work, that I took a biology course that completely captivated me. This newfound interest in biology led me to explore how I could merge my computational skills with biological research.
Eventually, I pursued a PhD in bioinformatics, which is essentially the application of statistics to biological data. My goal was to conduct biological research, but I ended up in a genetics laboratory at the University of North Carolina, where I realized my true passion lay in data analytics. This realization led to the creation of my YouTube channel, StatQuest, as a means to teach myself statistics and share that knowledge with others.
StatQuest was born out of my desire to better understand statistics and to communicate complex analytical concepts to my colleagues in a relatable and understandable way. Initially, the channel was intended for a small audience—my coworkers in the lab. I used examples from our research on mice to explain statistical methodologies. The channel’s success among my colleagues encouraged me to continue creating content, and eventually, it caught the attention of a broader audience. A video on principal component analysis marked a turning point, expanding my reach and solidifying my role as an educator in the field of data science.
In the early days, my content was driven by the immediate needs of my lab colleagues. As I transitioned to full-time content creation, I had to abstract from my direct lab experience and anticipate the broader needs of the data science community. I began conducting workshops and consulting to stay connected with real-world applications of data science. This hands-on experience has been invaluable in creating content that is not only informative but also grounded in practical use cases.
The biggest challenge is often starting from a place of confusion. I dive deep into reading and coding to understand new concepts like state space models. This process can be time-consuming, with some videos taking years to produce. However, my goal is to distill complex ideas into simple, visual explanations that resonate with a wide audience. I strive to create content that is exceptional and above average, which means constantly refining and updating my approach.
Running a business involves much more than just creating videos. I handle customer service, website maintenance, and various administrative tasks, which can limit the time I spend on actual content creation. Despite these demands, I’m exploring ways to streamline business operations to potentially return to consulting or lab work part-time. This would allow me to stay connected with the practical side of data science and continue to improve as an educator.
Generative AI has been useful for generating rough drafts, whether it’s for programming or explaining concepts. It helps transform the intimidating “blank page problem” into an editing problem, giving me a starting point to refine and tailor the content for teaching purposes. While I don’t rely on generative AI extensively, it serves as a helpful tool for brainstorming and overcoming initial creative hurdles.
I’m currently working on a book dedicated entirely to neural networks. My first book, “The StatQuest Illustrated Guide to Machine Learning,” provided a broad overview of machine learning techniques. However, given the popularity and complexity of neural networks, I felt they deserved a book of their own. I aim to release this new book by the end of the year, and it will cover neural networks in depth, with the same visual and accessible approach that characterizes my other educational materials.
I wish there was a way to update educational content on YouTube more seamlessly. Just like new editions of books replace old ones on bookstore shelves, I’d like to see a system where updated videos can easily take the place of their outdated versions. This would ensure that learners always have access to the most current and relevant information without having to navigate through multiple versions of the same content.
Joshua Starmer’s journey showcases the power of merging diverse skill sets and a passion for education. Through StatQuest, he has not only filled a gap in data science education but also inspired a global audience to embrace complex topics. His iterative learning process, patient persistence, and innovative use of tools like generative AI offer valuable lessons for educators and content creators. As Starmer continues to evolve his craft, consulting, and exploring new avenues, his impact on the field of data science education will undoubtedly leave a lasting legacy.
For more engaging sessions on AI, data science, and GenAI, stay tuned with us on Leading with Data.