“Ignore the gatekeepers who expect you to have a Ph.D. A relevant study can be more useful than a Ph.D.” – Ahmet Erdem
Golden words that every data science aspirant needs to learn by heart.
We see all sorts of jumbled job descriptions in data science expecting people to touch the sky in terms of their educational background. You need a Ph.D. or you need a Masters in Applied Mathematics, etc.
As someone who just wants to land their first role in data science, this can appear daunting and demotivating. So these are very timely and relevant thoughts by Ahmet Erdem.
Yes, we are delighted to share our second interview of the Kaggle Grandmaster Series with Ahmet Erdem today!
Ahmet is a Kaggle Competitions Grandmaster who currently ranks #8 – right up there in the upper echelons of Kaggle. He has won 12 gold medals and 15 silver medals in the competitions category – a remarkable achievement.
Also, he is a Kaggle Master in Notebooks and Discussions. Ahmet currently works as a Senior Data Scientist at NVIDIA and brings many years of experience across diverse firms to give you an insight into the power of data science and NLP. Also, he has a Masters’s Degree in Artificial Intelligence from KU Leuven University.
This is the second interview in the series of Kaggle Grandmaster Interviews. You can read the first interview here:
So, go through this interview and absorb all you can!
Ahmet Erdem (AE): Actually, the transition from Software Engineering to Data Science is not a big challenge.
“Job definition for Data Scientists varies a lot but I believe a Data Scientist should be good at math/statistics and programming. By studying Computer Science, one can already excel in Math and Programming.”
So what is left is Statistics and eventually Machine Learning. In order to get this missing part, I decided to do a master’s in Artificial Intelligence. There is no single path, self-learning is also possible, but I think this educational background made the difference for me.
AE: 5 key points I can recommend to anyone who wants to switch from software engineering to data science:
AE: My NLP problems at work were usually unsupervised and the data was big. Therefore, the inference time was more important than accuracy. It was more of an engineering challenge and simpler models were more useful for us. But I was interested in advanced models too. And Kaggle was the best place to keep up with all these advanced models.
When I joined Kaggle, LSTMs was something new. I had studied them and practiced them as much as I could. And suddenly, they became outdated thanks to Transformers. Again, Kaggle gave me all the opportunities to get familiar with Transformers.
AE: I believe the only challenge for NLP is the available data and compute power. If we compare current NLP models with us (humans), we have a huge time advantage. Our brains are tuned by evolution for millions of years. And they are being trained every second with everything we experience. Imagine training an NLP model for 5 years with a lot of diverse data.
“What I am trying to say is that I believe consciousness is not binary.”
And even current NLP models have it but by the time they will come to some point that turning them off will feel like you are killing someone. But with the current data privacy policies and available compute power, it will definitely take more than 2-3 years.
AE: Three competitions which were milestones for me:
AE: Yes! Here are the typical steps I follow when participating in a Kaggle Competition:
AE:
“A beginner should focus on learning instead of leaderboards.”
If they get obsessed with their ranking, it may block their learning. Reading other people’s ideas and coming up with their own ideas, and then implementing them is the key to learning.
Maybe 90% of these ideas will not work but the gained knowledge will help them in the next competitions. Otherwise tuning public kernel parameters for achieving a higher score is a total waste of time. So, the goal should not be ranking 71st today, the goal should be ranking first in the future.
This was an amazing interview packed with pearls of wisdom by a Kaggle Grandmaster.
Ahmet’s precise and to the point answers and the ultimate focus on learning is something we can all absorb from this interview. I hope this interview will help you to set your course right and rise up your data science learning aptitude!
This is the second interview in the series of Kaggle Grandmaster Interviews. You can read the first interview here-
Let us know in the comments if you have any other questions that you think we missed. You can also drop any questions you feel you want to ask a future interviewee – we’d love to focus on your thoughts as well!