“Being dynamic is important, especially in the Data Science field.” – Mobassir Hossen
Whatever your background, transitioning into data science or becoming a Kaggle Grandmaster takes a lot of dedication and self-learning, along with the ability to be a dynamic learner.
Not sure where to start?
Well, how about hearing from another Kaggle Grandmaster? That’s right – we are proud to present this third installment in our Kaggle Grandmaster Series with Notebooks Grandmaster Mobassir Hossen!
Mobassir is a Kaggle Notebooks Grandmaster with a Kaggle rank of #44. He is also a Kaggle Discussions Master and a Kaggle Competitions Expert. He graduated with a degree in Software Engineering from Daffodil International University (DIU) and currently works as a Data Scientist at Markopolo.ai.
A journey from software engineering to data science? That’s one a lot of people would love to know more about!
So, go through this interview and absorb all you can!
Mobassir Hossen (MH): There are different kinds of fields open for a software engineer and I felt a fascination for a lot of them. So I invested a lot of time working on software security, the Internet of Things, Embedded System design, etc. I was jumping from one ship to another like that and was unable to pick a fixed field for my career.
The main problem was that I loved all of those fields, and it was hard for me to pick a single one for my career. One day, while reading papers on the Internet of Things (IoT), I came across an idea for a project and thought it would be cool if I could implement it. The idea was to design a system that detects the percentage of carbon monoxide (CO) in a room, because I had read in some papers that if the CO percentage crosses a certain threshold, it can kill the people in that room.
So I began searching for an approach I could use to tackle this problem. Then I heard people talking about an algorithm called SVM (support vector machine) that could be used to classify CO after taking data from sensors using an Arduino. My next search on Google was “what is SVM?”. Google told me that it is a machine learning algorithm. That is when I came to know about machine learning. I was in the 1st semester of my 3rd year at that time. I started taking machine learning courses to understand algorithms like SVM so I could solve the IoT problem I had in my head, and somehow I became extremely addicted to machine learning and started investing a lot of time in learning everything related to it. This is how I dropped my IoT project and picked data science as my career.
MH: Software engineers know the programming required for Software 1.0, whereas data science demands programming skills for Software 2.0. In software engineering, we had statistics and mathematics, and that helped in my transition. Other than this, I think the following points also play a role in your transition from SWE to DS:
MH: Here are some of the points I especially look for when I look for machine learning use cases in healthcare startups:
MH: It depends. If someone is willing to solve tabular data problems then:
If someone is willing to solve Radiological/computer vision-related problems then:
To assist dermatologists, one can start with “SIIM-ISIC Melanoma Classification”.
Also, Intel & MobileODT Cervical Cancer Screening and APTOS 2019 Blindness Detection are great datasets to explore.
Actually, it depends on the individual’s interest; there are lots of medical data problems. You need to ask yourself “which medical problem do I want to solve the most?” and you can start from there. As I said, it’s a “dynamic process”. You can start with a problem and realize “well, I don’t know much about this problem and also don’t know how to solve it through data-driven approaches, but now I am interested”. You can start from zero knowledge and still end up being a pro. You have almost all the resources required. All you need to do is spend a lot of time googling and reading papers, notebooks, books, etc.
MH: I still remember how it all started. It took me 77 days to finish this Coursera course on Machine Learning by Andrew Ng when I was a 3rd-year undergraduate student. I became so addicted to machine learning that I sacrificed university quizzes, presentations, exams, etc., even though I had a high CGPA until then.
Why did I do that? They call it “passion” these days 🙂
When I first started my ML journey, every CS student at my university was busy solving competitive programming problems, but I found machine learning very interesting. I wanted to learn ML, but no one around me had even basic ML knowledge, so no one could guide me well.
During my initial days in ML, the answers I received to my crucial career-related questions were very demotivating.
My question: “I am interested in machine learning. I want to become a Data Scientist. Is it the wrong idea/decision?”
Answer 1: “Mobassir, machine learning is the hottest topic now, but what happens if some other technology replaces machine learning after 10 years? What will you do? So do Codeforces competitions.”
Answer 2: “In Bangladesh, very few companies work on machine learning problems. You are less likely to get a job with this skill here, so learn web/Android frameworks and regularly solve competitive programming problems only.”
Answer 3: “if you don’t have heavy math/statistics knowledge then don’t go for machine learning”
These replies really worried me and led to a lot of self-doubt. But anyway, I signed up on Kaggle 2 years ago and became part of a community so diverse and collaborative that there was no looking back. Today I am very proud that I rejected the guidance of the people around me and followed my own, which pointed toward “my passion”.
MH: When I started my data science journey, I already had a very busy academic schedule. Consequently, I couldn’t spend much time on Kaggle. I started by “participating in the discussions”, and my SWE background helped me learn quickly. I collaborated with people in the discussion forums, and some of them later became good buddies with whom I still compete on Kaggle.
As I said, this field demands people with a “dynamic learning attitude”. I have no special talent, but I realized that I have that attitude, and that is why, after so many changes of direction, I decided to build my career in the DS field. It led me to invest a lot of time in the kernels/notebooks section, and the Grandmaster title from Kaggle followed. I can assure you that for at least 70% of the notebooks I wrote, I started with ZERO knowledge, did a lot of Google searches, and read others’ solutions and discussions; by the time I wrote the last lines of those kernels, I knew something. That is why I think “being dynamic is important, especially for the DS field”.
MH: Yeah, I have learned a few techniques from vastly experienced Kagglers in the past and I try to apply those most of the time. They are as follows:
MH: Clear documentation of each segment in markup, and comments that describe why a function is needed rather than what it does.
Wow! What an inspiring interview that was. Such wise words can only come after a lot of experience.
Mobassir’s journey is a testament to the fact that you never know how many doors will open simply by listening to yourself. I hope this interview helps you answer your DS career-related questions more precisely.
This is the third interview in our series of Kaggle interviews. You can read the first two interviews here:
What did you learn from this interview? Are there other data science leaders you would want us to interview? Let me know in the comments section below!