Computer vision (CV) is one of the hottest research topics in machine learning these days. From self-driving cars to Instagram and Facebook’s object detection technology, it has seen a rapid rise in recent times thanks to advances in hardware.
Object detection is easily one of the most common applications of computer vision. Thanks to the wonderful open-source community ML has, object detection has seen a lot of interest as more and more data scientists and ML practitioners line up to break new ground. Keep up with that trend, Google, one of the leaders in ML (perhaps THE leader in ML), has released the latest version of it’s popular TensorFlow Object Detection API framework.
Ever since it’s release last year, the TensorFlow Object Detection API has regularly received updates from the Google team. These updates have included pretrained models trained on datasets like Open Images, among other things. We have seen the community embrace this framework with open arms – detecting objects on a football field, pedestrian counting, finding cracks in the streets, etc. There have been all sorts of amazing experiments performed using this API.
And here comes the latest, and quite a major update released by Google. The changes were announced in a blog post and we have mentioned the highlights below:
This is another quality example of Google’s efforts to expand the reach of ML and help newcomers and practitioners get the most out of their work. This will undoubtedly help in advancing the research efforts in computer vision and object detection. They have even released a short tutorial on how to train a model on their Cloud TPUs, which you can check out in their blog post.
On the other side of the story, Google of course benefits as more and more data scientists and ML folks are integrated into the TensorFlow community. It’s a win-win situation for all sides! If you need help getting started with object detection, check out the below guide to get you on your way:
Understanding and Building an Object Detection Model from Scratch in Python
You can also enroll in Analytics Vidhya’s soon-to-be-launched ‘Computer Vision using Deep Learning‘ course which will cover a whole host of topics using real-world case studies. It’s a beginner friendly course and does not assume any familiarity with computer vision.
Sometime I wonder how searchengines are working. I have been looking for detection tecnology and they recommend me your site. It is very interesting. At least the part I understood, but it is definetly not what I am looking for