Real-Time Object detection API using Tensorflow and OpenCV

The amount of visual data in the world today has grown exponentially in the last couple of years and this is largely due to lots of sensors everywhere. Building machine learning models that are capable of localizing and identifying obejcts in a single image remains a core challenge in computer vision. Working to solve this problem has ignited my interest into the field.

As a path to my quest, I discovered Google just released an object detection API. The API has been trained on Microsoft COCO dataset { A dataset of about 300,000 images of 90 commonly found objects} with different trainable detection models .

The higher the mAp (minimum average precision), the better the model

Project Description

I started by cloning the Tensorflow object detection repository on github. The API is an open source framework built on tensorflow making it easy to construct, train and deploy object detection models.

For this project [am on windows 10, Anaconda 3, Python 3.6], I was concerned with only the installation part and following the example which included

Installing required libraries e.g pillow, lxml etc that comes with installing the latest version of tensorflow using pip(latest version)
Downloading the latest version of protobuf compatible with my system for compilation on google protobuf releases
Cloning the tensorflow/models folder as a zip file
On my Anaconda prompt, ran this command for protobuf compilation without which the example wont work.

# From tensorflow/models/research/
protoc object_detection/protos/*.proto --python_out=.

I preferred using Spyder as my editor so i made a copy of the notebook in the example and converted it to a python file

Testing the model

Using the pre-trained ssd_mobilenet_v1_coco model (which was fast, though had the least accuracy), I decided to test it out on my own images and the results were amazing !