Flipkart Object Localization

This is my attempt on the Object Localization problem from the Flipkart Grid challenge.

Problem Statement

The problem is an agnostic object detection problem in which only bounding boxes must be predicted without any class labels. Also any pre-trained weights are not allowed.
The training data was provided by Flipkart and consisted of 14k images.
https://dare2compete.com/o/Flipkart-GRiD-Teach-The-Machines-2019-74928

Solution

The solution that was propose to solve the object localization problem consists of a CNN having 4 neurons in the output layer where each corresponds to the 4 output values i.e the bounding boxes coordedinates to be determined. Therefore it will be solved as a regression problem.

Given Image Bounded Box

Architecture

The architecture of the CNN is based on the popular VGG-16 architecture commonly used for image classification problems. The final layer has been modified to output four bounding box coordinates and the activation has changed from softmax to relu.

Results

The CNN model was trained on Google Colab for 10 epochs. It achieved 79% accuracy when tested on the test set.
Due to the inavailability of images for training such a model the accuracy is a bit less.
Also perhaps a bit more data augmentation would have helped.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
model		model
readme		readme
results		results
README.md		README.md
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Flipkart Object Localization

Problem Statement

Solution

Architecture

Results

About

Uh oh!

Releases

Packages

Languages

MayankGupta73/Flipkart-Object-Localization-CNN

Folders and files

Latest commit

History

Repository files navigation

Flipkart Object Localization

Problem Statement

Solution

Architecture

Results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages