PixelLink

What it does

Given an Image identify all instances of text present in it. This is useful for autonoumous cars that can accurately detect road signs. This is also useful for people with visual impairment that need help in reading signs and signals in and around their environment. The algorithm does this fast enough that a 60fps video can be recognized in real time. >90% accurracy.

How I built it

Rather than using the general semantic segmentation which introduces several challenges due to different charachters eing close to each other, I tried to implement a concept called Instance Segmentation. Tools used are python, tensorflow and opencv.

Challenges I ran into

It was difficult to understand certain parts of implementation of the paper which needed to be tweaked.

What I learned

What Instance segmentation is and how to implement it in code.

What's next for PixelLink

Test it with a robust dataset and maintain its accuracy.

Built With

python
tensforlow

Updates

Namit Juneja started this project — Sep 07, 2019 04:53 PM EDT

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.