Semi-Bagging Based Deep Neural Architecture to Extract Text from High Entropy Images

07/02/2019
by   Pranay Dugar, et al.
2

Extracting texts of various size and shape from images containing multiple objects is an important problem in many contexts, especially, in connection to e-commerce, augmented reality assistance system in natural scene, etc. The existing works (based on only CNN) often perform sub-optimally when the image contains regions of high entropy having multiple objects. This paper presents an end-to-end text detection strategy combining a segmentation algorithm and an ensemble of multiple text detectors of different types to detect text in every individual image segments independently. The proposed strategy involves a super-pixel based image segmenter which splits an image into multiple regions. A convolutional deep neural architecture is developed which works on each of the segments and detects texts of multiple shapes, sizes, and structures. It outperforms the competing methods in terms of coverage in detecting texts in images especially the ones where the text of various types and sizes are compacted in a small region along with various other objects. Furthermore, the proposed text detection method along with a text recognizer outperforms the existing state-of-the-art approaches in extracting text from high entropy images. We validate the results on a dataset consisting of product images on an e-commerce website.

READ FULL TEXT

page 1

page 5

page 8

research
12/14/2017

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition

Detecting and recognizing text in natural scene images is a challenging,...
research
11/21/2016

TextBoxes: A Fast Text Detector with a Single Deep Neural Network

This paper presents an end-to-end trainable fast scene text detector, na...
research
07/27/2017

STN-OCR: A single Neural Network for Text Detection and Text Recognition

Detecting and recognizing text in natural scene images is a challenging,...
research
07/09/2018

Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes

The requirement of large amounts of annotated images has become one gran...
research
12/10/2018

PlaneRCNN: 3D Plane Detection and Reconstruction from a Single Image

This paper proposes a deep neural architecture, PlaneRCNN, that detects ...
research
11/06/2017

Image Segmentation of Multi-Shaped Overlapping Objects

In this work, we propose a new segmentation algorithm for images contain...
research
03/21/2019

Towards Robust Curve Text Detection with Conditional Spatial Expansion

It is challenging to detect curve texts due to their irregular shapes an...

Please sign up or login with your details

Forgot password? Click here to reset