PixelLink: Detecting Scene Text via Instance Segmentation

01/04/2018
by   Dan Deng, et al.
0

Most state-of-the-art scene text detection algorithms are deep learning based methods that depend on bounding box regression and perform at least two kinds of predictions: text/non-text classification and location regression. Regression plays a key role in the acquisition of bounding boxes in these methods, but it is not indispensable because text/non-text prediction can also be considered as a kind of semantic segmentation that contains full location information in itself. However, text instances in scene images often lie very close to each other, making them very difficult to separate via semantic segmentation. Therefore, instance segmentation is needed to address this problem. In this paper, PixelLink, a novel scene text detection algorithm based on instance segmentation, is proposed. Text instances are first segmented out by linking pixels within the same instance together. Text bounding boxes are then extracted directly from the segmentation result without location regression. Experiments show that, compared with regression-based methods, PixelLink can achieve better or comparable performance on several benchmarks, while requiring many fewer training iterations and less training data.

READ FULL TEXT

page 1

page 3

page 5

research
06/28/2020

A Survey on Instance Segmentation: State of the art

Object detection or localization is an incremental step in progression f...
research
11/30/2018

TextMountain: Accurate Scene Text Detection via Instance Segmentation

In this paper, we propose a novel scene text detection method named Text...
research
07/06/2021

Plot2Spectra: an Automatic Spectra Extraction Tool

Different types of spectroscopies, such as X-ray absorption near edge st...
research
06/26/2020

Text Detection on Roughly Placed Books by Leveraging a Learning-based Model Trained with Another Domain Data

Text detection enables us to extract rich information from images. In th...
research
05/22/2018

Learning Markov Clustering Networks for Scene Text Detection

A novel framework named Markov Clustering Network (MCN) is proposed for ...
research
11/09/2016

Generative Shape Models: Joint Text Recognition and Segmentation with Very Little Training Data

We demonstrate that a generative model for object shapes can achieve sta...
research
08/21/2023

SRFormer: Empowering Regression-Based Text Detection Transformer with Segmentation

Existing techniques for text detection can be broadly classified into tw...

Please sign up or login with your details

Forgot password? Click here to reset