Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition

11/02/2018
by   Hui Li, et al.
0

Recognizing irregular text in natural scene images is challenging due to the large variance in text appearance, such as curvature, orientation and distortion. Most existing approaches rely heavily on sophisticated model designs and/or extra fine-grained annotations, which, to some extent, increase the difficulty in algorithm implementation and data collection. In this work, we propose an easy-to-implement strong baseline for irregular scene text recognition, using off-the-shelf neural network components and only word-level annotations. It is composed of a 31-layer ResNet, an LSTM-based encoder-decoder framework and a 2-dimensional attention module. Despite its simplicity, the proposed method is robust and achieves state-of-the-art performance on both regular and irregular scene text recognition benchmarks. The code will be released.

READ FULL TEXT

page 1

page 7

research
04/02/2019

A Simple and Robust Convolutional-Attention Network for Irregular Text Recognition

Reading irregular text of arbitrary shape in natural scene images is sti...
research
01/10/2019

A Multi-Object Rectified Attention Network for Scene Text Recognition

Irregular text is widely used. However, it is considerably difficult to ...
research
06/06/2020

A Robust Attentional Framework for License Plate Recognition in the Wild

Recognizing car license plates in natural scene images is an important y...
research
08/06/2019

Symmetry-constrained Rectification Network for Scene Text Recognition

Reading text in the wild is a very challenging task due to the diversity...
research
02/22/2021

CSTR: A Classification Perspective on Scene Text Recognition

The prevalent perspectives of scene text recognition are from sequence t...
research
06/13/2019

2D Attentional Irregular Scene Text Recognizer

Irregular scene text, which has complex layout in 2D space, is challengi...
research
01/01/2022

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

In the last decades, scene text recognition has gained worldwide attenti...

Please sign up or login with your details

Forgot password? Click here to reset