Recursive Recurrent Nets with Attention Modeling for OCR in the Wild

by   Chen-Yu Lee, et al.
University of California, San Diego

We present recursive recurrent neural networks with attention modeling (R^2AM) for lexicon-free optical character recognition in natural scene images. The primary advantages of the proposed method are: (1) use of recursive convolutional neural networks (CNNs), which allow for parametrically efficient and effective image feature extraction; (2) an implicitly learned character-level language model, embodied in a recurrent neural network which avoids the need to use N-grams; and (3) the use of a soft-attention mechanism, allowing the model to selectively exploit image features in a coordinated way, and allowing for end-to-end training within a standard backpropagation framework. We validate our method with state-of-the-art performance on challenging benchmark datasets: Street View Text, IIIT5k, ICDAR and Synth90k.


page 8

page 10


Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks

In this work, we jointly address the problem of text detection and recog...

Visual attention models for scene text recognition

In this paper we propose an approach to lexicon-free recognition of text...

Conditionally Learn to Pay Attention for Sequential Visual Task

Sequential visual task usually requires to pay attention to its current ...

Telugu OCR Framework using Deep Learning

In this paper, we address the task of Optical Character Recognition(OCR)...

Fully Convolutional Speech Recognition

Current state-of-the-art speech recognition systems build on recurrent n...

Training and Generating Neural Networks in Compressed Weight Space

The inputs and/or outputs of some neural nets are weight matrices of oth...

A Hybrid Framework for Sequential Data Prediction with End-to-End Optimization

We investigate nonlinear prediction in an online setting and introduce a...

Code Repositories


using rnn (lstm or gru) and ctc to convert line image into text, based on torch7 and warp-ctc

view repo

Please sign up or login with your details

Forgot password? Click here to reset