ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition

12/23/2020
by Zuoyu Yan, et al.

Despite recent advances in optical character recognition (OCR), mathematical expressions remain difficult to recognize because of their two-dimensional graphical layout. In this paper, we propose a convolutional sequence modeling network, ConvMath, which converts the mathematical expression in an image into a LaTeX sequence in an end-to-end way. The network combines an image encoder for feature extraction and a convolutional decoder for sequence generation. Unlike other encoder-decoder models based on Long Short-Term Memory (LSTM), ConvMath is entirely based on convolution, so its computation is easy to parallelize. In addition, the network adopts a multi-layer attention mechanism in the decoder, which allows the model to align output symbols with source feature vectors automatically and alleviates the lack-of-coverage problem during training. The performance of ConvMath is evaluated on an open dataset, IM2LATEX-100K, which contains 103,556 samples. The experimental results demonstrate that the proposed network achieves state-of-the-art accuracy and much better efficiency than previous methods.
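To make the decoder idea concrete, here is a minimal pure-Python sketch of a convolutional decoder with one attention step per layer. It is not the authors' implementation. The abstract does not specify the layer design, so the gated linear unit, the dot-product attention, the residual sum, and every name here (`causal_conv1d`, `glu`, `attend`, `decoder_layer`, `decode`, `init_layer`) are assumptions chosen for illustration, with random weights in place of trained ones.

```python
import math
import random


def causal_conv1d(x, w, b):
    """1D convolution over time, left-padded so step t only sees steps <= t."""
    k, d_in = len(w), len(x[0])
    padded = [[0.0] * d_in for _ in range(k - 1)] + x
    out = []
    for t in range(len(x)):
        y = list(b)
        for j in range(k):
            for i in range(d_in):
                for o in range(len(b)):
                    y[o] += padded[t + j][i] * w[j][i][o]
        out.append(y)
    return out


def glu(v):
    """Gated linear unit (assumed): first half gated by sigmoid of second half."""
    half = len(v) // 2
    return [a / (1.0 + math.exp(-g)) for a, g in zip(v[:half], v[half:])]


def attend(h, src):
    """Dot-product attention (assumed) of one decoder state over source features."""
    scores = [sum(a * s_i for a, s_i in zip(h, s)) for s in src]
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    weights = [e / z for e in exps]
    ctx = [sum(wt * s[i] for wt, s in zip(weights, src)) for i in range(len(h))]
    return ctx, weights


def decoder_layer(x, src, w, b):
    """Causal convolution, gating, attention, then a residual sum."""
    out, attn = [], []
    for x_t, c_t in zip(x, causal_conv1d(x, w, b)):
        h_t = glu(c_t)
        ctx, wts = attend(h_t, src)
        out.append([xi + hi + ci for xi, hi, ci in zip(x_t, h_t, ctx)])
        attn.append(wts)
    return out, attn


def decode(x, src, layers):
    """Stack layers; each layer has its own attention over the image features."""
    attn_per_layer = []
    for w, b in layers:
        x, attn = decoder_layer(x, src, w, b)
        attn_per_layer.append(attn)
    return x, attn_per_layer


def init_layer(rng, k, d):
    """Random weights of shape k x d x 2d (2d because the GLU halves the width)."""
    w = [[[rng.uniform(-0.1, 0.1) for _ in range(2 * d)] for _ in range(d)]
         for _ in range(k)]
    return w, [0.0] * (2 * d)


rng = random.Random(0)
d, k = 4, 3
src = [[rng.uniform(-1, 1) for _ in range(d)] for _ in range(6)]  # image feature vectors
tgt = [[rng.uniform(-1, 1) for _ in range(d)] for _ in range(5)]  # embedded LaTeX tokens
layers = [init_layer(rng, k, d) for _ in range(2)]
states, attn = decode(tgt, src, layers)
```

Because every output position is computed from a fixed window of earlier positions rather than from a recurrent state, all positions in a layer can be computed independently during training. That is the property behind the parallelism claim. The per-layer `attn` lists correspond to the alignment between output symbols and source feature vectors.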

research
12/04/2017

A GRU-based Encoder-Decoder Approach with Attention for Online Handwritten Mathematical Expression Recognition

In this study, we present a novel end-to-end approach based on the encod...
research
11/01/2018

A sequential guiding network with attention for image captioning

The recent advances of deep learning in both computer vision (CV) and nat...
research
08/29/2019

Translating Mathematical Formula Images to LaTeX Sequences Using Deep Neural Networks with Sequence-level Training

In this paper we propose a deep neural network model with an encoder-dec...
research
02/26/2020

Expression Recognition in the Wild Using Sequence Modeling

As we exceed upon the procedures for modelling the different aspects of ...
research
01/23/2022

AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder Networks

This work proposes an attention-based sequence-to-sequence model for han...
research
07/10/2019

Multi-layer Attention Mechanism for Speech Keyword Recognition

As an important part of speech recognition technology, automatic speech ...
research
10/25/2021

Learning Continuous Face Representation with Explicit Functions

How to represent a face pattern? While it is presented in a continuous w...
