Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention

06/17/2021
by Matthias Springstein, et al.

The recognition of handwritten mathematical expressions in images and video frames remains a difficult, unsolved problem. Deep convolutional neural networks are a promising approach, but they typically require a large amount of labeled training data, and no such dataset exists for the task of handwritten formula recognition. In this paper, we introduce a system that creates a large set of synthesized training examples of mathematical expressions derived from LaTeX documents. For this purpose, we propose a novel attention-based generative adversarial network that translates rendered equations into handwritten formulas. The datasets generated by this approach contain hundreds of thousands of formulas, making them well suited for pretraining or for the design of more complex models. We evaluate our synthesized dataset and the recognition approach on the CROHME 2014 benchmark dataset. Experimental results demonstrate the feasibility of the approach.
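
The abstract does not spell out the network architecture, so the following is only a minimal sketch of a generic self-attention layer over an image feature map, of the kind commonly used in attention-based GANs. It is not the authors' model. The function name `self_attention_2d`, the projection matrices, the residual weight `gamma`, and all shapes are assumptions made for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention_2d(features, w_query, w_key, w_value, gamma=0.0):
    """Hypothetical self-attention over a feature map (illustrative only).

    features: array of shape (H, W, C)
    w_query, w_key: projection matrices of shape (C, D)
    w_value: projection matrix of shape (C, C)
    gamma: scalar weight of the attention branch in the residual sum
    """
    h, w, c = features.shape
    flat = features.reshape(h * w, c)      # one row per spatial position

    q = flat @ w_query                     # (N, D) queries
    k = flat @ w_key                       # (N, D) keys
    v = flat @ w_value                     # (N, C) values

    # Each position attends to every other position in the image.
    attn = softmax(q @ k.T, axis=-1)       # (N, N), rows sum to 1
    out = attn @ v                         # (N, C) attention-weighted values

    # Residual connection: gamma = 0 leaves the input unchanged.
    return (gamma * out + flat).reshape(h, w, c), attn


# Assumed toy sizes: a 4x6 feature map with 8 channels.
rng = np.random.default_rng(0)
feats = rng.normal(size=(4, 6, 8))
wq = rng.normal(size=(8, 2))
wk = rng.normal(size=(8, 2))
wv = rng.normal(size=(8, 8))

out, attn = self_attention_2d(feats, wq, wk, wv, gamma=0.5)
print(out.shape, attn.shape)
```

Because every position can attend to every other one, such a layer can relate distant strokes of a formula (for example, a fraction bar and its operands), which plain convolutions with small receptive fields capture less directly.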

research
08/08/2016

Database of handwritten Arabic mathematical formulas images

Although publicly available, ground-truthed databases have proven useful ...
research
03/14/2021

Bangla Handwritten Digit Recognition and Generation

Handwritten digit or numeral recognition is one of the classical issues ...
research
03/01/2019

Adversarial Generation of Handwritten Text Images Conditioned on Sequences

State-of-the-art offline handwriting text recognition systems tend to us...
research
09/16/2016

Image-to-Markup Generation with Coarse-to-Fine Attention

We present a neural encoder-decoder model to convert images into present...
research
05/21/2021

GSSF: A Generative Sequence Similarity Function based on a Seq2Seq model for clustering online handwritten mathematical answers

Toward a computer-assisted marking for descriptive math questions, this p...
research
01/21/2019

Pattern Generation Strategies for Improving Recognition of Handwritten Mathematical Expressions

Recognition of Handwritten Mathematical Expressions (HMEs) is a challeng...
research
01/31/2023

FLAME: A small language model for spreadsheet formulas

The widespread use of spreadsheet environments by billions of users pres...
