BADAM: A Public Dataset for Baseline Detection in Arabic-script Manuscripts

07/09/2019
by Benjamin Kiessling, et al.
The application of handwritten text recognition to historical works is highly dependent on accurate text line retrieval. A number of systems using a robust baseline detection paradigm have emerged recently, but the advancement of layout analysis methods for challenging scripts is held back by the lack of well-established datasets that include works in non-Latin scripts. We present a dataset of 400 annotated document images from different domains and time periods. We briefly discuss the particular challenges that handwriting in Arabic script poses for layout analysis and subsequent processing steps. Lastly, we propose a method based on a fully convolutional encoder-decoder network to extract arbitrarily shaped text line images from manuscripts.
