SelfText Beyond Polygon: Unconstrained Text Detection with Box Supervision and Dynamic Self-Training

11/26/2020
by Weijia Wu, et al.

Although a polygon represents text regions more accurately than an upright bounding box, polygon annotations are extremely expensive and challenging to produce. Unlike existing works that employ fully-supervised training with polygon annotations, we propose a novel text detection system termed SelfText Beyond Polygon (SBP) with Bounding Box Supervision (BBS) and Dynamic Self Training (DST), which trains a polygon-based text detector with only a limited set of upright bounding box annotations. For BBS, we first use synthetic data with character-level annotations to train a Skeleton Attention Segmentation Network (SASN). The box-level annotations are then used to guide the generation of high-quality polygon-like pseudo labels, which can be used to train any detector. In this way, our method achieves the same performance as text detectors trained with polygon annotations (i.e., both are 85.0 [...]). For DST, by removing the false alarms, it is able to leverage limited labeled data as well as massive unlabeled data to further outperform the expensive baseline. We hope SBP can provide a new perspective for text detection to save huge labeling costs.
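As a rough illustration of how a box-level annotation could guide a polygon-like pseudo label, the sketch below keeps only the segmentation scores that fall inside an upright box and wraps the surviving pixels in a polygon. This is not the authors' SASN pipeline: the function names, the `(x0, y0, x1, y1)` inclusive box convention, the 0.5 threshold, and the use of a convex hull as the polygon are all assumptions made for the example.

```python
from typing import List, Sequence, Tuple

Point = Tuple[int, int]
Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1), inclusive pixel bounds (assumed)


def foreground_in_box(prob: Sequence[Sequence[float]], box: Box,
                      thresh: float = 0.5) -> List[Point]:
    """Collect (x, y) pixels inside the upright box whose score reaches thresh."""
    x0, y0, x1, y1 = box
    return [(x, y)
            for y in range(y0, y1 + 1)
            for x in range(x0, x1 + 1)
            if prob[y][x] >= thresh]


def convex_hull(points: List[Point]) -> List[Point]:
    """Andrew's monotone chain; returns the hull vertices in traversal order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o: Point, a: Point, b: Point) -> int:
        # Z-component of (a - o) x (b - o); the sign gives the turn direction.
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower: List[Point] = []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)

    upper: List[Point] = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)

    # Drop the last point of each chain; it repeats the first point of the other.
    return lower[:-1] + upper[:-1]


def pseudo_polygon(prob: Sequence[Sequence[float]], box: Box,
                   thresh: float = 0.5) -> List[Point]:
    """Hypothetical helper: box-constrained foreground pixels -> polygon vertices."""
    return convex_hull(foreground_in_box(prob, box, thresh))


# Toy 5x5 score map: slanted text inside the box, plus two stray high scores outside it.
prob = [
    [0.9, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.8, 0.9, 0.0, 0.0],
    [0.0, 0.0, 0.9, 0.7, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.9],
]
polygon = pseudo_polygon(prob, box=(1, 1, 3, 2))
print(polygon)
```

A convex hull cannot follow curved text, so a contour-tracing step would be the more natural choice in practice; the hull is used here only to keep the sketch dependency-free. The point of the example is that the box suppresses responses outside the annotated region while the resulting polygon is tighter than the box itself.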


