Fine-Grained Object Detection over Scientific Document Images with Region Embeddings

10/28/2019
by Ankur Goswami, et al.

We study the problem of object detection over scanned images of scientific documents. We consider images that contain objects of varying aspect ratios and sizes, ranging from coarse elements such as tables and figures to fine elements such as equations and section headers. We find that current object detectors fail to produce properly localized region proposals over such page objects. We revisit the original R-CNN model and present a method for generating fine-grained proposals over document elements. We also present a region embedding model that uses the convolutional maps of a proposal's neighbors as context to produce an embedding for each proposal. This region embedding captures the semantic relationships between a target region and its surrounding context. Our end-to-end model produces an embedding for each proposal, then classifies each proposal using a multi-head attention model that attends to the most important neighbors of that proposal. To evaluate our model, we collect and annotate a dataset of publications from heterogeneous journals. We show that our model, referred to as Attentive-RCNN, yields a 17-point mAP improvement over standard object detection models.
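
The idea of attending over a proposal's neighbors can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the architecture, so this assumes plain scaled dot-product attention with no learned projections, where each head looks at one slice of the embedding. The function names `softmax` and `attend_to_neighbors`, and the toy embeddings, are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend_to_neighbors(target, neighbors, num_heads):
    """Multi-head scaled dot-product attention of one proposal over its neighbors.

    Assumption: no learned query/key/value projections. Each head simply
    uses its own contiguous slice of the embedding, which keeps the sketch
    dependency-free but is simpler than a trained attention layer.
    Returns (context_vector, attention_weights_per_head).
    """
    dim = len(target)
    if dim % num_heads != 0:
        raise ValueError("embedding size must be divisible by num_heads")
    head_dim = dim // num_heads

    context = []
    weights_per_head = []
    for head in range(num_heads):
        lo, hi = head * head_dim, (head + 1) * head_dim
        query = target[lo:hi]
        keys = [n[lo:hi] for n in neighbors]
        # Similarity of the target to each neighbor, scaled by sqrt(head_dim).
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(head_dim)
            for key in keys
        ]
        weights = softmax(scores)
        weights_per_head.append(weights)
        # Weighted sum of neighbor slices forms this head's context.
        for j in range(head_dim):
            context.append(sum(w * key[j] for w, key in zip(weights, keys)))
    return context, weights_per_head


# Toy example: one target proposal embedding and three neighbor embeddings.
target = [1.0, 0.0, 0.0, 1.0]
neighbors = [
    [1.0, 0.0, 0.0, 1.0],
    [0.0, 1.0, 1.0, 0.0],
    [0.5, 0.5, 0.5, 0.5],
]
context, weights = attend_to_neighbors(target, neighbors, num_heads=2)
```

In this toy setup the first neighbor, whose embedding matches the target, receives the largest weight in both heads. In a full model the resulting context vector would typically be combined with the target embedding and passed to a classifier; that step is omitted here.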

research
02/10/2023

End-to-end Semantic Object Detection with Cross-Modal Alignment

Traditional semantic image search methods aim to retrieve images that ma...
research
08/19/2019

C-RPNs: Promoting Object Detection in real world via a Cascade Structure of Region Proposal Networks

Recently, significant progresses have been made in object detection on c...
research
01/10/2019

Region Proposal by Guided Anchoring

Region anchors are the cornerstone of modern object detection techniques...
research
08/25/2020

Graphical Object Detection in Document Images

Graphical elements: particularly tables and figures contain a visual sum...
research
07/07/2022

Should All Proposals be Treated Equally in Object Detection?

The complexity-precision trade-off of an object detector is a critical p...
research
07/27/2023

Small, but important: Traffic light proposals for detecting small traffic lights and beyond

Traffic light detection is a challenging problem in the context of self-...