Document dewarping from a distorted camera-captured image is of great va...
End-to-end text spotting is a vital computer vision task that aims to
in...
Transformers achieve promising performance in document understanding bec...
In this paper, we present StrucTexTv2, an effective document image
pre-t...
Answering semantically-complicated questions according to an image is
ch...
Structured text understanding on Visually Rich Documents (VRDs) is a cru...
Extracting entity from images is a crucial part of many OCR applications...