Xiameng Qin

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Yan Liu
130 publications
Jingdong Wang
123 publications
Errui Ding
107 publications
Jianbing Shen
63 publications
Junyu Han
41 publications
Jingtuo Liu
31 publications
Yuwei Wu
29 publications
Xing Li
26 publications
Chengquan Zhang
21 publications
Jiaming Liu
20 publications
Kun Yao
16 publications

research

∙ 07/24/2023

MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary

Document dewarping from a distorted camera-captured image is of great va...

0 Beiya Dai, et al. ∙

research

∙ 06/06/2023

TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision

End-to-end text spotting is a vital computer vision task that aims to in...

0 Yukun Zhai, et al. ∙

research

∙ 05/19/2023

Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding

Transformers achieve promising performance in document understanding bec...

0 Mingliang Zhai, et al. ∙

research

∙ 03/01/2023

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

In this paper, we present StrucTexTv2, an effective document image pre-t...

0 Yuechen Yu, et al. ∙

research

∙ 12/14/2021

Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering

Answering semantically-complicated questions according to an image is ch...

0 JianJian Cao, et al. ∙

research

∙ 08/06/2021

StrucTexT: Structured Text Understanding with Multi-Modal Transformers

Structured text understanding on Visually Rich Documents (VRDs) is a cru...

0 Yulin Li, et al. ∙

research

∙ 09/20/2019

EATEN: Entity-aware Attention for Single Shot Visual Text Extraction

Extracting entity from images is a crucial part of many OCR applications...

13 He Guo, et al. ∙

Success!

An error occurred

Xiameng Qin

Featured Co-authors

MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary

TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision

Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding

StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training

Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering

StrucTexT: Structured Text Understanding with Multi-Modal Transformers

EATEN: Entity-aware Attention for Single Shot Visual Text Extraction

Sign in with Google

Consider DeepAI Pro