Let's Enhance: A Deep Learning Approach to Extreme Deblurring of Text Images

11/18/2022
by   Theophil Trippe, et al.
0

This work presents a novel deep-learning-based pipeline for the inverse problem of image deblurring, leveraging augmentation and pre-training with synthetic data. Our results build on our winning submission to the recent Helsinki Deblur Challenge 2021, whose goal was to explore the limits of state-of-the-art deblurring algorithms in a real-world data setting. The task of the challenge was to deblur out-of-focus images of random text, thereby in a downstream task, maximizing an optical-character-recognition-based score function. A key step of our solution is the data-driven estimation of the physical forward model describing the blur process. This enables a stream of synthetic data, generating pairs of ground-truth and blurry images on-the-fly, which is used for an extensive augmentation of the small amount of challenge data provided. The actual deblurring pipeline consists of an approximate inversion of the radial lens distortion (determined by the estimated forward model) and a U-Net architecture, which is trained end-to-end. Our algorithm was the only one passing the hardest challenge level, achieving over 70 recognition accuracy. Our findings are well in line with the paradigm of data-centric machine learning, and we demonstrate its effectiveness in the context of inverse problems. Apart from a detailed presentation of our methodology, we also analyze the importance of several design choices in a series of ablation studies. The code of our challenge submission is available under https://github.com/theophil-trippe/HDC_TUBerlin_version_1.

READ FULL TEXT

page 4

page 13

page 14

page 16

page 17

page 18

page 20

page 22

research
09/20/2023

Large Synthetic Data from the arXiv for OCR Post Correction of Historic Scientific Articles

Scientific articles published prior to the "age of digitization" ( 1997)...
research
01/17/2023

Face Inverse Rendering via Hierarchical Decoupling

Previous face inverse rendering methods often require synthetic data wit...
research
08/31/2018

Full Workspace Generation of Serial-link Manipulators by Deep Learning based Jacobian Estimation

Apart from solving complicated problems that require a certain level of ...
research
12/01/2017

InverseNet: Solving Inverse Problems with Splitting Networks

We propose a new method that uses deep learning techniques to solve the ...
research
11/03/2020

Tabular Transformers for Modeling Multivariate Time Series

Tabular datasets are ubiquitous in data science applications. Given thei...
research
08/26/2023

Homological Convolutional Neural Networks

Deep learning methods have demonstrated outstanding performances on clas...
research
02/26/2021

Beyond Convolutions: A Novel Deep Learning Approach for Raw Seismic Data Ingestion

Traditional seismic processing workflows (SPW) are expensive, requiring ...

Please sign up or login with your details

Forgot password? Click here to reset