Recurrent Video Restoration Transformer with Guided Deformable Attention

06/05/2022
by Jingyun Liang, et al.

Video restoration aims to restore multiple high-quality frames from multiple low-quality frames. Existing video restoration methods generally fall into two extreme cases: they either restore all frames in parallel or restore the video frame by frame in a recurrent way, and each approach has its own merits and drawbacks. Typically, the former has the advantage of temporal information fusion but suffers from large model size and intensive memory consumption. The latter has a relatively small model size because it shares parameters across frames, but it lacks long-range dependency modeling ability and parallelizability. In this paper, we attempt to integrate the advantages of the two cases by proposing a recurrent video restoration transformer, namely RVRT. RVRT processes local neighboring frames in parallel within a globally recurrent framework, which can achieve a good trade-off between model size, effectiveness, and efficiency. Specifically, RVRT divides the video into multiple clips and uses the previously inferred clip feature to estimate the subsequent clip feature. Within each clip, different frame features are jointly updated with implicit feature aggregation. Across different clips, guided deformable attention is designed for clip-to-clip alignment: it predicts multiple relevant locations from the whole inferred clip and aggregates their features with the attention mechanism. Extensive experiments on video super-resolution, deblurring, and denoising show that the proposed RVRT achieves state-of-the-art performance on benchmark datasets with balanced model size, testing memory, and runtime.
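
The clip-wise recurrence described in the abstract can be illustrated with a minimal sketch. This is not the RVRT implementation: the function names (`split_into_clips`, `update_clip`, `recurrent_clip_pass`) are hypothetical, and the per-clip update is a simple blend with the temporal mean of the previous clip feature, standing in for the transformer blocks and guided deformable attention that the paper actually uses. It only shows the control flow: frames inside a clip are updated together, while clips are visited one after another and each receives the previously inferred clip feature.

```python
import numpy as np

def split_into_clips(frames, clip_len):
    """Split a (T, H, W, C) array into consecutive clips of at most clip_len frames."""
    return [frames[i:i + clip_len] for i in range(0, len(frames), clip_len)]

def update_clip(clip, prev_feature):
    """Hypothetical stand-in for the per-clip update.

    All frames of the clip are updated jointly (in parallel) using a summary
    of the previously inferred clip feature. The real model would replace this
    blend with attention-based alignment and feature aggregation.
    """
    context = prev_feature.mean(axis=0, keepdims=True)  # shape (1, H, W, C)
    return 0.5 * clip + 0.5 * context                   # broadcast over frames

def recurrent_clip_pass(frames, clip_len=2):
    """Visit clips in order, carrying the previous clip feature forward."""
    clips = split_into_clips(frames, clip_len)
    prev = np.zeros_like(clips[0])  # assumption: zero feature before the first clip
    outputs = []
    for clip in clips:
        prev = update_clip(clip, prev)
        outputs.append(prev)
    return np.concatenate(outputs, axis=0)

# Usage: a toy "video" of 5 frames, processed in clips of 2 frames.
video = np.ones((5, 4, 4, 3))
features = recurrent_clip_pass(video, clip_len=2)
print(features.shape)
```

With `clip_len=1` the loop degenerates to frame-by-frame recurrence, and with `clip_len` equal to the video length it processes all frames at once, which mirrors the two extremes the abstract contrasts.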

research
01/28/2022

VRT: A Video Restoration Transformer

Video restoration (e.g., video super-resolution) aims to restore high-qu...
research
04/12/2022

Unidirectional Video Denoising by Mimicking Backward Recurrent Modules with Look-ahead Forward Ones

While significant progress has been made in deep video denoising, it rem...
research
11/30/2021

Revisiting Temporal Alignment for Video Restoration

Long-range temporal alignment is critical yet challenging for video rest...
research
09/13/2023

Aggregating Long-term Sharp Features via Hybrid Transformers for Video Deblurring

Video deblurring methods, aiming at recovering consecutive sharp frames ...
research
04/27/2021

BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment

A recurrent structure is a popular framework choice for the task of vide...
research
12/03/2020

EVRNet: Efficient Video Restoration on Edge Devices

Video transmission applications (e.g., conferencing) are gaining momentu...
research
04/13/2023

Gated Multi-Resolution Transfer Network for Burst Restoration and Enhancement

Burst image processing is becoming increasingly popular in recent years....
