Transformer-based Image Compression

11/12/2021
by   Ming Lu, et al.
0

A Transformer-based Image Compression (TIC) approach is developed which reuses the canonical variational autoencoder (VAE) architecture with paired main and hyper encoder-decoders. Both main and hyper encoders are comprised of a sequence of neural transformation units (NTUs) to analyse and aggregate important information for more compact representation of input image, while the decoders mirror the encoder-side operations to generate pixel-domain image reconstruction from the compressed bitstream. Each NTU is consist of a Swin Transformer Block (STB) and a convolutional layer (Conv) to best embed both long-range and short-range information; In the meantime, a casual attention module (CAM) is devised for adaptive context modeling of latent features to utilize both hyper and autoregressive priors. The TIC rivals with state-of-the-art approaches including deep convolutional neural networks (CNNs) based learnt image coding (LIC) methods and handcrafted rules-based intra profile of recently-approved Versatile Video Coding (VVC) standard, and requires much less model parameters, e.g., up to 45 leading-performance LIC.

READ FULL TEXT

page 8

page 9

research
12/17/2021

Towards End-to-End Image Compression and Analysis with Transformers

We propose an end-to-end image compression and analysis model with Trans...
research
09/19/2023

Multi-Context Dual Hyper-Prior Neural Image Compression

Transform and entropy models are the two core components in deep image c...
research
07/12/2023

AICT: An Adaptive Image Compression Transformer

Motivated by the efficiency investigation of the Tranformer-based transf...
research
07/05/2023

Joint Hierarchical Priors and Adaptive Spatial Resolution for Efficient Neural Image Compression

Recently, the performance of neural image compression (NIC) has steadily...
research
09/05/2022

Uformer-ICS: A Specialized U-Shaped Transformer for Image Compressive Sensing

Recently, several studies have applied deep convolutional neural network...
research
04/25/2022

High-Efficiency Lossy Image Coding Through Adaptive Neighborhood Information Aggregation

Questing for lossy image coding (LIC) with superior efficiency on both c...

Please sign up or login with your details

Forgot password? Click here to reset