MSTRIQ: No Reference Image Quality Assessment Based on Swin Transformer with Multi-Stage Fusion

by   Jing Wang, et al.

Measuring the perceptual quality of images automatically is an essential task in the area of computer vision, as degradations on image quality can exist in many processes from image acquisition, transmission to enhancing. Many Image Quality Assessment(IQA) algorithms have been designed to tackle this problem. However, it still remains un settled due to the various types of image distortions and the lack of large-scale human-rated datasets. In this paper, we propose a novel algorithm based on the Swin Transformer [31] with fused features from multiple stages, which aggregates information from both local and global features to better predict the quality. To address the issues of small-scale datasets, relative rankings of images have been taken into account together with regression loss to simultaneously optimize the model. Furthermore, effective data augmentation strategies are also used to improve the performance. In comparisons with previous works, experiments are carried out on two standard IQA datasets and a challenge dataset. The results demonstrate the effectiveness of our work. The proposed method outperforms other methods on standard datasets and ranks 2nd in the no-reference track of NTIRE 2022 Perceptual Image Quality Assessment Challenge [53]. It verifies that our method is promising in solving diverse IQA problems and thus can be used to real-word applications.


page 4

page 6


Related Work on Image Quality Assessment

Due to the existence of quality degradations introduced in various stage...

Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token

Image quality assessment is a fundamental problem in the field of image ...

JNDMix: JND-Based Data Augmentation for No-reference Image Quality Assessment

Despite substantial progress in no-reference image quality assessment (N...

Interpretable Image Quality Assessment via CLIP with Multiple Antonym-Prompt Pairs

No reference image quality assessment (NR-IQA) is a task to estimate the...

RTN: Reinforced Transformer Network for Coronary CT Angiography Vessel-level Image Quality Assessment

Coronary CT Angiography (CCTA) is susceptible to various distortions (e....

Data-Efficient Image Quality Assessment with Attention-Panel Decoder

Blind Image Quality Assessment (BIQA) is a fundamental task in computer ...

KonX: Cross-Resolution Image Quality Assessment

Scale-invariance is an open problem in many computer vision subfields. F...

Please sign up or login with your details

Forgot password? Click here to reset