IQA Vision Transformed: A Survey of Transformer Architectures in Perceptual Image Quality Assessment

Mobeen Ur Rehman, Imran Fareed Nizami, Farman Ullah, Irfan Hussain

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

In an era dominated by visual content, perceptual image quality assessment (IQA) is crucial for enhancing user experiences and driving technological advancements across various domains. This survey paper reviews the integration of Vision Transformers (ViTs) into both no-reference (NR) and full-reference (FR) IQA methods, highlighting their promise as alternatives to traditional techniques. ViTs leverage attention mechanisms to focus selectively on relevant image patches, showing promise in aligning more closely with human perceptual errors. We identify key limitations of conventional IQA methods and track the evolution from early learning-based approaches to contemporary deep learning models, with a specific focus on ViTs.We discuss the performance of Transformer-based models in capturing image distortions and their strong correlation with subjective IQA metrics. We also discuss potential breakthroughs, including the development of hybrid architectures combining Capsule Networks and Transformers, adaptive IQA through meta-learning, and scalable solutions using quantum-inspired computing. These advancements promise to enhance perceptual quality assessment, with substantial implications for industries such as medical imaging, multimedia applications, and beyond. This study aims to set the groundwork for future research in transformer-based methodologies, offering new insights into the transformative impact of these models on IQA.

Original languageBritish English
JournalIEEE Access
DOIs
StateAccepted/In press - 2024

Keywords

  • Attention Mechanisms
  • Capsule Networks
  • Cross-Domain Evaluation
  • Deep Learning for IQA
  • Hybrid Architectures
  • Meta-Learning for IQA
  • Multimedia Applications
  • Perceptual Image Quality Assessment (IQA)
  • Quantum-Inspired Computing
  • Transformer Architectures
  • Vision Transformers (ViTs)

Fingerprint

Dive into the research topics of 'IQA Vision Transformed: A Survey of Transformer Architectures in Perceptual Image Quality Assessment'. Together they form a unique fingerprint.

Cite this