Skip to main navigation Skip to search Skip to main content

TVFormer: Trajectory-guided Visual Quality Assessment on 360° Images with Transformers

  • Li Yang
  • , Mai Xu*
  • , Tie Liu
  • , Liangyu Huo
  • , Xinbo Gao
  • *Corresponding author for this work
  • Beihang University
  • Chongqing University of Posts and Telecommunications

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Visual quality assessment (VQA) on 360° images plays an important role in optimizing immersive multimedia systems. Due to the absence of pristine 360° images in real world, blind VQA (BVQA) on 360° images has drawn much research attention. In subjective VQA on 360^ images, human intuitively make the quality-scoring decisions through the quality degradation of each observed viewport on the head trajectories. Unfortunately, the existing BVQA works for 360° images neglect the dynamic property of head trajectories with viewport interactions, thus failing to obtain human-like quality scores. In this paper, we propose a novel Transformer-based approach for trajectory-guided VQA on 360° images (named TVFormer), in which both the tasks of head trajectory prediction and BVQA can be accomplished for 360° images. In the first task, we develop a trajectory-Aware memory updater (TMU) module, for maintaining the coherence and accuracy of predicted head trajectories. To capture the long-range quality dependency across time-ordered viewports, we propose a spatio-Temporal factorized self-Attention (STF) module in the encoder of TVFormer for the BVQA task. By implanting the predicted head trajectories into the BVQA task, we can obtain the human-like quality scores. Extensive experiments demonstrate the superior BVQA performance of TVFormer over state-of-The-Art approaches on three benchmark datasets.

Original languageEnglish
Title of host publicationMM 2022 - Proceedings of the 30th ACM International Conference on Multimedia
PublisherAssociation for Computing Machinery, Inc
Pages799-808
Number of pages10
ISBN (Electronic)9781450392037
DOIs
StatePublished - 10 Oct 2022
Event30th ACM International Conference on Multimedia, MM 2022 - Lisboa, Portugal
Duration: 10 Oct 202214 Oct 2022

Publication series

NameMM 2022 - Proceedings of the 30th ACM International Conference on Multimedia

Conference

Conference30th ACM International Conference on Multimedia, MM 2022
Country/TerritoryPortugal
CityLisboa
Period10/10/2214/10/22

Keywords

  • 360
  • BVQA
  • head trajectory
  • images
  • transformer

Fingerprint

Dive into the research topics of 'TVFormer: Trajectory-guided Visual Quality Assessment on 360° Images with Transformers'. Together they form a unique fingerprint.

Cite this