MT-VQA: A Multi-task Approach for Quality Assessment of Short-form Videos

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Short-form video, as a mainstream media form on video platforms, has undergone explosive growth in recent years. A vast number of short-form videos are produced, processed, and distributed to users each day, inevitably leading to quality degradation. Therefore, accurate video quality assessment (VQA) is critical for monitoring and optimizing the viewing experience of users. However, the existing short-form VQA approaches neglect human attention patterns during the viewing of videos. Besides, the advancement of short-form VQA is obstructed by the absence of large-scale datasets. To tackle the above challenges, we first construct a large-scale short-form VQA dataset called SVQA. The SVQA dataset comprises diverse distortion types, covering the typical quality degradations that arise during the photography, encoding, and editing of short-form videos. Besides, for each short-form video in SVQA, we collect both quality score and eye-tracking annotation. Based on our dataset, we propose a two-branch multi-task VQA approach, MT-VQA, in which both tasks of VQA and video saliency prediction (VSP) can be accomplished for short-form videos. We further propose a saliency fusion module to guide the VQA branch to focus on quality distortions within visually attractive regions. Extensive experiments show that our multi-task approach achieves superior performance in both VQA and VSP tasks.

Original languageEnglish
Title of host publicationQoEVMA 2024 - Proceedings of the 3rd Workshop on Quality of Experience in Visual Multimedia Applications
PublisherAssociation for Computing Machinery, Inc
Pages30-38
Number of pages9
ISBN (Electronic)9798400712043
DOIs
StatePublished - 28 Oct 2024
Event3rd Workshop on Quality of Experience in Visual Multimedia Applications, QoEVMA 2024 - Melbourne, Australia
Duration: 28 Oct 20241 Nov 2024

Publication series

NameQoEVMA 2024 - Proceedings of the 3rd Workshop on Quality of Experience in Visual Multimedia Applications

Conference

Conference3rd Workshop on Quality of Experience in Visual Multimedia Applications, QoEVMA 2024
Country/TerritoryAustralia
CityMelbourne
Period28/10/241/11/24

Keywords

  • human attention
  • short-form video
  • video quality assessment

Fingerprint

Dive into the research topics of 'MT-VQA: A Multi-task Approach for Quality Assessment of Short-form Videos'. Together they form a unique fingerprint.

Cite this