TY - GEN
T1 - A Novel Patch Selection Approach for Quality Assessment on Short-Form Videos
AU - Wen, Shijie
AU - Jiang, Lai
AU - Qiao, Minglang
AU - Xu, Mai
AU - Deng, Xin
AU - Li, Shengxi
AU - Duan, Yiping
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Video quality assessment (VQA) on short-form videos plays a critical role in optimizing online multimedia systems. Traditional VQA approaches have made significant progress with the widespread applications of deep learning. However, the quality assessment of short-form videos has re-ceived insufficient research attention with limited performance, neglecting the unique perceptual characteristics of human over short-form videos, e.g., large transfer distance and scattered distribution of fixations. In this paper, we propose the patch selection VQA (PS-VQA) approach to promote this field by incorporating human attention mechanisms during short-form video viewing. In our PS-VQA approach, we first introduce a patch selection network (PS-Net) to identify the patches most relevant to perceived video quality in short-form videos. Subsequently, we develop a quality assessment network (QA-Net) to extract both local and global quality-aware features from the selected patches to predict the overall video quality. Extensive experimental results validate the effectiveness of the proposed VQA approach on short-form videos. Interestingly, our ablation experiments show that accurately assessing the quality of a short-form video merely requires 5 % pixels of the whole frame.
AB - Video quality assessment (VQA) on short-form videos plays a critical role in optimizing online multimedia systems. Traditional VQA approaches have made significant progress with the widespread applications of deep learning. However, the quality assessment of short-form videos has re-ceived insufficient research attention with limited performance, neglecting the unique perceptual characteristics of human over short-form videos, e.g., large transfer distance and scattered distribution of fixations. In this paper, we propose the patch selection VQA (PS-VQA) approach to promote this field by incorporating human attention mechanisms during short-form video viewing. In our PS-VQA approach, we first introduce a patch selection network (PS-Net) to identify the patches most relevant to perceived video quality in short-form videos. Subsequently, we develop a quality assessment network (QA-Net) to extract both local and global quality-aware features from the selected patches to predict the overall video quality. Extensive experimental results validate the effectiveness of the proposed VQA approach on short-form videos. Interestingly, our ablation experiments show that accurately assessing the quality of a short-form video merely requires 5 % pixels of the whole frame.
KW - Short-form video
KW - human attention
KW - video quality assessment
UR - https://www.scopus.com/pages/publications/85217522035
U2 - 10.1109/WCSP62071.2024.10826890
DO - 10.1109/WCSP62071.2024.10826890
M3 - 会议稿件
AN - SCOPUS:85217522035
T3 - 16th International Conference on Wireless Communications and Signal Processing, WCSP 2024
SP - 1361
EP - 1367
BT - 16th International Conference on Wireless Communications and Signal Processing, WCSP 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 16th International Conference on Wireless Communications and Signal Processing, WCSP 2024
Y2 - 24 October 2024 through 26 October 2024
ER -