Skip to main navigation Skip to search Skip to main content

Saliency Prediction of Traffic Surveillance Videos: A Benchmark and A Multi-Task Approach

  • Beihang University
  • China Tower Corporation Limited

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Traffic surveillance videos are crucial for the devel-opment of intelligent transportation systems. The huge number of these videos has introduced significant challenges for storage and transmission, etc. Therefore, efficient and accurate video saliency prediction (VSP) can benefit a wide range of video processing techniques for traffic scenes, such as video compression, smart navigation, and traffic event detection. However, currently there are no VSP approaches or eye-tracking datasets dedicated to traffic surveillance videos. In this paper, we establish a large-scale eye-tracking dataset, dubbed traffic surveillance videos 1K (TSV1K). TSV1K contains 1000 high-quality traffic surveil-lance videos, with eye-tracking annotations from 30 subjects. Based on our dataset, we conduct thorough analysis on the correlations between human attention and traffic scenes, e.g., vehicle distribution and scene complexity. Accordingly, we pro-pose a multi-task traffic saliency prediction network (MTTS-Net), which leverages the task of traffic salient object detection (TSOD) to promote the performance of the VSP task. In order to better learn these tasks, a two-stage training strategy is developed to progressively train the MTTS- Net. Experimental results demonstrate that our proposed approach outperforms the state-of-the-art approaches in both tasks of TSOD and VSP on traffic surveillance videos. Our dataset and code are available on https://github.com/giteec/TSV1K.

Original languageEnglish
Title of host publication16th International Conference on Wireless Communications and Signal Processing, WCSP 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1355-1360
Number of pages6
ISBN (Electronic)9798350390643
DOIs
StatePublished - 2024
Event16th International Conference on Wireless Communications and Signal Processing, WCSP 2024 - Hefei, China
Duration: 24 Oct 202426 Oct 2024

Publication series

Name16th International Conference on Wireless Communications and Signal Processing, WCSP 2024

Conference

Conference16th International Conference on Wireless Communications and Signal Processing, WCSP 2024
Country/TerritoryChina
CityHefei
Period24/10/2426/10/24

Keywords

  • Multi - task learning
  • Saliency prediction
  • Traffic surveillance video

Fingerprint

Dive into the research topics of 'Saliency Prediction of Traffic Surveillance Videos: A Benchmark and A Multi-Task Approach'. Together they form a unique fingerprint.

Cite this