Skip to main navigation Skip to search Skip to main content

RSVP: A real-time surveillance video parsing system with single frame supervision

  • Han Yu
  • , Guanghui Ren
  • , Ruihe Qian
  • , Yao Sun
  • , Changhu Wang
  • , Hanqing Lu
  • , Si Liu*
  • *Corresponding author for this work
  • CAS - Institute of Information Engineering
  • Toutiao AI Lab.
  • Chinese Academy of Sciences

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this demo, we present a real-time surveillance video parsing (RSVP) system to parse surveillance videos. Surveillance video parsing, which aims to segment the video frames into several labels, e.g., face, pants, left-legs, has wide applications, especially in security filed. However, it is very tedious and time-consuming to annotate all the frames in a video. We design a RSVP system to parse the surveillance videos in real-time. The RSVP system requires only one labeled frame in training stage. The RSVP system jointly considers the segmentation of preceding frames when parsing one particular frame within the video. The RSVP system is proved to be effective and efficient in real applications.

Original languageEnglish
Title of host publicationMM 2017 - Proceedings of the 2017 ACM Multimedia Conference
PublisherAssociation for Computing Machinery, Inc
Pages1257-1258
Number of pages2
ISBN (Electronic)9781450349062
DOIs
StatePublished - 23 Oct 2017
Externally publishedYes
Event25th ACM International Conference on Multimedia, MM 2017 - Mountain View, United States
Duration: 23 Oct 201727 Oct 2017

Publication series

NameMM 2017 - Proceedings of the 2017 ACM Multimedia Conference

Conference

Conference25th ACM International Conference on Multimedia, MM 2017
Country/TerritoryUnited States
CityMountain View
Period23/10/1727/10/17

Keywords

  • Deep learning
  • Human parsing
  • Surveillance video parsing system

Fingerprint

Dive into the research topics of 'RSVP: A real-time surveillance video parsing system with single frame supervision'. Together they form a unique fingerprint.

Cite this