Skip to main navigation Skip to search Skip to main content

A generic framework for event detection in various video domains

  • Tianzhu Zhang*
  • , Changsheng Xu
  • , Guangyu Zhu
  • , Si Liu
  • , Hanqing Lu
  • *Corresponding author for this work
  • CAS - Institute of Automation
  • China-Singapore Institute of Digital Media
  • National University of Singapore

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Event detection is essential for the extensively studied video analysis and understanding area. Although various approaches have been proposed for event detection, there is a lack of a generic event detection framework that can be applied to various video domains (e.g. sports, news, movies, surveillance). In this paper, we present a generic event detection approach based on semi-supervised learning and Internet vision. Concretely, a Graph-based Semi-Supervised Multiple Instance Learning (GSSMIL) algorithm is proposed to jointly explore small-scale expert labeled videos and large-scale unlabeled videos to train the event models to detect video event boundaries. The expert labeled videos are obtained from the analysis and alignment of well-structured video related text (e.g. movie scripts, web-casting text, close caption). The unlabeled data are obtained by querying related events from the video search engine (e.g. YouTube) in order to give more distributive information for event modeling. A critical issue of GSSMIL in constructing a graph is the weight assignment, where the weight of an edge specifies the similarity between two data points. To tackle this problem, we propose a novel Multiple Instance Learning Induced Similarity (MILIS) measure by learning instance sensitive classifiers. We perform the thorough experiments in three popular video domains: movies, sports and news. The results compared with the state-of-the-arts are promising and demonstrate our proposed approach is performance-effective.

Original languageEnglish
Title of host publicationMM'10 - Proceedings of the ACM Multimedia 2010 International Conference
Pages103-112
Number of pages10
DOIs
StatePublished - 2010
Externally publishedYes
Event18th ACM International Conference on Multimedia ACM Multimedia 2010, MM'10 - Firenze, Italy
Duration: 25 Oct 201029 Oct 2010

Publication series

NameMM'10 - Proceedings of the ACM Multimedia 2010 International Conference

Conference

Conference18th ACM International Conference on Multimedia ACM Multimedia 2010, MM'10
Country/TerritoryItaly
CityFirenze
Period25/10/1029/10/10

Keywords

  • broadcast video
  • event detection
  • internet
  • multiple instance learning
  • semi-supervised learning
  • web-casting text

Fingerprint

Dive into the research topics of 'A generic framework for event detection in various video domains'. Together they form a unique fingerprint.

Cite this