Skip to main navigation Skip to search Skip to main content

stagNet: An attentive semantic RNN for group activity recognition

  • Mengshi Qi
  • , Jie Qin
  • , Annan Li
  • , Yunhong Wang*
  • , Jiebo Luo
  • , Luc Van Gool
  • *Corresponding author for this work
  • Beihang University
  • Swiss Federal Institute of Technology Zurich
  • Inception Institute of Artificial Intelligence
  • University of Rochester

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Group activity recognition plays a fundamental role in a variety of applications, e.g. sports video analysis and intelligent surveillance. How to model the spatio-temporal contextual information in a scene still remains a crucial yet challenging issue. We propose a novel attentive semantic recurrent neural network (RNN), dubbed as stagNet, for understanding group activities in videos, based on the spatio-temporal attention and semantic graph. A semantic graph is explicitly modeled to describe the spatial context of the whole scene, which is further integrated with the temporal factor via structural-RNN. Benefiting from the ‘factor sharing’ and ‘message passing’ mechanisms, our model is capable of extracting discriminative spatio-temporal features and capturing inter-group relationships. Moreover, we adopt a spatio-temporal attention model to attend to key persons/frames for improved performance. Two widely-used datasets are employed for performance evaluation, and the extensive results demonstrate the superiority of our method.

Original languageEnglish
Title of host publicationComputer Vision – ECCV 2018 - 15th European Conference, 2018, Proceedings
EditorsMartial Hebert, Vittorio Ferrari, Cristian Sminchisescu, Yair Weiss
PublisherSpringer Verlag
Pages104-120
Number of pages17
ISBN (Print)9783030012489
DOIs
StatePublished - 2018
Event15th European Conference on Computer Vision, ECCV 2018 - Munich, Germany
Duration: 8 Sep 201814 Sep 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11214 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference15th European Conference on Computer Vision, ECCV 2018
Country/TerritoryGermany
CityMunich
Period8/09/1814/09/18

Keywords

  • Group activity recognition
  • Scene understanding
  • Semantic graph
  • Spatio-temporal attention

Fingerprint

Dive into the research topics of 'stagNet: An attentive semantic RNN for group activity recognition'. Together they form a unique fingerprint.

Cite this