Skip to main navigation Skip to search Skip to main content

Assessing Action Quality via Attentive Spatio-Temporal Convolutional Networks

  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Action quality assessment, which aims at evaluating the performance of specific actions, has drawn more and more attention due to its extensive demand in sports, health care, etc. Unlike action recognition, in which a few typical frames are sufficient for classification, action quality assessment requires analysis at a fine temporal granularity to discover the subtle motion difference. In this paper, we propose a novel spatio-temporal framework for action quality assessment at full-frame-rate (25fps), which consists of two steps: i.e. spatio-temporal feature extraction and temporal feature fusion, respectively. In the first step, to generate representative spatio-temporal dynamics, we utilize a spatial convolutional network (SCN) together with specially designed temporal convolutional networks (TCNs) and train them by a two-stage strategy. In the second step, we introduce an attention mechanism to fuse features in the temporal dimension according to their impact on the overall performance. Compared with existing three dimensional convolutional neural networks (3D-CNN) based methods, our model is capable of capturing more action quality relevant details. As a by-product, our model can also attend to the highlight moments in sports videos, which gives a better interpretation of the score. Extensive experiments on three public benchmarks demonstrate that the proposed method has distinct advantage in action quality assessment and achieves improvement over the state-of-the-art.

Original languageEnglish
Title of host publicationPattern Recognition and Computer Vision - 3rd Chinese Conference, PRCV 2020, Proceedings
EditorsYuxin Peng, Hongbin Zha, Qingshan Liu, Huchuan Lu, Zhenan Sun, Chenglin Liu, Xilin Chen, Jian Yang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages3-16
Number of pages14
ISBN (Print)9783030606381
DOIs
StatePublished - 2020
Event3rd Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2020 - Nanjing, China
Duration: 16 Oct 202018 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12306 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference3rd Chinese Conference on Pattern Recognition and Computer Vision, PRCV 2020
Country/TerritoryChina
CityNanjing
Period16/10/2018/10/20

Keywords

  • Action quality assessment
  • Attentive fusion
  • Temporal convolution

Fingerprint

Dive into the research topics of 'Assessing Action Quality via Attentive Spatio-Temporal Convolutional Networks'. Together they form a unique fingerprint.

Cite this