Skip to main navigation Skip to search Skip to main content

Two-hand Pose Estimation from the non-cropped RGB Image with Self-Attention Based Network

  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Estimating the pose of two hands is a crucial problem for many human-computer interaction applications. Since most of the existing works utilize cropped images to predict the hand pose, they require a hand detection stage before pose estimation or input cropped images directly. In this paper, we propose the first real-time one-stage method for pose estimation from a single RGB image without hand tracking. Combining the self-attention mechanism with convolutional layers, the network we proposed is able to predict the 2.5D hand joints coordinate while locating the two hands regions. And to reduce the extra memory and computational consumption caused by self-attention, we proposed a linear attention structure with a spatial-reduction attention block called SRAN block. We demonstrate the effectiveness of each component in our network through the ablation study. And experiments on public datasets showed the competitive result with the state-of-the-art method.

Original languageEnglish
Title of host publicationProceedings - 2021 IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2021
EditorsMaud Marchal, Jonathan Ventura, Anne-Helene Olivier, Lili Wang, Rafael Radkowski
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages248-255
Number of pages8
ISBN (Electronic)9781665401586
DOIs
StatePublished - 2021
Event20th IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2021 - Virtual, Online, Italy
Duration: 4 Oct 20218 Oct 2021

Publication series

NameProceedings - 2021 IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2021

Conference

Conference20th IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2021
Country/TerritoryItaly
CityVirtual, Online
Period4/10/218/10/21

Keywords

  • Artificial intelligence
  • Computer version
  • Human computer interaction(HCI)
  • Interaction paradigms
  • Mixed / augmented reality
  • Pose estimation

Fingerprint

Dive into the research topics of 'Two-hand Pose Estimation from the non-cropped RGB Image with Self-Attention Based Network'. Together they form a unique fingerprint.

Cite this