一种基于时空运动信息交互建模的三维人体姿态估计方法

Translated title of the contribution: A 3D human pose estimation approach based on spatio-temporal motion interaction modeling
  • Heng Lv
  • , Hongyu Yang*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

3D human pose estimation plays a crucial role in fields such as virtual reality and human-computer interaction. In recent years, the Transformer has been introduced into the domain of 3D human pose estimation to capture the spatiotemporal motion information of human joints. However, existing studies typically focus on the collective movement of joint clusters or exclusively model the movement of individual joints, without delving into the unique movement patterns of each joint and their interdependencies. Consequently, an innovative approach was proposed, which meticulously learnt the spatial information of 2D human joints in each frame and conducted an in-depth analysis of the specific movement patterns of each joint. Through the design of a motion information interaction module based on the Transformer encoder, the proposed method accurately captured the dynamic relationships between different joints. In comparison to existing models that directly learnt the overall motion of human joints, the proposed method enhanced prediction accuracy by approximately 3%. When benchmarked against the state-of-the-art MixSTE model, which primarily focused on individual joint movement, the proposed model demonstrated greater efficiency in capturing spatiotemporal features of joints, achieving an inference speed boost of over 20%, making it especially suitable for real-time inference scenarios.

Translated title of the contributionA 3D human pose estimation approach based on spatio-temporal motion interaction modeling
Original languageChinese (Traditional)
Pages (from-to)158-168
Number of pages11
JournalJournal of Graphics
Volume45
Issue number1
DOIs
StatePublished - 29 Feb 2024

Fingerprint

Dive into the research topics of 'A 3D human pose estimation approach based on spatio-temporal motion interaction modeling'. Together they form a unique fingerprint.

Cite this