跳到主要导航 跳到搜索 跳到主要内容

MASGC: Hybrid attention and synchronous graph learning for monocular 3D pose estimation

  • Shengjie Li
  • , Jin Wang*
  • , Jianwei Niu
  • , Yuanhang Wang
  • , Haiyun Zhang
  • , Guodong Lu
  • , Jingru Yang
  • , Xiaolong Yu
  • , Renluan Hou
  • *此作品的通讯作者

科研成果: 期刊稿件文章同行评审

摘要

Occlusion and depth ambiguity pose significant challenges to the accuracy of monocular 3D human pose estimation. To tackle these issues, this study presents a two-stage pose estimation method based on Multi-Attention and Synchronous-Graph-Convolution (MASGC). In the first stage (2D pose estimation), a feature pyramid convolutional attention (FPCA) module is designed based on a multiresolution feature pyramid (MFP) and a convolutional attention triplet (CAT), which integrates channel, coordinate, and spatial attention, enabling the model to focus on the most salient features and mitigate location information loss caused by global pooling, thereby improving estimation accuracy. In the second stage (lifting to 3D), a temporal synchronous graph convolutional network (TSGCN) is designed. By incorporating multi-head attention and expanding the receptive field of end keypoints through topological temporal convolutions, TSGCN effectively addresses the challenges of occlusion and depth ambiguity in monocular 3D human pose estimation. Experimental results show that MASGC outperforms the compared baseline methods on benchmark datasets, including Human3.6 M and a custom dual-arm dataset, while reducing computational complexity compared to mainstream models. The code is available at https://github.com/JasonLi-30/MASGC.

源语言英语
文章编号103284
期刊Displays
92
DOI
出版状态已出版 - 4月 2026

指纹

探究 'MASGC: Hybrid attention and synchronous graph learning for monocular 3D pose estimation' 的科研主题。它们共同构成独一无二的指纹。

引用此