跳到主要导航 跳到搜索 跳到主要内容

Diffusion model with temporal constraint for 3D human pose estimation

  • Zhangmeng Chen
  • , Ju Dai*
  • , Junjun Pan*
  • , Feng Zhou
  • *此作品的通讯作者
  • Beihang University
  • Peng Cheng Laboratory
  • North China University of Technology

科研成果: 期刊稿件文章同行评审

摘要

3D human pose estimation has received increasing attention as it is the foundation for many downstream tasks. However, this task is challenging due to inherent depth ambiguity and occlusion issues. Thanks to the ability of diffusion models to generate multiple hypotheses, they are promising in reducing uncertainty in results. Inspired by this, we propose a diffusion-based temporal constraint framework for 3D human pose estimation, called DTCPose, which generates multiple 3D candidate poses with 2D poses as conditions to synthesize the final pose to improve estimation accuracy. Simultaneously, to ensure the temporal stability of the 3D output sequences, we introduce temporal constraints into the model to reduce the jitter of the results. Extensive experiments on Human3.6M and MPI-INF3DHP datasets demonstrate that our approach performs predominantly in both single-hypothesis and multi-hypothesis 3D human pose estimation. Code will be available at: https://github.com/czmmmm/DCTPose.

源语言英语
页(从-至)5961-5977
页数17
期刊Visual Computer
41
8
DOI
出版状态已出版 - 6月 2025

指纹

探究 'Diffusion model with temporal constraint for 3D human pose estimation' 的科研主题。它们共同构成独一无二的指纹。

引用此