跳到主要导航 跳到搜索 跳到主要内容

∗ 面向复杂工业场景的人体姿态估计性能增强方法

  • Li Fanya
  • , Zhang Zehui*
  • , Chen Boyang
  • , Xu Xiaobin
  • , Guan Cong
  • *此作品的通讯作者
  • Hangzhou Dianzi University
  • Ningxia Petrochemical Yinjun Safety Technology Consulting Co.,Ltd
  • Wuhan University of Technology

科研成果: 期刊稿件文章同行评审

摘要

Human pose estimation is one of the important supporting technologies for Industrial Manufacturing 5. 0, which has already been applied in various scenarios such as action recognition, human-computer interaction, and digital twin. However, in complex industrial scenes, objects such as notice boards, pipes, and columns can easily cause local or global occlusions for workers, leading to errors in joint points localization by human pose estimation models and a decrease in the performance of the human pose estimation model. To address this problem, this article proposes a human pose estimation performance enhancement method for complex industrial scenes, which firstly structurally models the key points of the human body based on VQ-VAE model, mapping joint features to a quantized latent space to improve the accuracy of human pose estimation when occlusion occurred. Then, to address the problem of insufficient worker occlusion dataset, a dynamic data augmentation and training method is innovatively proposed. In the process of model training, industrial scene-specific worker occlusion images are generated dynamically using real industrial scene occlusion objects by evaluating the human pose estimation results of the model for the next model training, further enhancing the model′s robustness in human pose estimation tasks. The experimental results show that the method proposed in this article achieves an average precision ( AP ) improvement of 3. 8% and an average recall ( AR) improvement of 2. 7% over the advanced method PCT on the self-constructed dataset and is able to effectively cope with the human occlusion problem in complex industrial scenes.

投稿的翻译标题Human pose estimation performance enhancement method for complex industrial scenes
源语言繁体中文
页(从-至)255-265
页数11
期刊Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument
46
8
DOI
出版状态已出版 - 2025
已对外发布

关键词

  • computer vision
  • data augmentation
  • human pose estimation
  • industrial occlusion

指纹

探究 '∗ 面向复杂工业场景的人体姿态估计性能增强方法' 的科研主题。它们共同构成独一无二的指纹。

引用此