跳到主要导航 跳到搜索 跳到主要内容

Two-hand Pose Estimation from the non-cropped RGB Image with Self-Attention Based Network

  • Beihang University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Estimating the pose of two hands is a crucial problem for many human-computer interaction applications. Since most of the existing works utilize cropped images to predict the hand pose, they require a hand detection stage before pose estimation or input cropped images directly. In this paper, we propose the first real-time one-stage method for pose estimation from a single RGB image without hand tracking. Combining the self-attention mechanism with convolutional layers, the network we proposed is able to predict the 2.5D hand joints coordinate while locating the two hands regions. And to reduce the extra memory and computational consumption caused by self-attention, we proposed a linear attention structure with a spatial-reduction attention block called SRAN block. We demonstrate the effectiveness of each component in our network through the ablation study. And experiments on public datasets showed the competitive result with the state-of-the-art method.

源语言英语
主期刊名Proceedings - 2021 IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2021
编辑Maud Marchal, Jonathan Ventura, Anne-Helene Olivier, Lili Wang, Rafael Radkowski
出版商Institute of Electrical and Electronics Engineers Inc.
248-255
页数8
ISBN(电子版)9781665401586
DOI
出版状态已出版 - 2021
活动20th IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2021 - Virtual, Online, 意大利
期限: 4 10月 20218 10月 2021

出版系列

姓名Proceedings - 2021 IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2021

会议

会议20th IEEE International Symposium on Mixed and Augmented Reality, ISMAR 2021
国家/地区意大利
Virtual, Online
时期4/10/218/10/21

指纹

探究 'Two-hand Pose Estimation from the non-cropped RGB Image with Self-Attention Based Network' 的科研主题。它们共同构成独一无二的指纹。

引用此