Skip to main navigation Skip to search Skip to main content

LMFSNet: A Lightweight Multi-Level Fusion Hand Gesture Segmentation Network for Human-Robot Interaction

  • Yang Li
  • , Jing Qi*
  • , Zhenchao Cui
  • , Kun Xu
  • , Xilun Ding
  • *Corresponding author for this work
  • Hebei University
  • Beihang University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Traditional gesture-based human-robot interaction relies on one-to-one gesture-command mapping, requiring numerous gestures and imposing high cognitive load. Existing networks are often computationally heavy, limiting real-time deployment on resource-constrained robots. To significantly reduce the required gestures and improve the intuitiveness of user experience, we develop the Spatial Semantic Mapping framework to change the gesture-based control paradigm by assigning commands based on the spatial position of the hand, establishing a flexible one-to-many mapping. To achieve an optimal balance between accuracy and computational efficiency, we propose a Lightweight Multi-level Fusion Segmentation Network (LMFSNet). Firstly, to reduce computational costs greatly, we propose a lightweight Residual Axial Group Convolution as the core operation of the model. Secondly, to maintain high performance in the lightweight network, we design two modules: Dynamic Adaptive Attention Block (DAAB) and Long-Short Distance Extraction (LSDE) block. Specifically, the DAAB dynamically reweights features to focus on important information, and the LSDE effectively captures and fuses multi-scale features. Experimental results show that the proposed LMFSNet achieves state-of-the-art accuracy while maintaining real-time speed and a compact model size.

Original languageEnglish
Title of host publicationProceedings - 2025 International Conference on Virtual Reality and Visualization, ICVRV 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages282-288
Number of pages7
ISBN (Electronic)9798331556297
DOIs
StatePublished - 2025
Event2025 International Conference on Virtual Reality and Visualization, ICVRV 2025 - Bogota, Colombia
Duration: 19 Dec 202521 Dec 2025

Publication series

NameProceedings - 2025 International Conference on Virtual Reality and Visualization, ICVRV 2025

Conference

Conference2025 International Conference on Virtual Reality and Visualization, ICVRV 2025
Country/TerritoryColombia
CityBogota
Period19/12/2521/12/25

Keywords

  • Hand Gesture Segmentation
  • Human-Robot Interaction
  • Lightweight Segmentation Network

Fingerprint

Dive into the research topics of 'LMFSNet: A Lightweight Multi-Level Fusion Hand Gesture Segmentation Network for Human-Robot Interaction'. Together they form a unique fingerprint.

Cite this