Skip to main navigation Skip to search Skip to main content

Towards spatial computing: recent advances in multimodal natural interaction for Extended Reality headsets

  • Zhi Min Wang
  • , Mao Hang Rao
  • , Shang Hua Ye
  • , Wei Tao Song
  • , Feng Lu*
  • *Corresponding author for this work
  • Beihang University
  • Beijing Institute of Technology

Research output: Contribution to journalReview articlepeer-review

Abstract

With the widespread adoption of Extended Reality (XR) headsets, spatial computing technologies are gaining increasing attention. Spatial computing enables interaction with virtual elements through natural input methods such as eye tracking, hand gestures, and voice commands, thus placing natural human-computer interaction at its core. While previous surveys have reviewed conventional XR interaction techniques, recent advancements in natural interaction, particularly driven by artificial intelligence (AI) and large language models (LLMs), have introduced new paradigms and technologies. In this paper, we review research on multimodal natural interaction for wearable XR, focusing on papers published since 2022 in six top venues: ACM CHI, UIST, IMWUT (Ubicomp), IEEE VR, ISMAR, and TVCG. We classify and analyze these studies based on application scenarios, operation types, and interaction modalities. This analysis provides a structured framework for understanding how researchers are designing advanced natural interaction techniques in XR. Based on these findings, we discuss the challenges in natural interaction techniques and suggest potential directions for future research. This review provides valuable insights for researchers aiming to design natural and efficient interaction systems for XR, ultimately contributing to the advancement of spatial computing.

Original languageEnglish
Article number1912708
JournalFrontiers of Computer Science
Volume19
Issue number12
DOIs
StatePublished - Dec 2025

Keywords

  • extended reality
  • eye
  • hand
  • multimodal
  • natural interaction
  • speech

Fingerprint

Dive into the research topics of 'Towards spatial computing: recent advances in multimodal natural interaction for Extended Reality headsets'. Together they form a unique fingerprint.

Cite this