Skip to main navigation Skip to search Skip to main content

Joint depth estimation and multi-model SLAM for robust perception in structure-degraded environments

  • Weipeng Wang
  • , Wenxuan Ji
  • , Jin Xiao*
  • , Xiaoguang Hu
  • , Zichong Jia
  • , Jiaqi Shi
  • *Corresponding author for this work
  • Beihang University

Research output: Contribution to journalArticlepeer-review

Abstract

Robust spatial perception is essential for SLAM in robotics and autonomous systems, but existing pipelines often fail in structure-deficient scenes when relying on a single modality or decoupling depth estimation from SLAM. We present a joint depth-enhanced, multi-model SLAM system tailored for such scenarios with three core contributions: First, we propose a multi-model depth fusion framework (MDFF) that fuses visual, LiDAR, inertial, and learned depth cues; Second, we design a dense scan-to-map module (DSM) within the LiDAR–Inertial Subsystem (LIS) that eliminates handcrafted features; Third, we develop a depth-aware backend optimization (DBO) that jointly refines poses, landmarks, and scale using multi and single-view depth constraints. The system targets high-throughput computing, with embarrassingly parallel per-point residuals and GPU-ready depth inference. Experiments show that DSM reduces LiDAR-inertial processing time versus LVI-SAM while the full pipeline runs in real time (21.5 FPS LiDAR, 28.6 FPS camera) and delivers higher localization accuracy than representative baselines.

Original languageEnglish
Article number306
JournalJournal of Supercomputing
Volume82
Issue number5
DOIs
StatePublished - Apr 2026

Keywords

  • Depth estimation
  • Multi-model
  • SLAM
  • Structure-degraded environments

Fingerprint

Dive into the research topics of 'Joint depth estimation and multi-model SLAM for robust perception in structure-degraded environments'. Together they form a unique fingerprint.

Cite this