Skip to main navigation Skip to search Skip to main content

SPSC: Sparse and Scalable Multi-Modal 3D Occupancy Prediction for Autonomous Driving

  • Qingju Guo
  • , Shuang Li*
  • , Binhui Xie
  • , Jing Geng*
  • , Wei Li
  • *Corresponding author for this work
  • Beijing Institute of Technology
  • Nanjing University

Research output: Contribution to journalConference articlepeer-review

Abstract

3D semantic occupancy prediction offers a nuanced representation of the surrounding environment, which is crucial for ensuring the safety of autonomous driving. However, fine-grained scene representations inevitably result in cubic growth in data scale, which imposes substantial demands on model architecture and computational complexity, especially in high-resolution scenarios. Existing approaches for handling high-resolution scenes typically obtain fine-grained features by grid sampling on low-resolution feature map, resulting in limited sparsity and insufficient feature interaction. This paper presents a framework leveraging SParse representation and SCalable feature interaction to address the aforementioned challenges, called SPSC. Specifically, we maintain sparsity by progressively pruning unoccupied queries during the coarse-to-fine process, thereby reducing the scale of data that the model needs to handle. Subsequently, we introduce query serialization, which transforms queries into an ordered sequence while preserving their spatial structure. This enables fine-grained feature interaction while maintaining linear computational complexity and a larger receptive field. Without complex architectural designs, SPSC significantly outperforms SOTA approaches, enhances the mIoU by 12.0%, 11.0% and 4.8% on nuScenes-Occupancy dataset under the muli-modal, LiDAR and camera settings, respectively.

Original languageEnglish
Pages (from-to)4430-4438
Number of pages9
JournalProceedings of the AAAI Conference on Artificial Intelligence
Volume40
Issue number6
DOIs
StatePublished - 2026
Event40th AAAI Conference on Artificial Intelligence, AAAI 2026 - Singapore, Singapore
Duration: 20 Jan 202627 Jan 2026

Fingerprint

Dive into the research topics of 'SPSC: Sparse and Scalable Multi-Modal 3D Occupancy Prediction for Autonomous Driving'. Together they form a unique fingerprint.

Cite this