Skip to main navigation Skip to search Skip to main content

Future instance prediction in hybrid spatial-Temporal birds-eye-view representation for vision-centric autonomous driving

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Understanding surrounding dynamics and predicting the future states of instance-level targets are crucial for ensuring safe navigation in autonomous driving. The traditional modular paradigm, which cascades multiple discrete modules through rule-based interfaces, is challenged by information loss and compounded errors. End-To-end motion prediction can directly access sequential sensory data, bypassing redundant intermediate components by utilizing a unified network. Given the advantages of low-cost resources and rich semantics provided by cameras, this work focuses on investigating vision-centric joint perception and prediction for instances of interest. Specifically, we proposed a hybrid spatial-Temporal pyramid module based on 2D convolution kernels and attention mechanisms within the Bird's-Eye View (BEV) plane, which hierarchically extracts global and local contextual information. Additionally, a feature-level motion excitation module is designed to capture and augment the representation of motion-Aware channels. Experiments on the large-scale nuScenes dataset demonstrated that the proposed method outperforms existing algorithms and more accurately predicts target movements over the next 2 seconds across two scene ranges with different resolutions.

Original languageEnglish
Title of host publicationIntelligent Transportation Engineering - Proceedings of the 9th International Conference, ICITE 2024
EditorsGuoqiang Mao
PublisherIOS Press BV
Pages360-370
Number of pages11
ISBN (Electronic)9781643686028
DOIs
StatePublished - 17 Jul 2025
Event9th International Conference on Intelligent Transportation Engineering, ICITE 2024 - Xi'an, China
Duration: 18 Oct 202420 Oct 2024

Publication series

NameAdvances in Transdisciplinary Engineering
Volume72
ISSN (Print)2352-751X
ISSN (Electronic)2352-7528

Conference

Conference9th International Conference on Intelligent Transportation Engineering, ICITE 2024
Country/TerritoryChina
CityXi'an
Period18/10/2420/10/24

Keywords

  • end-To-end learning
  • instance prediction
  • motion forecasting
  • vision-centric autonomous driving

Fingerprint

Dive into the research topics of 'Future instance prediction in hybrid spatial-Temporal birds-eye-view representation for vision-centric autonomous driving'. Together they form a unique fingerprint.

Cite this