Abstract
Place recognition plays a crucial role in the fields of robotics and computer vision, finding applications in areas such as autonomous driving, mapping, and localization. Place recognition identifies a place using query sensor data and a known database. One of the main challenges is to develop a model that can deliver accurate results while being robust to environmental variations. We propose two multi-modal place recognition models, namely PRFusion and PRFusion++. PRFusion utilizes global fusion with manifold metric attention, enabling effective interaction between features without requiring camera-LiDAR extrinsic calibrations. In contrast, PRFusion++ assumes the availability of extrinsic calibrations and leverages pixel-point correspondences to enhance feature learning on local windows. Additionally, both models incorporate neural diffusion layers, which enable reliable operation even in challenging environments. We verify the state-of-the-art performance of both models on three large-scale benchmarks. Notably, they outperform existing models by a substantial margin of +3.0 AR@1 on the demanding Boreas dataset. Furthermore, we conduct ablation studies to validate the effectiveness of our proposed methods.
| Original language | English |
|---|---|
| Pages (from-to) | 20523-20534 |
| Number of pages | 12 |
| Journal | IEEE Transactions on Intelligent Transportation Systems |
| Volume | 25 |
| Issue number | 12 |
| DOIs | |
| State | Published - 2024 |
| Externally published | Yes |
Keywords
- manifold metric attention
- multi-modal fusion
- neural diffusion
- Place recognition
Fingerprint
Dive into the research topics of 'PRFusion: Toward Effective and Robust Multi-Modal Place Recognition With Image and Point Cloud Fusion'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver