摘要
Deep learning based camera localization from a single image has been explored recently since these methods are computationally efficient. However, existing methods only provide general global representations, from which an accurate pose estimation can not be reliably derived. We claim that effective feature representations for accurate pose estimation shall be both "informative" (focusing on geometrically meaningful regions) and "discriminative" (accounting for different poses of similar images). Therefore, we propose a novel multi-layer factorized bilinear pooling module for feature aggregation. Specifically, informative features are selected via bilinear pooling, and discriminative features are highlighted via multi-layer fusion. We develop a new network for camera localization using the proposed feature pooling module. The effectiveness of our approach is demonstrated by experiments on an outdoor Cambridge Landmarks dataset and an indoor 7 Scenes dataset. The results show that focusing on discriminative features significantly improves the network performance of camera localization in most cases. Codes will be available soon.
| 源语言 | 英语 |
|---|---|
| 出版状态 | 已出版 - 2020 |
| 活动 | 30th British Machine Vision Conference, BMVC 2019 - Cardiff, 英国 期限: 9 9月 2019 → 12 9月 2019 |
会议
| 会议 | 30th British Machine Vision Conference, BMVC 2019 |
|---|---|
| 国家/地区 | 英国 |
| 市 | Cardiff |
| 时期 | 9/09/19 → 12/09/19 |
指纹
探究 'Discriminative features matter: Multi-layer bilinear pooling for camera localization' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver