Skip to main navigation Skip to search Skip to main content

Exploring the Usage of Pre-trained Features for Stereo Matching

  • Beihang University
  • RIKEN
  • The University of Tokyo
  • University of York

Research output: Contribution to journalArticlepeer-review

Abstract

For many vision tasks, utilizing pre-trained features results in improved performance and consistently benefits from the rapid advancement of pre-training technologies. However, in the field of stereo matching, the use of pre-trained features has not been extensively researched. In this paper, we present the first systematical exploration into the utilization of pre-trained features for stereo matching. To provide flexible employment for any combination of pre-trained backbones and stereo matching networks, we develop the deformable neck (DN) that decouples the network architectures of these two components. The core idea of DN is to utilize the deformable attention mechanism to iteratively fuse pre-trained features from shallow to deep layers. Empirically, our exploration reveals the crucial factors that influence using pre-trained features for stereo matching. We further investigate the role of instance-level information of pre-trained features, demonstrating it benefits stereo matching while can be suppressed during convolution-based feature fusion. Built on the attention mechanism, the proposed DN module effectively utilizes the instance-level information in pre-trained features. Besides, we provide an understanding of the efficiency-accuracy tradeoff, concluding that using pre-trained features can also be a good alternative with efficiency consideration.

Original languageEnglish
Pages (from-to)4305-4326
Number of pages22
JournalInternational Journal of Computer Vision
Volume132
Issue number10
DOIs
StatePublished - Oct 2024

Keywords

  • Feature adaptation
  • Network architecture
  • Stereo matching
  • Transfer learning
  • Vision pre-trained models

Fingerprint

Dive into the research topics of 'Exploring the Usage of Pre-trained Features for Stereo Matching'. Together they form a unique fingerprint.

Cite this