Revisiting Domain Generalized Stereo Matching Networks from a Feature Consistency Perspective

  • Jiawei Zhang
  • Xiang Wang
  • Xiao Bai*
  • Chen Wang
  • Lei Huang
  • Yimin Chen
  • Lin Gu
  • Jun Zhou
  • Tatsuya Harada
  • Edwin R. Hancock

*Corresponding author for this work

  • Beihang University
  • RIKEN
  • The University of Tokyo
  • Griffith University Queensland
  • University of York

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

Abstract

Despite recent stereo matching networks achieving impressive performance given sufficient training data, they suffer from domain shifts and generalize poorly to unseen domains. We argue that maintaining feature consistency between matching pixels is a vital factor for promoting the generalization capability of stereo matching networks, which has not been adequately considered. Here we address this issue by proposing a simple pixel-wise contrastive learning across the viewpoints. The stereo contrastive feature loss function explicitly constrains the consistency between learned features of matching pixel pairs which are observations of the same 3D points. A stereo selective whitening loss is further introduced to better preserve the stereo feature consistency across domains, which decorrelates stereo features from stereo viewpoint-specific style information. Counter-intuitively, the generalization of feature consistency between two viewpoints in the same scene translates to the generalization of stereo matching performance to unseen domains. Our method is generic in nature as it can be easily embedded into existing stereo networks and does not require access to the samples in the target domain. When trained on synthetic data and generalized to four real-world testing sets, our method achieves superior performance over several state-of-the-art networks. The code is available online at https://github.com/jiaw-z/FCStereo.
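
As a rough illustration of the pixel-wise contrastive idea described in the abstract, the following sketch computes an InfoNCE-style loss over a single rectified scanline. It is not the authors' implementation (see their repository for that). The function name `stereo_contrastive_loss`, the temperature value of 0.07, and the choice of using every other right-view pixel on the scanline as a negative are all assumptions made for this example.

```python
import math


def normalize(v):
    """Scale a feature vector to unit L2 norm (zero vectors are returned unchanged)."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v] if n > 0 else list(v)


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def stereo_contrastive_loss(left_row, right_row, disparity, temperature=0.07):
    """InfoNCE-style loss over one rectified scanline (hypothetical helper).

    left_row, right_row: lists of feature vectors, one per pixel.
    disparity: ground-truth disparity (in pixels) for each left pixel.
    For left pixel x the positive is right pixel x - d; all other right
    pixels on the scanline act as negatives (an assumption of this sketch).
    """
    left = [normalize(f) for f in left_row]
    right = [normalize(f) for f in right_row]
    losses = []
    for x, d in enumerate(disparity):
        xr = x - int(round(d))
        if xr < 0 or xr >= len(right):
            continue  # the match falls outside the right view; skip this pixel
        logits = [dot(left[x], r) / temperature for r in right]
        m = max(logits)  # subtract the max for a numerically stable log-sum-exp
        log_denominator = m + math.log(sum(math.exp(l - m) for l in logits))
        losses.append(log_denominator - logits[xr])
    return sum(losses) / len(losses) if losses else 0.0


# Toy scanline: four pixels with 2-D features and a constant disparity of 1.
right_row = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.0, -1.0]]
disparity = [1, 1, 1, 1]
# Left features that agree with their matching right pixels ...
consistent_left = [[0.5, 0.5], [1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
# ... and left features that point away from their matches.
inconsistent_left = [[0.5, 0.5], [-1.0, 0.0], [0.0, -1.0], [1.0, 0.0]]

loss_consistent = stereo_contrastive_loss(consistent_left, right_row, disparity)
loss_inconsistent = stereo_contrastive_loss(inconsistent_left, right_row, disparity)
print(loss_consistent, loss_inconsistent)
```

In this toy case the loss is much lower when the features of matching pixels agree across the two views, which is the behaviour a feature-consistency constraint is meant to reward.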

Original language: English
Title of host publication: Proceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
Publisher: IEEE Computer Society
Pages: 12991-13001
Number of pages: 11
ISBN (Electronic): 9781665469463
State: Published - 2022
Event: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022 - New Orleans, United States
Duration: 19 Jun 2022 to 24 Jun 2022

Publication series

Name: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume: 2022-June
ISSN (Print): 1063-6919

Conference

Conference: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
Country/Territory: United States
City: New Orleans
Period: 19/06/22 to 24/06/22

Keywords

  • 3D from multi-view and sensors
  • Navigation and autonomous driving
