跳到主要导航 跳到搜索 跳到主要内容

Learnable patchmatch and self-teaching for multi-frame depth estimation in monocular endoscopy

  • Shandong University
  • Beihang University

科研成果: 期刊稿件文章同行评审

摘要

This work delves into unsupervised monocular depth estimation in endoscopy, which leverages adjacent frames to establish supervisory signals during the training phase. For many clinical applications, e.g., surgical navigation, temporally correlated frames are also available at test time. However, most existing monocular methods struggle to make effective use of temporal information during both training and inference, primarily due to the inherent challenges of endoscopic imagery, including low- or homogeneous-texture regions and brightness fluctuations between frames. To fully exploit the temporal information in endoscopic scenes, we propose a novel unsupervised multi-frame monocular depth estimation model. The proposed model integrates a learnable patchmatch module to adaptively increase the discriminative ability in regions with low or homogeneous textures, and enforces cross-teaching and self-teaching consistencies to provide efficacious regularizations towards brightness fluctuations. Furthermore, as a byproduct of the self-teaching paradigm, the proposed model is able to improve the depth predictions when more frames are input at test time. We conduct detailed experiments on multiple datasets, and the experimental results indicate that the proposed method exceeds prior state-of-the-art competitors. The source code and trained models will be publicly available at https://github.com/ShuweiShao/FrameDepth.

源语言英语
文章编号112463
期刊Engineering Applications of Artificial Intelligence
162
DOI
出版状态已出版 - 20 12月 2025

指纹

探究 'Learnable patchmatch and self-teaching for multi-frame depth estimation in monocular endoscopy' 的科研主题。它们共同构成独一无二的指纹。

引用此