跳到主要导航 跳到搜索 跳到主要内容

Self-Supervised Learning for Monocular Depth Estimation on Minimally Invasive Surgery Scenes

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Self-supervised learning algorithms that compute depth map from monocular videos have achieved remarkable performance on urban scenes and have been applied extensively. These techniques still face significant challenges, however, when applied directly to endoscopic videos because of the brightness variations from frame to frame and inadequate representation learning during the training phase. Inspired by the optical flow for motion alignment between adjacent frames, we design a AFNet with structural stability loss and residual-based smoothness loss to learn the appearance flow across adjacent frames, which handles the brightness inconsistency issue efficaciously. In addition, we propose a novel self-attention mechanism named feature scaling module to alleviate the inadequate representation learning problem. In a comparison study to the current state-of-the-art self-supervised methods explored for urban videos on the SCARED dataset, the developed model surpasses existing methods by a large margin.

源语言英语
主期刊名2021 IEEE International Conference on Robotics and Automation, ICRA 2021
出版商Institute of Electrical and Electronics Engineers Inc.
7159-7165
页数7
ISBN(电子版)9781728190778
DOI
出版状态已出版 - 2021
活动2021 IEEE International Conference on Robotics and Automation, ICRA 2021 - Xi'an, 中国
期限: 30 5月 20215 6月 2021

出版系列

姓名Proceedings - IEEE International Conference on Robotics and Automation
2021-May
ISSN(印刷版)1050-4729

会议

会议2021 IEEE International Conference on Robotics and Automation, ICRA 2021
国家/地区中国
Xi'an
时期30/05/215/06/21

指纹

探究 'Self-Supervised Learning for Monocular Depth Estimation on Minimally Invasive Surgery Scenes' 的科研主题。它们共同构成独一无二的指纹。

引用此