跳到主要导航 跳到搜索 跳到主要内容

Windowed bundle adjustment framework for unsupervised learning of monocular depth estimation with U-Net extension and clip loss

  • Lipu Zhou*
  • , Michael Kaess
  • *此作品的通讯作者
  • Carnegie Mellon University

科研成果: 期刊稿件文章同行评审

摘要

This letter presents a self-supervised framework for learning depth from monocular videos. In particular, the main contributions of this letter include: (1) We present a windowed bundle adjustment framework to train the network. Compared to most previous works that only consider constraints from consecutive frames, our framework increases the camera baseline and introduces more constraints to avoid overfitting. (2) We extend the widely used U-Net architecture by applying a Spatial Pyramid Net (SPN) and a Super Resolution Net (SRN). The SPN fuses information from an image spatial pyramid for the depth estimation, which addresses the context information attenuation problem of the original U-Net. The SRN learns to estimate a high resolution depth map from a low resolution image, which can benefit the recovery of details. (3) We adopt a clip loss function to handle moving objects and occlusions that were solved by designing complicated network or requiring extra information (such as segmentation mask [1]) in previous works. Experimental results show that our algorithm provides state-of-the-art results on the KITTI benchmark.

源语言英语
文章编号9013050
页(从-至)3283-3290
页数8
期刊IEEE Robotics and Automation Letters
5
2
DOI
出版状态已出版 - 4月 2020
已对外发布

指纹

探究 'Windowed bundle adjustment framework for unsupervised learning of monocular depth estimation with U-Net extension and clip loss' 的科研主题。它们共同构成独一无二的指纹。

引用此