面向单幅图像的高质量深度估计方法

Translated title of the contribution: A High-Quality Depth Estimation Method for Single Image
  • Yongtang Bao
  • , Shuai Yan
  • , Yue Qi*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Depth estimation from a single image is critical in robot navigation and scene understanding. It is also a complex problem in computer vision. Aiming at the inaccurate depth estimation of a single image, we propose a single-image depth estimation method based on ViT. First, we downsample the image by the pre-trained DenseNet and encode the features into sequences suitable for ViT. Then, the densely connected ViT processes the global context information, and the feature sequence is reassembled into high-dimensional feature maps. Finally, Upsampling to obtain a complete depth image. We conduct comparative experiments with some reprsentative depth estimation methods on the NYU V2 dataset, and ablation experiments on the network structure. This paper quantitatively analyzes the average relative error, root means square error, and other errors. The results show that the method can generate high-quality depth images with rich details for a single image. Compared with the traditional encoder-decoder method, the PSNR value of the proposed method is increased by 1.052 dB on average, the REL is decreased by 7.7%–21.8%, and the RMS is reduced by 5.6%–16.9%.

Translated title of the contributionA High-Quality Depth Estimation Method for Single Image
Original languageChinese (Traditional)
Pages (from-to)1761-1770
Number of pages10
JournalJisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics
Volume36
Issue number11
DOIs
StatePublished - 2024

Fingerprint

Dive into the research topics of 'A High-Quality Depth Estimation Method for Single Image'. Together they form a unique fingerprint.

Cite this