Reconstructing part-level 3D models from a single image

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Understanding an image with 3D representations has been an increasingly attractive topic in computer vision. The state-of the-art 3D reconstruction methods usually focus on the reconstruction of the holistic object, while missing important part information, which is crucial in robotic interaction and virtual reality applications. To solve this issue, we make the first attempt to reconstruct the 3D models with part-level representations in a unified framework. With the input of the singleview images, we first develop a feature enhancement encoder to incorporate discriminative local features into the feature representation. The local features are selected adaptively by a learnable local awareness module. Then the enhanced local features are fused with the global branch to form the 3D representations. We then develop a 3D part generator to decode the image priors to 3D parts with a 3D focal loss, which enables the representations of small parts. Experimental results indicate that our model generates reliable part-level structures while achieving state-of-the-art performance in object-level recovering.

Original languageEnglish
Title of host publication2020 IEEE International Conference on Multimedia and Expo, ICME 2020
PublisherIEEE Computer Society
ISBN (Electronic)9781728113319
DOIs
StatePublished - Jul 2020
Event2020 IEEE International Conference on Multimedia and Expo, ICME 2020 - London, United Kingdom
Duration: 6 Jul 202010 Jul 2020

Publication series

NameProceedings - IEEE International Conference on Multimedia and Expo
Volume2020-July
ISSN (Print)1945-7871
ISSN (Electronic)1945-788X

Conference

Conference2020 IEEE International Conference on Multimedia and Expo, ICME 2020
Country/TerritoryUnited Kingdom
CityLondon
Period6/07/2010/07/20

Keywords

  • 3D reconstruction
  • Part-level
  • Single-view

Fingerprint

Dive into the research topics of 'Reconstructing part-level 3D models from a single image'. Together they form a unique fingerprint.

Cite this