TY - GEN
T1 - Cuboids detection in RGB-D images via Maximum Weighted Clique
AU - Zhang, Han
AU - Chen, Xiaowu
AU - Zhang, Yu
AU - Li, Jia
AU - Li, Qing
AU - Wang, Xiaogang
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2015/8/4
Y1 - 2015/8/4
N2 - Cuboid detection is an essential step for understanding 3D structure of scenes. As most of indoor scene cuboids are actually objects, we propose in this paper an object-based approach to detect 3D cuboids in indoor RGB-D images. The proposed approach is learning-free and can handle general object classes rather than a limited pre-defined category set. In our approach, we first apply an extended version of the CPMC framework to generate a set of segment hypotheses, and fit a set of cuboid candidates. Given the candidate set, we select several cuboids that can provide plausible interpretations of the images by solving a Maximum Weighted Clique (MWC) problem. With this formulation, a set of ranked mid-level representations of the input image is obtained, and are further re-ranked by Maximal Marginal Relevance (MMR) measure to improve their diversity. Experimental results on NYU-V2 dataset shows that our method significantly outperforms the state-of-the-art, and shows impressive results.
AB - Cuboid detection is an essential step for understanding 3D structure of scenes. As most of indoor scene cuboids are actually objects, we propose in this paper an object-based approach to detect 3D cuboids in indoor RGB-D images. The proposed approach is learning-free and can handle general object classes rather than a limited pre-defined category set. In our approach, we first apply an extended version of the CPMC framework to generate a set of segment hypotheses, and fit a set of cuboid candidates. Given the candidate set, we select several cuboids that can provide plausible interpretations of the images by solving a Maximum Weighted Clique (MWC) problem. With this formulation, a set of ranked mid-level representations of the input image is obtained, and are further re-ranked by Maximal Marginal Relevance (MMR) measure to improve their diversity. Experimental results on NYU-V2 dataset shows that our method significantly outperforms the state-of-the-art, and shows impressive results.
KW - Cuboid detection
KW - depth image
KW - maximum weighted clique
KW - scene understanding
UR - https://www.scopus.com/pages/publications/84946072298
U2 - 10.1109/ICME.2015.7177405
DO - 10.1109/ICME.2015.7177405
M3 - 会议稿件
AN - SCOPUS:84946072298
T3 - Proceedings - IEEE International Conference on Multimedia and Expo
BT - 2015 IEEE International Conference on Multimedia and Expo, ICME 2015
PB - IEEE Computer Society
T2 - IEEE International Conference on Multimedia and Expo, ICME 2015
Y2 - 29 June 2015 through 3 July 2015
ER -