TY - GEN
T1 - Semantic segmentation based on aggregated features and contextual information
AU - Zheng, Chuanxia
AU - Wang, Jianhua
AU - Chen, Weihai
AU - Wu, Xingming
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016
Y1 - 2016
N2 - In this paper, a novel semantic segmentation model based on aggregated features and contextual information is proposed. Given an RGB-D image, we train a support vector machine (SVM) to predict initial labels using aggregated features, and then optimize the predicted results using contextual information. For aggregated features, the local features on regions are extracted to capture visual appearance of object, and the global features are exploited to represent scene information such that the proposed model can utilize more discriminative features. For contextual information, a novel multi-label conditional random field (CRF) model is constructed to jointly optimize the initial semantic and attribute predicted results. The experimental results on the public NYU v2 dataset demonstrate the proposed model outperforms the existing state-of-the-art methods on a challenging 40 classes task, yielding a higher mean IU accuracy of 33.7% and pixel average accuracy of 64.1%. Especially, the prediction accuracy of 'small' classes has been improved significantly.
AB - In this paper, a novel semantic segmentation model based on aggregated features and contextual information is proposed. Given an RGB-D image, we train a support vector machine (SVM) to predict initial labels using aggregated features, and then optimize the predicted results using contextual information. For aggregated features, the local features on regions are extracted to capture visual appearance of object, and the global features are exploited to represent scene information such that the proposed model can utilize more discriminative features. For contextual information, a novel multi-label conditional random field (CRF) model is constructed to jointly optimize the initial semantic and attribute predicted results. The experimental results on the public NYU v2 dataset demonstrate the proposed model outperforms the existing state-of-the-art methods on a challenging 40 classes task, yielding a higher mean IU accuracy of 33.7% and pixel average accuracy of 64.1%. Especially, the prediction accuracy of 'small' classes has been improved significantly.
UR - https://www.scopus.com/pages/publications/85016807066
U2 - 10.1109/ROBIO.2016.7866432
DO - 10.1109/ROBIO.2016.7866432
M3 - 会议稿件
AN - SCOPUS:85016807066
T3 - 2016 IEEE International Conference on Robotics and Biomimetics, ROBIO 2016
SP - 862
EP - 867
BT - 2016 IEEE International Conference on Robotics and Biomimetics, ROBIO 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2016 IEEE International Conference on Robotics and Biomimetics, ROBIO 2016
Y2 - 3 December 2016 through 7 December 2016
ER -