TY - GEN
T1 - Small components parsing via multi-feature fusion network
AU - Leng, Zhiying
AU - Lu, Yang
AU - Liang, Xiaohui
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/7
Y1 - 2020/7
N2 - Part parsing is a fundamental task towards fine image understanding in the multimedia and visual field. At present, the researchers working on part parsing focus on objects with large components, such as human, car. This paper centers on segmenting objects with small components. We call it small components parsing. In this paper, we propose a novel strategy for small components parsing, fusing multi-feature to utilize context information. We introduce Separable Spatial Pyramid module to embed spatial context information by fusing different scale spatial features. In decoding stage, attention-based feature fusion unit is drawn to utilize semantic context information in order to highlight details. Specifically, we design the Residual Upsampling manner to recover more details, considering spatial and channel characteristics. Experiments on RHD-PARSING and CamVid datasets demonstrate our method has achieved decent performance for small components parsing, taking hand parsing as an example, and has reached competitive results in scene parsing.
AB - Part parsing is a fundamental task towards fine image understanding in the multimedia and visual field. At present, the researchers working on part parsing focus on objects with large components, such as human, car. This paper centers on segmenting objects with small components. We call it small components parsing. In this paper, we propose a novel strategy for small components parsing, fusing multi-feature to utilize context information. We introduce Separable Spatial Pyramid module to embed spatial context information by fusing different scale spatial features. In decoding stage, attention-based feature fusion unit is drawn to utilize semantic context information in order to highlight details. Specifically, we design the Residual Upsampling manner to recover more details, considering spatial and channel characteristics. Experiments on RHD-PARSING and CamVid datasets demonstrate our method has achieved decent performance for small components parsing, taking hand parsing as an example, and has reached competitive results in scene parsing.
KW - Feature Fusion
KW - Semantic Segmentation
KW - Small Components Parsing
UR - https://www.scopus.com/pages/publications/85090390022
U2 - 10.1109/ICME46284.2020.9102932
DO - 10.1109/ICME46284.2020.9102932
M3 - 会议稿件
AN - SCOPUS:85090390022
T3 - Proceedings - IEEE International Conference on Multimedia and Expo
BT - 2020 IEEE International Conference on Multimedia and Expo, ICME 2020
PB - IEEE Computer Society
T2 - 2020 IEEE International Conference on Multimedia and Expo, ICME 2020
Y2 - 6 July 2020 through 10 July 2020
ER -