跳到主要导航 跳到搜索 跳到主要内容

Cross-domain human parsing via adversarial feature and label adaptation

  • Si Liu
  • , Yao Sun
  • , Defa Zhu
  • , Guanghui Ren
  • , Yu Chen
  • , Jiashi Feng
  • , Jizhong Han
  • CAS - Institute of Information Engineering
  • Nanjing University of Information Science & Technology
  • JD.com, Inc.
  • National University of Singapore

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

Human parsing has been extensively studied recently (Yamaguchi et al. 2012; Xia et al. 2017) due to its wide applications in many important scenarios. Mainstream fashion parsing models (i.e., parsers) focus on parsing the high-resolution and clean images. However, directly applying the parsers trained on benchmarks of high-quality samples to a particular application scenario in the wild, e.g., a canteen, airport or workplace, often gives non-satisfactory performance due to domain shift. In this paper, we explore a new and challenging cross-domain human parsing problem: taking the benchmark dataset with extensive pixel-wise labeling as the source domain, how to obtain a satisfactory parser on a new target domain without requiring any additional manual labeling? To this end, we propose a novel and efficient cross-domain human parsing model to bridge the cross-domain differences in terms of visual appearance and environment conditions and fully exploit commonalities across domains. Our proposed model explicitly learns a feature compensation network, which is specialized for mitigating the cross-domain differences. A discriminative feature adversarial network is introduced to supervise the feature compensation to effectively reduces the discrepancy between feature distributions of two domains. Besides, our proposed model also introduces a structured label adversarial network to guide the parsing results of the target domain to follow the high-order relationships of the structured labels shared across domains. The proposed framework is end-to-end trainable, practical and scalable in real applications. Extensive experiments are conducted where LIP dataset is the source domain and 4 different datasets including surveillance videos, movies and runway shows without any annotations, are evaluated as target domains. The results consistently confirm data efficiency and performance advantages of the proposed method for the challenging cross-domain human parsing problem.

源语言英语
主期刊名32nd AAAI Conference on Artificial Intelligence, AAAI 2018
出版商AAAI press
7146-7153
页数8
ISBN(电子版)9781577358008
出版状态已出版 - 2018
已对外发布
活动32nd AAAI Conference on Artificial Intelligence, AAAI 2018 - New Orleans, 美国
期限: 2 2月 20187 2月 2018

出版系列

姓名32nd AAAI Conference on Artificial Intelligence, AAAI 2018

会议

会议32nd AAAI Conference on Artificial Intelligence, AAAI 2018
国家/地区美国
New Orleans
时期2/02/187/02/18

指纹

探究 'Cross-domain human parsing via adversarial feature and label adaptation' 的科研主题。它们共同构成独一无二的指纹。

引用此