Abstract
In this paper we address the problem of automatically parsing the fashion images with weak supervision from the user-generated color-category tags such as 'red jeans' and 'white T-shirt'. This problem is very challenging due to the large diversity of fashion items and the absence of pixel-level tags, which make the traditional fully supervised algorithms inapplicable. To solve the problem, we propose to combine the human pose estimation module, the MRF-based color and category inference module and the (super)pixel-level category classifier learning module to generate multiple well-performing category classifiers, which can be directly applied to parse the fashion items in the images. Besides, all the training images are parsed with color-category labels and the human poses of the images are estimated during the model learning phase in this work. We also construct a new fashion image dataset called Colorful-Fashion, in which all 2,682 images are labeled with pixel-level color-category labels. Extensive experiments on this dataset clearly show the effectiveness of the proposed method for the weakly supervised fashion parsing task.
| Original language | English |
|---|---|
| Article number | 6630093 |
| Pages (from-to) | 253-265 |
| Number of pages | 13 |
| Journal | IEEE Transactions on Multimedia |
| Volume | 16 |
| Issue number | 1 |
| DOIs | |
| State | Published - Jan 2014 |
| Externally published | Yes |
Keywords
- Fashion parsing
- Markov random fields
- weakly-supervised learning
Fingerprint
Dive into the research topics of 'Fashion parsing with weak color-category labels'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver