TY - JOUR
T1 - Multi-Process Training GAN for Identity-Preserving Face Synthesis
AU - Tang, Zhiyong
AU - Yang, Jianbing
AU - Pei, Zhongcai
AU - Song, Xiao
AU - Ge, Baoshuang
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2019
Y1 - 2019
N2 - Recently, the advent of generative adversarial networks (GANs) in synthesizing identity-preserving faces has aroused the considerable interest of many scholars. However, face attribute representation learning, which is explicitly disentangled from identity feature and synthesizes identity-preserving face images with high diversity and quality in other datasets, still remains challenging. To cope with that, this paper proposes multi-process training GAN, or MP-GAN for short, which significantly improves the disentangled representation, diversity, and quality. Unlike other existing single-process models that map noise to a final output resolution image in a single training process, MP-GAN divides training into multiple processes. The main idea is to first generate lower resolution images that contain lower frequency feature information through competition and then extract their disentangled facial features to generate a higher resolution image. Furthermore, an identity-preserving image with real identity feature and disentangled facial feature could be generated at the final output resolution training process. The distinct benefits are not only getting diverse facial feature generation but also achieving disentangled representation from the lower resolution training processes and rendering a photo-realistic image that contains high diversity but preserves identity at the final output resolution training process. The high performance of this method is highlighted by quantitative and qualitative comparisons. We conclude that MP-GAN can generate face images featuring high diversity and quality while efficiently preserving identity, thereby significantly outperforming most modern advanced methods.
AB - Recently, the advent of generative adversarial networks (GANs) in synthesizing identity-preserving faces has aroused the considerable interest of many scholars. However, face attribute representation learning, which is explicitly disentangled from identity feature and synthesizes identity-preserving face images with high diversity and quality in other datasets, still remains challenging. To cope with that, this paper proposes multi-process training GAN, or MP-GAN for short, which significantly improves the disentangled representation, diversity, and quality. Unlike other existing single-process models that map noise to a final output resolution image in a single training process, MP-GAN divides training into multiple processes. The main idea is to first generate lower resolution images that contain lower frequency feature information through competition and then extract their disentangled facial features to generate a higher resolution image. Furthermore, an identity-preserving image with real identity feature and disentangled facial feature could be generated at the final output resolution training process. The distinct benefits are not only getting diverse facial feature generation but also achieving disentangled representation from the lower resolution training processes and rendering a photo-realistic image that contains high diversity but preserves identity at the final output resolution training process. The high performance of this method is highlighted by quantitative and qualitative comparisons. We conclude that MP-GAN can generate face images featuring high diversity and quality while efficiently preserving identity, thereby significantly outperforming most modern advanced methods.
KW - Multi-process training
KW - disentangled representation
KW - diversity and quality
KW - identity-preserving
UR - https://www.scopus.com/pages/publications/85070316301
U2 - 10.1109/ACCESS.2019.2930203
DO - 10.1109/ACCESS.2019.2930203
M3 - 文章
AN - SCOPUS:85070316301
SN - 2169-3536
VL - 7
SP - 97641
EP - 97652
JO - IEEE Access
JF - IEEE Access
M1 - 8768064
ER -