TY - JOUR
T1 - StackGAN++
T2 - Realistic Image Synthesis with Stacked Generative Adversarial Networks
AU - Zhang, Han
AU - Xu, Tao
AU - Li, Hongsheng
AU - Zhang, Shaoting
AU - Wang, Xiaogang
AU - Huang, Xiaolei
AU - Metaxas, Dimitris N.
N1 - Funding Information:
This work is partially supported by the Air Force Office of Scientific Research (AFOSR) under the Dynamic Data- Driven Application Systems Program and National Science Foundation (NSF) 1763523, 1747778, 1733843 and 1703883 Awards; partially supported by the NSF under grant ABI-1661280 and the CNS-1629913; and partially supported by the General Research Fund sponsored by the Research Grants Council of Hong Kong (Nos. CUHK14213616, CUHK14206114, CUHK14205615, CUHK14203015, CUHK14239816, CUHK419412, CUHK14207814, CUHK14208417, CUHK14202217). Han Zhang and Tao Xu contributed equally to this work.
Funding Information:
This work is partially supported by the Air Force Office of Scientific Research (AFOSR) under the Dynamic Data-Driven Application Systems Program and National Science Foundation (NSF) 1763523, 1747778, 1733843 and 1703883 Awards; partially supported by the NSF under grant ABI-1661280 and the CNS-1629913; and partially supported by the General Research Fund sponsored by the Research Grants Council of Hong Kong (Nos. CUHK14213616, CUHK14206114, CUHK14205615, CUHK14203015, CUHK14239816, CUHK419412, CUHK14207814, CUHK14208417, CUHK14202217). Han Zhang and Tao Xu contributed equally to this work.
Publisher Copyright:
© 2018 IEEE.
PY - 2019/8/1
Y1 - 2019/8/1
N2 - Although Generative Adversarial Networks (GANs) have shown remarkable success in various tasks, they still face challenges in generating high quality images. In this paper, we propose Stacked Generative Adversarial Networks (StackGANs) aimed at generating high-resolution photo-realistic images. First, we propose a two-stage generative adversarial network architecture, StackGAN-v1, for text-to-image synthesis. The Stage-I GAN sketches the primitive shape and colors of a scene based on a given text description, yielding low-resolution images. The Stage-II GAN takes Stage-I results and the text description as inputs, and generates high-resolution images with photo-realistic details. Second, an advanced multi-stage generative adversarial network architecture, StackGAN-v2, is proposed for both conditional and unconditional generative tasks. Our StackGAN-v2 consists of multiple generators and multiple discriminators arranged in a tree-like structure; images at multiple scales corresponding to the same scene are generated from different branches of the tree. StackGAN-v2 shows more stable training behavior than StackGAN-v1 by jointly approximating multiple distributions. Extensive experiments demonstrate that the proposed stacked generative adversarial networks significantly outperform other state-of-the-art methods in generating photo-realistic images.
AB - Although Generative Adversarial Networks (GANs) have shown remarkable success in various tasks, they still face challenges in generating high quality images. In this paper, we propose Stacked Generative Adversarial Networks (StackGANs) aimed at generating high-resolution photo-realistic images. First, we propose a two-stage generative adversarial network architecture, StackGAN-v1, for text-to-image synthesis. The Stage-I GAN sketches the primitive shape and colors of a scene based on a given text description, yielding low-resolution images. The Stage-II GAN takes Stage-I results and the text description as inputs, and generates high-resolution images with photo-realistic details. Second, an advanced multi-stage generative adversarial network architecture, StackGAN-v2, is proposed for both conditional and unconditional generative tasks. Our StackGAN-v2 consists of multiple generators and multiple discriminators arranged in a tree-like structure; images at multiple scales corresponding to the same scene are generated from different branches of the tree. StackGAN-v2 shows more stable training behavior than StackGAN-v1 by jointly approximating multiple distributions. Extensive experiments demonstrate that the proposed stacked generative adversarial networks significantly outperform other state-of-the-art methods in generating photo-realistic images.
UR - http://www.scopus.com/inward/record.url?scp=85049955346&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85049955346&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2018.2856256
DO - 10.1109/TPAMI.2018.2856256
M3 - Article
C2 - 30010548
AN - SCOPUS:85049955346
SN - 0162-8828
VL - 41
SP - 1947
EP - 1962
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 8
M1 - 8411144
ER -