arXiv:1612.04357 Abstract | arXiv Analytics

arXiv:1612.04357 [cs.CV]Abstract References Reviews Resources

Stacked Generative Adversarial Networks

Xun Huang, Yixuan Li, Omid Poursaeed, John Hopcroft, Serge Belongie

Published 2016-12-13Version 1

In this paper we aim to leverage the powerful bottom-up discriminative representations to guide a top-down generative model. We propose a novel generative model named Stacked Generative Adversarial Networks (SGAN), which is trained to invert the hierarchical representations of a discriminative bottom-up deep network. Our model consists of a top-down stack of GANs, each trained to generate "plausible" lower-level representations, conditioned on higher-level representations. A representation discriminator is introduced at each feature hierarchy to encourage the representation manifold of the generator to align with that of the bottom-up discriminative network, providing intermediate supervision. In addition, we introduce a conditional loss that encourages the use of conditional information from the layer above, and a novel entropy loss that maximizes a variational lower bound on the conditional entropy of generator outputs. To the best of our knowledge, the entropy loss is the first attempt to tackle the conditional model collapse problem that is common in conditional GANs. We first train each GAN of the stack independently, and then we train the stack end-to-end. Unlike the original GAN that uses a single noise vector to represent all the variations, our SGAN decomposes variations into multiple levels and gradually resolves uncertainties in the top-down generative process. Experiments demonstrate that SGAN is able to generate diverse and high-quality images, as well as being more interpretable than a vanilla GAN.

Comments: Under review

Categories: cs.CV, cs.LG, cs.NE, stat.ML

Keywords: stacked generative adversarial networks, representation, generative model, named stacked generative adversarial, model named stacked generative

Related articles: Most relevant | Search more

arXiv:1508.04035 [cs.CV] (Published 2015-08-17)

A Generative Model for Multi-Dialect Representation

Emmanuel N. Osegi

arXiv:1906.11881 [cs.CV] (Published 2019-06-11)

Explicit Disentanglement of Appearance and Perspective in Generative Models

Nicki Skafte Detlefsen, Søren Hauberg

arXiv:1910.07169 [cs.CV] (Published 2019-10-16)

Generative Modeling for Small-Data Object Detection