Diffusion-GAN: Training GANs with Diffusion
Zhendong Wang, Huangjie Zheng, Pengcheng He, Weizhu Chen, Mingyuan Zhou
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/Zhendong-Wang/Diffusion-GANOfficialIn paperpytorch★ 696
- github.com/zhendong-wang/prompt-diffusionpytorch★ 414
- github.com/mingyuanzhou/sid-lsgpytorch★ 96
- github.com/jegzheng/truncated-diffusion-probabilistic-modelspytorch★ 36
Abstract
Generative adversarial networks (GANs) are challenging to train stably, and a promising remedy of injecting instance noise into the discriminator input has not been very effective in practice. In this paper, we propose Diffusion-GAN, a novel GAN framework that leverages a forward diffusion chain to generate Gaussian-mixture distributed instance noise. Diffusion-GAN consists of three components, including an adaptive diffusion process, a diffusion timestep-dependent discriminator, and a generator. Both the observed and generated data are diffused by the same adaptive diffusion process. At each diffusion timestep, there is a different noise-to-data ratio and the timestep-dependent discriminator learns to distinguish the diffused real data from the diffused generated data. The generator learns from the discriminator's feedback by backpropagating through the forward diffusion chain, whose length is adaptively adjusted to balance the noise and data levels. We theoretically show that the discriminator's timestep-dependent strategy gives consistent and helpful guidance to the generator, enabling it to match the true data distribution. We demonstrate the advantages of Diffusion-GAN over strong GAN baselines on various datasets, showing that it can produce more realistic images with higher stability and data efficiency than state-of-the-art GANs.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| AFHQ Cat | Diffusion InsGen | FID | 2.4 | — | Unverified |
| AFHQ Dog | Diffusion InsGen | FID | 4.83 | — | Unverified |
| AFHQ Wild | Diffusion InsGen | FID | 1.51 | — | Unverified |
| CelebA 64x64 | Diffusion StyleGAN2 | FID | 1.69 | — | Unverified |
| FFHQ 1024 x 1024 | Diffusion StyleGAN2 | FID | 2.83 | — | Unverified |
| LSUN Bedroom 256 x 256 | Diffusion StyleGAN2 | FID | 3.65 | — | Unverified |
| LSUN Bedroom 256 x 256 | Diffusion ProjectedGAN | FID | 1.43 | — | Unverified |
| LSUN Bedroom 256 x 256 | Diffusion ProjectedGAN (DINOv2) | FD | 547.61 | — | Unverified |
| LSUN Churches 256 x 256 | Diffusion StyleGAN2 | FID | 3.17 | — | Unverified |
| LSUN Churches 256 x 256 | Diffusion ProjectedGAN | FID | 1.85 | — | Unverified |
| STL-10 | Diffusion ProjectedGAN | FID | 6.91 | — | Unverified |
| STL-10 | Diffusion StyleGAN2 | FID | 11.53 | — | Unverified |