StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets

2022-02-01Code Available2· sign in to hype

Axel Sauer, Katja Schwarz, Andreas Geiger

Code Available — Be the first to reproduce this paper.

Code

github.com/autonomousvision/stylegan_xl
Officialpytorch★ 994
github.com/autonomousvision/stylegan-xl
pytorch★ 994

Abstract

Computer graphics has experienced a recent surge of data-centric approaches for photorealistic and controllable content creation. StyleGAN in particular sets new standards for generative modeling regarding image quality and controllability. However, StyleGAN's performance severely degrades on large unstructured datasets such as ImageNet. StyleGAN was designed for controllability; hence, prior works suspect its restrictive design to be unsuitable for diverse datasets. In contrast, we find the main limiting factor to be the current training strategy. Following the recently introduced Projected GAN paradigm, we leverage powerful neural network priors and a progressive growing strategy to successfully train the latest StyleGAN3 generator on ImageNet. Our final model, StyleGAN-XL, sets a new state-of-the-art on large-scale image synthesis and is the first to generate images at a resolution of 1024^2 at such a dataset scale. We demonstrate that this model can invert and edit images beyond the narrow domain of portraits or specific object classes.

Tasks

Image Generation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
FFHQ 1024 x 1024	StyleGAN-XL	FID	2.02	—	Unverified
FFHQ 256 x 256	StyleGAN-XL	FID	2.19	—	Unverified
FFHQ 256 x 256	StyleGAN-XL (DINOv2)	FD	240.07	—	Unverified
FFHQ 512 x 512	StyleGAN-XL	FID	2.41	—	Unverified
ImageNet 128x128	StyleGAN-XL	FID	1.81	—	Unverified
ImageNet 256x256	StyleGAN-XL	FID	2.3	—	Unverified
ImageNet 32x32	StyleGAN-XL	FID	1.1	—	Unverified
ImageNet 512x512	StyleGAN-XL	FID	2.4	—	Unverified
ImageNet 64x64	StyleGAN-XL	FID	1.51	—	Unverified
Pokemon 1024x1024	StyleGAN-XL	FID	25.47	—	Unverified
Pokemon 256x256	StyleGAN-XL	FID	23.97	—	Unverified

StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets

Code

Abstract

Tasks

Benchmark Results

Reproductions