SOTAVerified

Image Generation

Image generation (synthesis) is the task of generating new images that follow the distribution of an existing dataset.

  • Unconditional generation refers to sampling images from the dataset's distribution without any conditioning signal, i.e. modelling $p(y)$, where $y$ is an image.
  • Conditional image generation (a subtask) refers to sampling images conditioned on side information such as a class label, i.e. modelling $p(y|x)$, where $x$ is the label.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.
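The two settings differ only in whether a conditioning variable is supplied at sampling time. A minimal sketch with a toy two-class Gaussian "image" model (the class means, prior, and dimensionality are made-up illustrative values, not taken from any real dataset):

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy generative model: each label x has its own Gaussian over
# 4-dimensional "images" y. Means and prior are illustrative only.
class_means = {0: np.zeros(4), 1: np.full(4, 3.0)}
prior = {0: 0.5, 1: 0.5}  # p(x)

def sample_unconditional(n):
    """Sample y ~ p(y) = sum_x p(x) p(y|x): draw a label, then an image."""
    labels = rng.choice(list(prior), size=n, p=list(prior.values()))
    return np.stack([rng.normal(class_means[x], 1.0) for x in labels])

def sample_conditional(x, n):
    """Sample y ~ p(y|x): the label is fixed by the caller."""
    return rng.normal(class_means[x], 1.0, size=(n, 4))

uncond = sample_unconditional(1000)  # mixture covering both classes
cond = sample_conditional(1, 1000)   # concentrated around class 1's mean
```

Conditional samples cluster around the chosen class mean, while unconditional samples cover the whole mixture; real generators (GANs, diffusion models) replace the Gaussians with learned distributions but keep this same split.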

(Image credit: StyleGAN)

Papers

Showing 651–700 of 6689 papers

Title | Status | Hype
----- | ------ | ----
Diffusion Probabilistic Models beat GANs on Medical Images | Code | 2
The Stable Artist: Steering Semantics in Diffusion Latent Space | Code | 2
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis | Code | 2
Semantic-Conditional Diffusion Networks for Image Captioning | Code | 2
Wavelet Diffusion Models are fast and scalable Image Generators | Code | 2
Latent Video Diffusion Models for High-Fidelity Long Video Generation | Code | 2
Inversion-Based Style Transfer with Diffusion Models | Code | 2
SinDiffusion: Learning a Diffusion Model from a Single Natural Image | Code | 2
Person Image Synthesis via Denoising Diffusion Model | Code | 2
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation | Code | 2
RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation | Code | 2
MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis | Code | 2
A Novel Sampling Scheme for Text- and Image-Conditional Image Synthesis in Quantized Latent Spaces | Code | 2
Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures | Code | 2
Medical Diffusion: Denoising Diffusion Probabilistic Models for 3D Medical Image Generation | Code | 2
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers | Code | 2
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance | Code | 2
What the DAAM: Interpreting Stable Diffusion Using Cross Attention | Code | 2
Building Normalizing Flows with Stochastic Interpolants | Code | 2
Personalizing Text-to-Image Generation via Aesthetic Gradients | Code | 2
Poisson Flow Generative Models | Code | 2
3DFaceShop: Explicitly Controllable 3D-Aware Portrait Generation | Code | 2
GAUDI: A Neural Architect for Immersive 3D Scene Generation | Code | 2
Unsupervised Medical Image Translation with Adversarial Diffusion Models | Code | 2
Collaborative Neural Rendering using Anime Character Sheets | Code | 2
Semantic Image Synthesis via Diffusion Models | Code | 2
DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change Detection | Code | 2
The ArtBench Dataset: Benchmarking Generative Models with Artworks | Code | 2
Blended Latent Diffusion | Code | 2
Diffusion-GAN: Training GANs with Diffusion | Code | 2
Improved Vector Quantized Diffusion Models | Code | 2
Text2Human: Text-Driven Controllable Human Image Generation | Code | 2
IDE-3D: Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis | Code | 2
Unsupervised Misaligned Infrared and Visible Image Fusion via Cross-Modality Image Generation and Registration | Code | 2
BBDM: Image-to-image Translation with Brownian Bridge Diffusion Models | Code | 2
Deep PCB To COCO Convertor | Code | 2
CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers | Code | 2
BCI: Breast Cancer Immunohistochemical Image Generation through Pyramid Pix2pix | Code | 2
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance | Code | 2
Any-resolution Training for High-resolution Image Synthesis | Code | 2
Neural Texture Extraction and Distribution for Controllable Person Image Synthesis | Code | 2
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis | Code | 2
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors | Code | 2
Pix2NeRF: Unsupervised Conditional π-GAN for Single Image to Neural Radiance Fields Translation | Code | 2
Self-Distilled StyleGAN: Towards Generation from Internet Photos | Code | 2
Progressive Distillation for Fast Sampling of Diffusion Models | Code | 2
StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets | Code | 2
Third Time's the Charm? Image and Video Editing with StyleGAN3 | Code | 2
DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents | Code | 2
Splicing ViT Features for Semantic Appearance Transfer | Code | 2
Page 14 of 134

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
--- | --- | --- | --- | --- | ---
1 | Improved DDPM | FID | 12.3 | — | Unverified
2 | ADM | FID | 11.84 | — | Unverified
3 | BigGAN-deep | FID | 8.1 | — | Unverified
4 | Polarity-BigGAN | FID | 6.82 | — | Unverified
5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | — | Unverified
6 | MaskGIT | FID | 6.18 | — | Unverified
7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | — | Unverified
8 | CDM | FID | 4.88 | — | Unverified
9 | ADM-G | FID | 4.59 | — | Unverified
10 | RIN | FID | 4.51 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
--- | --- | --- | --- | --- | ---
1 | PresGAN | FID | 52.2 | — | Unverified
2 | RESFLOW | FID | 48.29 | — | Unverified
3 | Residual Flow | FID | 46.37 | — | Unverified
4 | GLF+perceptual loss (ours) | FID | 44.6 | — | Unverified
5 | ProdPoly no activation functions | FID | 40.45 | — | Unverified
6 | ProdPoly no activation functions | FID | 36.77 | — | Unverified
7 | ACGAN | FID | 35.47 | — | Unverified
8 | DenseFlow-74-10 | FID | 34.9 | — | Unverified
9 | NVAE w/ flow | FID | 32.53 | — | Unverified
10 | QSNGAN | FID | 31.97 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
--- | --- | --- | --- | --- | ---
1 | GLIDE + CLS | FID | 30.87 | — | Unverified
2 | GLIDE + CLIP | FID | 30.46 | — | Unverified
3 | GLIDE + CLS-FREE | FID | 29.22 | — | Unverified
4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | — | Unverified
5 | PGMGAN | FID | 21.73 | — | Unverified
6 | CLR-GAN | FID | 20.27 | — | Unverified
7 | FM | FID | 14.45 | — | Unverified
8 | CT (Direct Generation, NFE=1) | FID | 13 | — | Unverified
9 | CT (Direct Generation, NFE=2) | FID | 11.1 | — | Unverified
10 | GLIDE + CLS | KID | 7.95 | — | Unverified
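Nearly every entry above reports FID (Fréchet Inception Distance), which fits a Gaussian to real and generated feature sets and computes $\mathrm{FID} = \lVert \mu_r - \mu_g \rVert^2 + \mathrm{Tr}\big(\Sigma_r + \Sigma_g - 2(\Sigma_r \Sigma_g)^{1/2}\big)$; lower is better. A sketch operating on raw feature arrays (in practice the features come from an Inception-v3 network, which is omitted here; the random arrays below are placeholder data):

```python
import numpy as np
from scipy.linalg import sqrtm

def fid(feats_real, feats_gen):
    """Fréchet distance between Gaussian fits to two (n_samples, dim) feature sets."""
    mu_r, mu_g = feats_real.mean(axis=0), feats_gen.mean(axis=0)
    cov_r = np.cov(feats_real, rowvar=False)
    cov_g = np.cov(feats_gen, rowvar=False)
    # Matrix square root of the covariance product; tiny imaginary
    # components from numerical error are discarded.
    covmean = sqrtm(cov_r @ cov_g).real
    diff = mu_r - mu_g
    return float(diff @ diff + np.trace(cov_r + cov_g - 2.0 * covmean))

rng = np.random.default_rng(0)
real_feats = rng.normal(size=(500, 8))            # stand-in for real-image features
shifted_feats = rng.normal(loc=1.0, size=(500, 8))  # a clearly mismatched "generator"
```

Identical feature sets give an FID near zero, and the score grows as the generated distribution drifts from the real one, which is why the leaderboards rank models by ascending FID.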