Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 951–1000 of 6689 papers

Title	Date	Tasks	Status	Hype	Score
Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability	Mar 26, 2025	Age/UnbiasedDecision Making	CodeCode Available	1	5
ClimateGAN: Raising Climate Change Awareness by Generating Images of Floods	Oct 6, 2021	Conditional Image GenerationDomain Adaptation	CodeCode Available	1	5
Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customization	May 31, 2024	Adversarial AttackImage Generation	CodeCode Available	1	5
End-to-End Diffusion Latent Optimization Improves Classifier Guidance	Mar 23, 2023	DenoisingImage Generation	CodeCode Available	1	5
Dissolving Is Amplifying: Towards Fine-Grained Anomaly Detection	Feb 28, 2023	Anomaly DetectionContrastive Learning	CodeCode Available	1	5
CLIP-Diffusion-LM: Apply Diffusion Model on Image Captioning	Oct 10, 2022	DecoderDenoising	CodeCode Available	1	5
CLIP-Forge: Towards Zero-Shot Text-to-Shape Generation	Oct 6, 2021	Image GenerationText to Image Generation	CodeCode Available	1	5
CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP	Mar 1, 2022	Image GenerationText to Image Generation	CodeCode Available	1	5
Glow: Generative Flow with Invertible 1x1 Convolutions	Jul 9, 2018	Density EstimationImage Generation	CodeCode Available	1	5
GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation	Mar 17, 2023	DecoderImage Generation	CodeCode Available	1	5
Towards Disentangling Latent Space for Unsupervised Semantic Face Editing	Nov 5, 2020	AttributeImage Generation	CodeCode Available	1	5
Clockwork Diffusion: Efficient Generation With Model-Step Distillation	Dec 13, 2023	DenoisingImage Generation	CodeCode Available	1	5
CLoG: Benchmarking Continual Learning of Image Generation Models	Jun 7, 2024	BenchmarkingContinual Learning	CodeCode Available	1	5
IDF-CR: Iterative Diffusion Process for Divide-and-Conquer Cloud Removal in Remote-sensing Images	Mar 18, 2024	Cloud RemovalImage Generation	CodeCode Available	1	5
Disentanglement in a GAN for Unconditional Speech Synthesis	Jul 4, 2023	DisentanglementGenerative Adversarial Network	CodeCode Available	1	5
ArtiFact: A Large-Scale Dataset with Artificial and Factual Images for Generalizable and Robust Synthetic Image Detection	Feb 23, 2023	Image GenerationMulti-class Classification	CodeCode Available	1	5
GLAMI-1M: A Multilingual Image-Text Fashion Dataset	Nov 17, 2022	ClassificationImage Generation	CodeCode Available	1	5
Cloud Removal in Satellite Images Using Spatiotemporal Generative Networks	Dec 14, 2019	Cloud RemovalEarth Observation	CodeCode Available	1	5
Disentangled Image Generation Through Structured Noise Injection	Apr 26, 2020	DisentanglementImage Generation	CodeCode Available	1	5
GLeaD: Improving GANs with A Generator-Leading Task	Dec 7, 2022	domain classificationGenerative Adversarial Network	CodeCode Available	1	5
High-Fidelity Synthesis with Disentangled Representation	Jan 13, 2020	DisentanglementGenerative Adversarial Network	CodeCode Available	1	5
CNN-generated images are surprisingly easy to spot... for now	Dec 23, 2019	Data AugmentationImage Generation	CodeCode Available	1	5
Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation	Jun 15, 2022	Contrastive LearningDenoising	CodeCode Available	1	5
Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using Diffusion Models	Nov 28, 2023	Image GenerationScene Text Detection	CodeCode Available	1	5
CACTI: A Framework for Scalable Multi-Task Multi-Scene Visual Imitation Learning	Dec 12, 2022	Data AugmentationImage Generation	CodeCode Available	1	5
Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners	May 18, 2023	Image GenerationImage-text matching	CodeCode Available	1	5
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation	Dec 31, 2021	Image CaptioningImage Generation	CodeCode Available	1	5
Erasing Undesirable Influence in Diffusion Models	Jan 11, 2024	DenoisingImage Generation	CodeCode Available	1	5
Artistic Glyph Image Synthesis via One-Stage Few-Shot Learning	Oct 11, 2019	Few-Shot LearningImage Generation	CodeCode Available	1	5
ESVAE: An Efficient Spiking Variational Autoencoder with Reparameterizable Poisson Spiking Sampling	Oct 23, 2023	DecoderImage Generation	CodeCode Available	1	5
Image Generation for Efficient Neural Network Training in Autonomous Drone Racing	Aug 6, 2020	Dataset GenerationEfficient Neural Network	CodeCode Available	1	5
SketchyCOCO: Image Generation from Freehand Scene Sketches	Mar 5, 2020	AttributeGenerative Adversarial Network	CodeCode Available	1	5
ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis	Apr 21, 2024	3D-Aware Image SynthesisContrastive Learning	CodeCode Available	1	5
ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections	May 30, 2024	Image Generation	CodeCode Available	1	5
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models	Oct 28, 2024	Adversarial TextImage Generation	CodeCode Available	1	5
Discovering Transferable Forensic Features for CNN-generated Images Detection	Aug 24, 2022	Image ForensicsImage Generation	CodeCode Available	1	5
GIFD: A Generative Gradient Inversion Method with Feature Domain Optimization	Aug 9, 2023	Federated LearningImage Generation	CodeCode Available	1	5
Cached Multi-Lora Composition for Multi-Concept Image Generation	Feb 7, 2025	Computational EfficiencyDenoising	CodeCode Available	1	5
Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models	Mar 20, 2023	AttributeDenoising	CodeCode Available	1	5
C2N: Practical Generative Noise Modeling for Real-World Denoising	Feb 19, 2022	DenoisingImage Denoising	CodeCode Available	1	5
Anycost GANs for Interactive Image Synthesis and Editing	Mar 4, 2021	Image Generation	CodeCode Available	1	5
Evaluating the feasibility of using Generative Models to generate Chest X-Ray Data	May 30, 2023	Image GenerationMedical Diagnosis	CodeCode Available	1	5
CoLa-Diff: Conditional Latent Diffusion Model for Multi-Modal MRI Synthesis	Mar 24, 2023	CoLAImage Generation	CodeCode Available	1	5
A Scene-Text Synthesis Engine Achieved Through Learning from Decomposed Real-World Data	Sep 6, 2022	Image Generation	CodeCode Available	1	5
2D medical image synthesis using transformer-based denoising diffusion probabilistic model	May 5, 2023	DenoisingDiversity	CodeCode Available	1	5
Discriminator-Cooperated Feature Map Distillation for GAN Compression	Dec 29, 2022	Image GenerationKnowledge Distillation	CodeCode Available	1	5
Evolving Normalization-Activation Layers	Apr 6, 2020	image-classificationImage Classification	CodeCode Available	1	5
GIRAFFE HD: A High-Resolution 3D-aware Generative Model	Mar 28, 2022	DisentanglementImage Generation	CodeCode Available	1	5
Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling	Jul 5, 2022	DiversityImage Generation	CodeCode Available	1	5
Direct Ascent Synthesis: Revealing Hidden Generative Capabilities in Discriminative Models	Feb 11, 2025	Image GenerationStyle Transfer	CodeCode Available	1	5

Show:10 25 50

← PrevPage 20 of 134Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified