Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples from the dataset without any conditioning input, i.e. sampling from $p(y)$.
Conditional image generation (a subtask) refers to generating samples from the dataset conditioned on a label $x$, i.e. sampling from $p(y|x)$.
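
The difference between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. The toy dataset, the labels, and the function names below are hypothetical. A real generative model learns the distribution rather than resampling stored items, so this only shows what "conditioning on a label" means.

```python
import random

# Hypothetical toy dataset: (label x, sample y) pairs standing in for labelled images.
DATASET = [
    ("cat", "cat_0"), ("cat", "cat_1"),
    ("dog", "dog_0"), ("dog", "dog_1"), ("dog", "dog_2"),
]

def sample_unconditional(rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, y = rng.choice(DATASET)
    return y

def sample_conditional(label, rng):
    """Draw y ~ p(y | x = label): only samples carrying that label are eligible."""
    candidates = [y for x, y in DATASET if x == label]
    if not candidates:
        raise ValueError(f"no samples with label {label!r}")
    return rng.choice(candidates)

rng = random.Random(0)
unconditional_draws = [sample_unconditional(rng) for _ in range(5)]
conditional_draws = [sample_conditional("cat", rng) for _ in range(5)]
```

Here `unconditional_draws` may contain samples of either class, while every entry of `conditional_draws` comes from the "cat" subset.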

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 51–75 of 6689 papers

Title	Date	Tasks	Status	Hype
Video Perception Models for 3D Scene Synthesis	Jun 25, 2025	3D Reconstruction, Image Generation	Unverified	0
EAR: Erasing Concepts from Unified Autoregressive Models	Jun 25, 2025	Image Generation	Code Available	0
Angio-Diff: Learning a Self-Supervised Adversarial Diffusion Model for Angiographic Geometry Generation	Jun 24, 2025	Image Generation	Code Available	0
SoK: Can Synthetic Images Replace Real Data? A Survey of Utility and Privacy of Synthetic Image Generation	Jun 24, 2025	Image Generation, Privacy Preserving	Unverified	0
Morse: Dual-Sampling for Lossless Acceleration of Diffusion Models	Jun 23, 2025	Image Generation	Code Available	1
OmniGen2: Exploration to Advanced Multimodal Generation	Jun 23, 2025	Image Generation, multimodal generation	Code Available	7
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation	Jun 22, 2025	GPU, Image Generation	Code Available	3
Programmable-Room: Interactive Textured 3D Room Meshes Generation Empowered by Large Language Models	Jun 21, 2025	Attribute, Image Generation	Unverified	0
DreamCube: 3D Panorama Generation via Multi-plane Synchronization	Jun 20, 2025	Depth Estimation, Image Generation	Unverified	0
The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation	Jun 20, 2025	Image Generation, Quantization	Unverified	0
AI's Blind Spots: Geographic Knowledge and Diversity Deficit in Generated Urban Scenario	Jun 20, 2025	Diversity, Entity Disambiguation	Unverified	0
Beyond Blur: A Fluid Perspective on Generative Diffusion Models	Jun 20, 2025	Diversity, GPU	Unverified	0
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens	Jun 20, 2025	Image Generation, Multimodal Reasoning	Code Available	3
Category-based Galaxy Image Generation via Diffusion Models	Jun 19, 2025	Astronomy, Image Generation	Unverified	0
PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models	Jun 19, 2025	Image Generation, Quantization	Unverified	0
Watermarking Autoregressive Image Generation	Jun 19, 2025	Image Generation, Language Modeling	Code Available	2
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model	Jun 18, 2025	Image Generation	Code Available	1
Align Your Flow: Scaling Continuous-Time Flow Map Distillation	Jun 17, 2025	Image Generation	Unverified	0
DiffusionBlocks: Blockwise Training for Generative Models via Score-Based Diffusion	Jun 17, 2025	Denoising, Image Generation	Unverified	0
Risk Estimation of Knee Osteoarthritis Progression via Predictive Multi-task Modelling from Efficient Diffusion Model using X-ray Images	Jun 17, 2025	Image Generation, Interpretable Machine Learning	Unverified	0
Cost-Aware Routing for Efficient Text-To-Image Generation	Jun 17, 2025	Denoising, Image Generation	Unverified	0
VideoMAR: Autoregressive Video Generation with Continuous Tokens	Jun 17, 2025	GPU, Image Generation	Unverified	0
Decoupled Classifier-Free Guidance for Counterfactual Diffusion Models	Jun 17, 2025	Attribute, counterfactual	Unverified	0
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention	Jun 16, 2025	Attribute, Image Generation	Unverified	0
Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis	Jun 16, 2025	Benchmarking, Data Augmentation	Unverified	0

Datasets: ImageNet 256x256, CIFAR-10, ImageNet 64x64, ImageNet 512x512, FFHQ 256 x 256, CelebA 64x64, ImageNet 32x32, LSUN Bedroom 256 x 256, STL-10, LSUN Churches 256 x 256, ImageNet 128x128, FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE + CLS	KID	7.95	—	Unverified
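
Most rows above report FID (Fréchet Inception Distance), where lower is better. The sketch below shows how the Fréchet distance between two feature sets can be computed. It assumes the features have already been extracted. In practice they come from an Inception-v3 network, whereas here random vectors are hypothetical stand-ins, and `frechet_distance` is an illustrative name, not any leaderboard's evaluation code.

```python
import numpy as np

def frechet_distance(feats_real, feats_gen):
    """Fréchet distance between Gaussians fitted to two feature sets of shape (n, d)."""
    mu_r, mu_g = feats_real.mean(axis=0), feats_gen.mean(axis=0)
    sigma_r = np.cov(feats_real, rowvar=False)
    sigma_g = np.cov(feats_gen, rowvar=False)
    # Tr((sigma_r sigma_g)^(1/2)) via the eigenvalues of the product, which are
    # real and non-negative for PSD covariances up to numerical noise.
    eigvals = np.linalg.eigvals(sigma_r @ sigma_g)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    diff = mu_r - mu_g
    return float(diff @ diff + np.trace(sigma_r) + np.trace(sigma_g) - 2.0 * tr_sqrt)

# Stand-in "features": same distribution vs. a mean-shifted distribution.
rng = np.random.default_rng(0)
real = rng.normal(0.0, 1.0, size=(2000, 8))
same = rng.normal(0.0, 1.0, size=(2000, 8))
shifted = rng.normal(0.5, 1.0, size=(2000, 8))
fid_same = frechet_distance(real, same)
fid_shifted = frechet_distance(real, shifted)
```

With these inputs, `fid_shifted` comes out larger than `fid_same`, which matches the reading that a lower score means the generated features are closer to the real ones. Scores computed with different feature extractors or sample counts are not directly comparable.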