Image Generation

Image generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

Unconditional generation refers to sampling images from the data distribution with no conditioning input, i.e. $p(y)$.
Conditional image generation (a subtask) refers to sampling images conditioned on a label $x$, i.e. $p(y|x)$.
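The difference between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. It assumes a toy dataset of (label, image) pairs and samples from the empirical distribution directly. A real generative model would learn an approximation of these distributions rather than resample the training data. The names `DATASET`, `sample_unconditional`, and `sample_conditional` are hypothetical and chosen only for this illustration.

```python
import random

# Toy "dataset": each entry pairs a label x with a stand-in image y.
# Real images would be arrays; short strings keep the sketch readable.
DATASET = [
    ("cat", "cat_img_0"),
    ("cat", "cat_img_1"),
    ("dog", "dog_img_0"),
    ("dog", "dog_img_1"),
    ("dog", "dog_img_2"),
]


def sample_unconditional(rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, y = rng.choice(DATASET)
    return y


def sample_conditional(rng, x):
    """Draw y ~ p(y | x): only entries whose label equals x are eligible."""
    candidates = [y for label, y in DATASET if label == x]
    if not candidates:
        raise ValueError(f"no samples with label {x!r}")
    return rng.choice(candidates)


rng = random.Random(0)  # seeded for repeatability
print(sample_unconditional(rng))       # any image, cat or dog
print(sample_conditional(rng, "cat"))  # always a cat image
```

The unconditional sampler can return any image in the dataset, while the conditional sampler only returns images whose label matches the requested one.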

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers


Showing 776–800 of 6689 papers

Title	Date	Tasks	Status	Hype
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception	Apr 15, 2025	Data Augmentation, Denoising	Code Available	1
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing	Apr 14, 2025	Image Generation, Text to Image Generation	Code Available	1
GeoUni: A Unified Model for Generating Geometry Diagrams, Problems and Problem Solutions	Apr 14, 2025	Image Generation	Code Available	1
Omni-Dish: Photorealistic and Faithful Image Generation and Editing for Arbitrary Chinese Dishes	Apr 14, 2025	Image Generation, Large Language Model	Code Available	1
LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs	Apr 11, 2025	Benchmarking, Image Generation	Code Available	1
Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging	Apr 11, 2025	Attribute, Computational Efficiency	Code Available	1
ID-Booth: Identity-consistent Face Generation with Diffusion Models	Apr 10, 2025	Denoising, Diversity	Code Available	1
A Unified Agentic Framework for Evaluating Conditional Image Generation	Apr 9, 2025	Conditional Image Generation, Image Generation	Code Available	1
DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation	Apr 9, 2025	Image Generation, Text to Image Generation	Code Available	1
An Empirical Study of GPT-4o Image Generation Capabilities	Apr 8, 2025	Benchmarking, Image Generation	Code Available	1
Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking	Apr 8, 2025	Image Generation	Code Available	1
MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization	Apr 1, 2025	Image Generation, Image Reconstruction	Code Available	1
Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability	Mar 26, 2025	Age/Unbiased, Decision Making	Code Available	1
Equivariant Image Modeling	Mar 24, 2025	Image Generation, Zero-shot Generalization	Code Available	1
U-REPA: Aligning Diffusion U-Nets to ViTs	Mar 24, 2025	Image Generation	Code Available	1
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models	Mar 24, 2025	Image Generation, Super-Resolution	Code Available	1
DeLoRA: Decoupling Angles and Strength in Low-rank Adaptation	Mar 23, 2025	Image Generation, Natural Language Understanding	Code Available	1
Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation	Mar 23, 2025	Diversity, Image Generation	Code Available	1
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis	Mar 19, 2025	Image Generation	Code Available	1
Efficient Personalization of Quantized Diffusion Model without Backpropagation	Mar 19, 2025	Image Generation	Code Available	1
SimWorld: A Unified Benchmark for Simulator-Conditioned Scene Generation via World Model	Mar 18, 2025	Autonomous Driving, Image Generation	Code Available	1
DPImageBench: A Unified Benchmark for Differentially Private Image Synthesis	Mar 18, 2025	Image Generation	Code Available	1
Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank Adaptation	Mar 16, 2025	Image Generation, Specificity	Code Available	1
Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion Models	Mar 14, 2025	Image Generation, Novel View Synthesis	Code Available	1
AdvPaint: Protecting Images from Inpainting Manipulation via Adversarial Attention Disruption	Mar 13, 2025	Image Generation	Code Available	1


Datasets: ImageNet 256x256, CIFAR-10, ImageNet 64x64, ImageNet 512x512, FFHQ 256 x 256, CelebA 64x64, ImageNet 32x32, LSUN Bedroom 256 x 256, STL-10, LSUN Churches 256 x 256, ImageNet 128x128, FFHQ 1024 x 1024

Benchmark Results
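
Most entries in the tables that follow report FID. Assuming this refers to the standard Fréchet Inception Distance, the metric fits a Gaussian to Inception-network features of real images and another to features of generated images, then measures the distance between the two fits. The symbols $\mu_r, \Sigma_r$ (mean and covariance of real-image features) and $\mu_g, \Sigma_g$ (the same for generated images) are notation introduced here for illustration:

$$\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2 + \mathrm{Tr}\left(\Sigma_r + \Sigma_g - 2\,(\Sigma_r \Sigma_g)^{1/2}\right)$$

Under this definition, lower values indicate that generated images are statistically closer to real ones, and identical feature distributions give a value of 0.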

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE + CLS	KID	7.95	—	Unverified