Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 226–250 of 6689 papers

Title	Date	Tasks	Status	Hype
Latent Flow Transformer	May 20, 2025	Image Generation	CodeCode Available	0
Adaptive Cyclic Diffusion for Inference Scaling	May 20, 2025	Computational EfficiencyDenoising	—Unverified	0
Emerging Properties in Unified Multimodal Pretraining	May 20, 2025	Image Editing	CodeCode Available	9
"Haet Bhasha aur Diskrimineshun": Phonetic Perturbations in Code-Mixed Hinglish to Red-Team LLMs	May 20, 2025	Image GenerationRed Teaming	—Unverified	0
Instructing Text-to-Image Diffusion Models via Classifier-Guided Semantic Optimization	May 20, 2025	AttributeDisentanglement	CodeCode Available	0
Hunyuan-Game: Industrial-grade Intelligent Game Creation Model	May 20, 2025	Image GenerationImage to Video Generation	—Unverified	0
UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation	May 20, 2025	Image GenerationLanguage Modeling	—Unverified	0
Sparc3D: Sparse Representation and Construction for High-Resolution 3D Shapes Modeling	May 20, 2025	3D Generation3D Reconstruction	—Unverified	0
VisualQuality-R1: Reasoning-Induced Image Quality Assessment via Reinforcement Learning to Rank	May 20, 2025	Image GenerationImage Quality Assessment	CodeCode Available	2
Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives	May 20, 2025	Caption GenerationContrastive Learning	—Unverified	0
Training-Free Watermarking for Autoregressive Image Generation	May 20, 2025	Image Generation	CodeCode Available	1
Improving Compositional Generation with Diffusion Models Using Lift Scores	May 19, 2025	Image GenerationPosition	CodeCode Available	0
Accelerate TarFlow Sampling with GS-Jacobi Iteration	May 19, 2025	Image Generation	CodeCode Available	1
Swin DiT: Diffusion Transformer using Pseudo Shifted Windows	May 19, 2025	Image Generation	CodeCode Available	0
Diffusion Models with Double Guidance: Generate with aggregated datasets	May 19, 2025	Image Generation	—Unverified	0
FRAbench and GenEval: Scaling Fine-Grained Aspect Evaluation across Tasks, Modalities	May 19, 2025	Image GenerationText Generation	—Unverified	0
SounDiT: Geo-Contextual Soundscape-to-Landscape Generation	May 19, 2025	Image Generation	—Unverified	0
Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking	May 19, 2025	Image GenerationMamba	—Unverified	0
Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion model	May 19, 2025	DecoderDenoising	CodeCode Available	0
Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking	May 19, 2025	Image GenerationObject Tracking	—Unverified	0
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO	May 19, 2025	DecoderImage Generation	CodeCode Available	0
VTBench: Evaluating Visual Tokenizers for Autoregressive Image Generation	May 19, 2025	Image GenerationImage Reconstruction	CodeCode Available	1
A Physics-Inspired Optimizer: Velocity Regularized Adam	May 19, 2025	image-classificationImage Classification	—Unverified	0
Exploring Sparsity for Parameter Efficient Fine Tuning Using Wavelets	May 18, 2025	DiversityImage Generation	CodeCode Available	0
Context-Aware Autoregressive Models for Multi-Conditional Image Generation	May 18, 2025	Conditional Image GenerationImage Generation	—Unverified	0

Show:10 25 50

← PrevPage 10 of 268Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified