Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–225 of 6689 papers

Title	Date	Tasks	Status	Hype
Deciphering Oracle Bone Language with Diffusion Models	Jun 2, 2024	DeciphermentImage Generation	CodeCode Available	3
DDT: Decoupled Diffusion Transformer	Apr 8, 2025	DenoisingImage Generation	CodeCode Available	3
Kiss3DGen: Repurposing Image Diffusion Models for 3D Asset Generation	Mar 3, 2025	3D Generation3D Reconstruction	CodeCode Available	3
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models	Feb 8, 2022	DiagnosticImage Captioning	CodeCode Available	3
3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering	Jan 9, 2025	Image GenerationText to Image Generation	CodeCode Available	3
AP-LDM: Attentive and Progressive Latent Diffusion Model for Training-Free High-Resolution Image Generation	Oct 8, 2024	DenoisingImage Generation	CodeCode Available	3
3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation	Oct 16, 2024	AttributeImage Generation	CodeCode Available	3
CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation	Oct 12, 2024	Conditional Image GenerationGPU	CodeCode Available	3
Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative Framework	Oct 28, 2024	Image GenerationImage Manipulation	CodeCode Available	3
On the Trajectory Regularity of ODE-based Diffusion Sampling	May 18, 2024	DenoisingImage Generation	CodeCode Available	3
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation	Feb 19, 2024	Image Generation	CodeCode Available	3
Behavior Generation with Latent Actions	Mar 5, 2024	Autonomous DrivingDecision Making	CodeCode Available	3
Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models	Jan 1, 2024	Image GenerationText to Image Generation	CodeCode Available	3
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation	Sep 12, 2023	GPUImage Generation	CodeCode Available	3
Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer	May 7, 2024	Image GenerationSuper-Resolution	CodeCode Available	3
Deep Generative Models on 3D Representations: A Survey	Oct 27, 2022	3D-Aware Image Synthesis3D Shape Generation	CodeCode Available	3
PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360deg	Jan 1, 2023	Image GenerationImage Segmentation	CodeCode Available	3
Personalized Image Generation with Deep Generative Models: A Decade Survey	Feb 18, 2025	Image GenerationPersonalized Image Generation	CodeCode Available	3
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis	Oct 10, 2024	Feature CompressionImage Generation	CodeCode Available	3
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction	Apr 1, 2025	Image Generation	CodeCode Available	3
ControlAR: Controllable Image Generation with Autoregressive Models	Oct 3, 2024	Image Generation	CodeCode Available	3
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation	Aug 19, 2024	Image GenerationVideo Generation	CodeCode Available	3
An Image is Worth 32 Tokens for Reconstruction and Generation	Jun 11, 2024	Image GenerationImage Reconstruction	CodeCode Available	3
Consistency Models Made Easy	Jun 20, 2024	Computational EfficiencyGPU	CodeCode Available	3
Consistency Flow Matching: Defining Straight Flows with Velocity Consistency	Jul 2, 2024	Image Generation	CodeCode Available	3

Show:10 25 50

← PrevPage 9 of 268Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified