Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 701–750 of 6689 papers

Title	Date	Tasks	Status	Hype
Diffusion Facial Forgery Detection	Jan 29, 2024	Image Generation	CodeCode Available	2
Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks	May 5, 2021	image-classificationImage Classification	CodeCode Available	2
A Style-Based Generator Architecture for Generative Adversarial Networks	Dec 12, 2018	DisentanglementImage Generation	CodeCode Available	2
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition	May 22, 2024	Image Generation	CodeCode Available	2
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis	Jan 30, 2023	Image GenerationScene Understanding	CodeCode Available	2
Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization	Jun 24, 2024	Consistent Character GenerationImage Generation	CodeCode Available	2
Character-Aware Models Improve Visual Text Rendering	Dec 20, 2022	Image Generation	CodeCode Available	2
Diffusion Models Beat GANs on Image Synthesis	May 11, 2021	Conditional Image GenerationDiversity	CodeCode Available	2
FouriScale: A Frequency Perspective on Training-Free High-Resolution Image Synthesis	Mar 19, 2024	Image GenerationText to Image Generation	CodeCode Available	2
Pix2NeRF: Unsupervised Conditional p-GAN for Single Image to Neural Radiance Fields Translation	Jan 1, 2022	3D-Aware Image SynthesisImage Generation	CodeCode Available	2
CharaConsist: Fine-Grained Consistent Character Generation	Jul 15, 2025	Consistent Character GenerationImage Generation	CodeCode Available	2
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions	Feb 24, 2025	Data AugmentationImage Generation	CodeCode Available	2
GAN Compression: Efficient Architectures for Interactive Conditional GANs	Mar 19, 2020	Image GenerationNeural Architecture Search	CodeCode Available	2
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens	Oct 17, 2024	Image GenerationText to Image Generation	CodeCode Available	2
Diffusion Recommender Model	Apr 11, 2023	DenoisingImage Generation	CodeCode Available	2
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner	Sep 26, 2024	Image GenerationText to Image Generation	CodeCode Available	2
Playable Game Generation	Dec 1, 2024	GPUImage Generation	CodeCode Available	2
Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation	Nov 22, 2022	Image GenerationImage-to-Image Translation	CodeCode Available	2
Flux Already Knows -- Activating Subject-Driven Image Generation without Training	Apr 12, 2025	Image GenerationVirtual Try-on	CodeCode Available	2
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance	Apr 8, 2025	Image Generation	CodeCode Available	2
ProcessPainter: Learn Painting Process from Sequence Data	Jun 10, 2024	DenoisingImage Generation	CodeCode Available	2
Progressive Distillation for Fast Sampling of Diffusion Models	Feb 1, 2022	Density EstimationImage Generation	CodeCode Available	2
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching	May 29, 2024	compressed sensingDeblurring	CodeCode Available	2
Flow Matching in Latent Space	Jul 17, 2023	Computational EfficiencyImage Generation	CodeCode Available	2
Causal Diffusion Transformers for Generative Modeling	Dec 16, 2024	DecoderImage Generation	CodeCode Available	2
Flow-Guided Diffusion for Video Inpainting	Nov 26, 2023	DenoisingImage Generation	CodeCode Available	2
FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching	Dec 19, 2024	Image GenerationPrediction	CodeCode Available	2
Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality	Mar 1, 2025	Image EnhancementImage Generation	CodeCode Available	2
Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation	May 31, 2024	3D GenerationImage Generation	CodeCode Available	2
GAN Prior Embedded Network for Blind Face Restoration in the Wild	May 13, 2021	Blind Face RestorationDecoder	CodeCode Available	2
Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code	Oct 2, 2023	Image GenerationText-based Image Editing	CodeCode Available	2
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator	Mar 3, 2025	Image Generation	CodeCode Available	2
DiSA: Diffusion Step Annealing in Autoregressive Image Generation	May 26, 2025	DenoisingImage Generation	CodeCode Available	2
DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents	Jul 3, 2024	Image GenerationMolecular Docking	CodeCode Available	2
Reformer: The Efficient Transformer	Jan 13, 2020	D4RLImage Generation	CodeCode Available	2
Geometry-Complete Diffusion for 3D Molecule Generation and Optimization	Feb 8, 2023	3D Molecule GenerationDenoising	CodeCode Available	2
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model	Mar 10, 2025	Image DescriptionImage Generation	CodeCode Available	2
FLAME Diffuser: Wildfire Image Synthesis using Mask Guided Diffusion	Mar 6, 2024	Fire DetectionImage Generation	CodeCode Available	1
A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models	Aug 12, 2024	Image Generation	CodeCode Available	1
First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending	Oct 14, 2024	Image GenerationText Generation	CodeCode Available	1
Finite Scalar Quantization: VQ-VAE Made Simple	Sep 27, 2023	ColorizationDepth Estimation	CodeCode Available	1
A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers	Oct 9, 2023	Heart SegmentationImage Generation	CodeCode Available	1
A Simple and Effective Baseline for Attentional Generative Adversarial Networks	Jun 26, 2023	Image Generation	CodeCode Available	1
FlexDiT: Dynamic Token Density Control for Diffusion Transformer	Dec 8, 2024	Computational EfficiencyDenoising	CodeCode Available	1
Finetuning CLIP to Reason about Pairwise Differences	Sep 15, 2024	AttributeContrastive Learning	CodeCode Available	1
A Shared Representation for Photorealistic Driving Simulators	Dec 9, 2021	Autonomous VehiclesImage Generation	CodeCode Available	1
A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis	Oct 29, 2021	3D-Aware Image Synthesis3D Shape Reconstruction	CodeCode Available	1
Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling	Jul 5, 2022	DiversityImage Generation	CodeCode Available	1
FineDiffusion: Scaling up Diffusion Models for Fine-grained Image Generation with 10,000 Classes	Feb 28, 2024	Conditional Image GenerationImage Generation	CodeCode Available	1
Feynman-Kac Correctors in Diffusion: Annealing, Guidance, and Product of Experts	Mar 4, 2025	Image GenerationText to Image Generation	CodeCode Available	1

Show:10 25 50

← PrevPage 15 of 134Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified