Image Generation

Image generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

Unconditional generation draws samples $y$ from the data distribution without any conditioning input, i.e. $p(y)$.
Conditional image generation (a subtask) draws samples conditioned on a label $x$, i.e. $p(y|x)$.
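
The difference between $p(y)$ and $p(y|x)$ can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the scalar "images", the two labels, the per-label Gaussian standing in for a learned generator, and the helper names `fit`, `sample_conditional`, and `sample_unconditional`. It is not how any model on this page is implemented.

```python
import random
import statistics

# Hypothetical toy data: each label x maps to scalar "images" y.
DATA = {
    "cat": [0.9, 1.0, 1.1, 1.2],
    "dog": [4.8, 5.0, 5.1, 5.3],
}


def fit(data):
    """Fit one Gaussian per label plus the label prior p(x)."""
    total = sum(len(ys) for ys in data.values())
    return {
        x: (statistics.mean(ys), statistics.stdev(ys), len(ys) / total)
        for x, ys in data.items()
    }


def sample_conditional(model, x, rng):
    """Draw y ~ p(y|x): use only the Gaussian fitted for label x."""
    mean, std, _prior = model[x]
    return rng.gauss(mean, std)


def sample_unconditional(model, rng):
    """Draw y ~ p(y): pick a label from p(x), then draw from p(y|x).

    The label is discarded, so the caller only ever sees y.
    """
    labels = list(model)
    priors = [model[x][2] for x in labels]
    x = rng.choices(labels, weights=priors, k=1)[0]
    return sample_conditional(model, x, rng)


model = fit(DATA)
rng = random.Random(0)
print("conditional (dog):", sample_conditional(model, "dog", rng))
print("unconditional:    ", sample_unconditional(model, rng))
```

Conditional samples for "dog" cluster around the "dog" data only, while unconditional samples land in either cluster in proportion to how often each label occurs in the data.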

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.


Papers


Showing 601–650 of 6689 papers

Title	Date	Tasks	Status	Hype	Score
Generative Enhancement for 3D Medical Images	Mar 19, 2024	counterfactual, Image Generation	Code Available	2	5
GAUDI: A Neural Architect for Immersive 3D Scene Generation	Jul 27, 2022	Image Generation, Scene Generation	Code Available	2	5
GANSpace: Discovering Interpretable GAN Controls	Apr 6, 2020	Image Generation	Code Available	2	5
Gaussian Mixture Flow Matching Models	Apr 7, 2025	Denoising, Image Generation	Code Available	2	5
Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective	Aug 13, 2024	Image Generation, Synthetic Image Detection	Code Available	2	5
Improving the Stability and Efficiency of Diffusion Models for Content Consistent Super-Resolution	Dec 30, 2023	Decoder, Image Generation	Code Available	2	5
GAN Compression: Efficient Architectures for Interactive Conditional GANs	Mar 19, 2020	Image Generation, Neural Architecture Search	Code Available	2	5
Dense Text-to-Image Generation with Attention Modulation	Aug 24, 2023	Image Generation, Text to Image Generation	Code Available	2	5
GAN Prior Embedded Network for Blind Face Restoration in the Wild	May 13, 2021	Blind Face Restoration, Decoder	Code Available	2	5
Closed-Form Factorization of Latent Semantics in GANs	Jul 13, 2020	Attribute, Form	Code Available	2	5
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition	Feb 23, 2024	Image Generation, Personalized Image Generation	Code Available	2	5
From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation	Apr 23, 2024	Image Generation	Code Available	2	5
A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models	Jul 24, 2023	Image Generation, Image-text matching	Code Available	2	5
From Text to Pose to Image: Improving Diffusion Model Control and Quality	Nov 19, 2024	Image Generation, Prompt Engineering	Code Available	2	5
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos	Jul 23, 2024	Image Generation, Point Tracking	Code Available	2	5
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition	May 22, 2024	Image Generation	Code Available	2	5
Autonomous Improvement of Instruction Following Skills via Foundation Models	Jul 30, 2024	Image Generation, Instruction Following	Code Available	2	5
Character-Aware Models Improve Visual Text Rendering	Dec 20, 2022	Image Generation	Code Available	2	5
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis	Jan 30, 2023	Image Generation, Scene Understanding	Code Available	2	5
GenAI Arena: An Open Evaluation Platform for Generative Models	Jun 6, 2024	Image Generation, Instruction Following	Code Available	2	5
AutoPresent: Designing Structured Visuals from Scratch	Jan 1, 2025	Image Generation	Code Available	2	5
Generative Image as Action Models	Jul 10, 2024	Image Generation, Robot Manipulation	Code Available	2	5
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation	Oct 27, 2024	Image Generation, Text to Image Generation	Code Available	2	5
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner	Sep 26, 2024	Image Generation, Text to Image Generation	Code Available	2	5
Autoregressive Image Generation with Randomized Parallel Decoding	Mar 13, 2025	Conditional Image Generation, Image Generation	Code Available	2	5
Denoising Diffusion Probabilistic Models	Jun 19, 2020	Denoising, Density Estimation	Code Available	2	5
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens	Oct 17, 2024	Image Generation, Text to Image Generation	Code Available	2	5
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching	May 29, 2024	compressed sensing, Deblurring	Code Available	2	5
Flow Matching in Latent Space	Jul 17, 2023	Computational Efficiency, Image Generation	Code Available	2	5
Flux Already Knows -- Activating Subject-Driven Image Generation without Training	Apr 12, 2025	Image Generation, Virtual Try-on	Code Available	2	5
Flow-Guided Diffusion for Video Inpainting	Nov 26, 2023	Denoising, Image Generation	Code Available	2	5
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better	Jun 10, 2025	Image Generation	Code Available	2	5
Flow-Anchored Consistency Models	Jul 4, 2025	Image Generation	Code Available	2	5
BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities	Oct 18, 2024	Conditional Image Generation, Image Generation	Code Available	2	5
FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching	Dec 19, 2024	Image Generation, Prediction	Code Available	2	5
Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality	Mar 1, 2025	Image Enhancement, Image Generation	Code Available	2	5
Latent Guard: a Safety Framework for Text-to-image Generation	Apr 11, 2024	Contrastive Learning, Image Generation	Code Available	2	5
Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures	Nov 14, 2022	3D Generation, Image Generation	Code Available	2	5
Denoising Diffusion Bridge Models	Sep 29, 2023	Denoising, Image Generation	Code Available	2	5
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation	Mar 25, 2024	Denoising, Image Generation	Code Available	2	5
Denoising Diffusion Implicit Models	Oct 6, 2020	Denoising, Image Generation	Code Available	2	5
Diffusion Models for Tabular Data: Challenges, Current Progress, and Future Directions	Feb 24, 2025	Data Augmentation, Image Generation	Code Available	2	5
Agent Attention: On the Integration of Softmax and Linear Attention	Dec 14, 2023	Computational Efficiency, image-classification	Code Available	2	5
Fixed Point Diffusion Models	Jan 16, 2024	Denoising, Image Generation	Code Available	2	5
Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks	May 5, 2021	image-classification, Image Classification	Code Available	2	5
Denoising Diffusion Models for Plug-and-Play Image Restoration	May 15, 2023	Deblurring, Denoising	Code Available	2	5
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient	Nov 26, 2024	GPU, Image Generation	Code Available	2	5
Collaborative Neural Rendering using Anime Character Sheets	Jul 12, 2022	Image Generation, Image to 3D	Code Available	2	5
Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference	May 27, 2023	GPU, Image Generation	Code Available	2	5
FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction	Feb 27, 2025	Image Generation, Prediction	Code Available	2	5


Datasets: ImageNet 256x256, CIFAR-10, ImageNet 64x64, ImageNet 512x512, FFHQ 256 x 256, CelebA 64x64, ImageNet 32x32, LSUN Bedroom 256 x 256, STL-10, LSUN Churches 256 x 256, ImageNet 128x128, FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE + CLS	KID	7.95	—	Unverified