Image Generation

Image generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

Unconditional generation refers to sampling from the data distribution without any conditioning input, i.e. $p(y)$.
Conditional image generation (a subtask) refers to sampling conditioned on a label $x$, i.e. $p(y|x)$.
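
The distinction between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. It assumes a toy dataset of (label, sample) pairs and uses resampling from that dataset as a stand-in for a learned generative model; the names `DATASET`, `sample_unconditional`, and `sample_conditional` are hypothetical and not part of any library.

```python
import random

# Toy "dataset" of (label x, sample y) pairs; the entries are made up for illustration.
DATASET = [
    ("cat", "cat_img_0"),
    ("cat", "cat_img_1"),
    ("dog", "dog_img_0"),
    ("dog", "dog_img_1"),
    ("dog", "dog_img_2"),
]


def sample_unconditional(rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, y = rng.choice(DATASET)
    return y


def sample_conditional(rng, x):
    """Draw y ~ p(y | x): only samples whose label equals x are eligible."""
    candidates = [y for label, y in DATASET if label == x]
    if not candidates:
        raise ValueError(f"no samples with label {x!r}")
    return rng.choice(candidates)


rng = random.Random(0)
print(sample_unconditional(rng))       # may be a cat or a dog image
print(sample_conditional(rng, "cat"))  # always a cat image
```

A real generator would produce novel images rather than return stored ones; the sketch only shows how the conditioning label restricts what can be sampled.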

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers


Showing 1951–2000 of 6689 papers

Title	Date	Tasks	Status	Hype	Score
Enlisting 3D Crop Models and GANs for More Data Efficient and Generalizable Fruit Detection	Aug 30, 2021	Domain Adaptation, Generative Adversarial Network	Code Available	1	5
Nested Hierarchical Transformer: Towards Accurate, Data-Efficient and Interpretable Visual Understanding	May 26, 2021	Decoder, Image Classification	Code Available	1	5
Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models	Mar 11, 2024	Image Generation	Code Available	1	5
CoDi: Conditional Diffusion Distillation for Higher-Fidelity and Faster Image Generation	Oct 2, 2023	Image Enhancement, Image Generation	Code Available	1	5
Multi-marginal Wasserstein GAN	Nov 3, 2019	Image Generation, Translation	Code Available	1	5
A Characteristic Function Approach to Deep Implicit Generative Modeling	Sep 16, 2019	Image Generation	Code Available	1	5
Ensembling Off-the-shelf Models for GAN Training	Dec 16, 2021	Image Generation	Code Available	1	5
A Survey on Leveraging Pre-trained Generative Adversarial Networks for Image Editing and Restoration	Jul 21, 2022	Image Generation, Image Restoration	Code Available	1	5
Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using Diffusion Models	Nov 28, 2023	Image Generation, Scene Text Detection	Code Available	1	5
Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction	Dec 5, 2020	Image Generation	Code Available	1	5
Multi-Modality Generative Adversarial Networks with Tumor Consistency Loss for Brain MR Image Synthesis	May 2, 2020	Generative Adversarial Network, Image Generation	Code Available	1	5
Multimodal Image Synthesis with Conditional Implicit Maximum Likelihood Estimation	Apr 7, 2020	Image Generation, Image Super-Resolution	Code Available	1	5
ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections	May 30, 2024	Image Generation	Code Available	1	5
One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild	Sep 15, 2024	GPU, Image Generation	Code Available	1	5
Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion	May 26, 2025	Denoising, Image Generation	Code Available	1	5
E2NeRF: Event Enhanced Neural Radiance Fields from Blurry Images	Jan 1, 2023	Camera Pose Estimation, Deblurring	Code Available	1	5
ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face Synthesis	Apr 21, 2024	3D-Aware Image Synthesis, Contrastive Learning	Code Available	1	5
EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs	Nov 30, 2021	GPU, Image Generation	Code Available	1	5
Omni-Dish: Photorealistic and Faithful Image Generation and Editing for Arbitrary Chinese Dishes	Apr 14, 2025	Image Generation, Large Language Model	Code Available	1	5
UIEDP: Underwater Image Enhancement with Diffusion Prior	Dec 11, 2023	Image Enhancement, Image Generation	Code Available	1	5
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models	Oct 28, 2024	Adversarial Text, Image Generation	Code Available	1	5
Omni-GAN: On the Secrets of cGANs and Beyond	Nov 26, 2020	Conditional Image Generation, Generative Adversarial Network	Code Available	1	5
A Survey on Deep Generative 3D-aware Image Synthesis	Oct 25, 2022	3D-Aware Image Synthesis, Image Generation	Code Available	1	5
Uncurated Image-Text Datasets: Shedding Light on Demographic Bias	Apr 6, 2023	Image Captioning, Image Generation	Code Available	1	5
EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models	Jan 9, 2024	Denoising, Image Generation	Code Available	1	5
Conceptrol: Concept Control of Zero-shot Personalized Image Generation	Mar 9, 2025	Image Generation, Personalized Image Generation	Code Available	1	5
Octree Transformer: Autoregressive 3D Shape Generation on Hierarchically Structured Sequences	Nov 24, 2021	3D Shape Generation, Image Generation	Code Available	1	5
Off-Policy Reinforcement Learning for Efficient and Effective GAN Architecture Search	Jul 17, 2020	Generative Adversarial Network, GPU	Code Available	1	5
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning	May 29, 2024	Image Generation, Text to Image Generation	Code Available	1	5
Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis	Apr 13, 2022	3D-Aware Image Synthesis, 3D geometry	Code Available	1	5
Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models	Jun 16, 2023	Denoising, Image Generation	Code Available	1	5
NViSII: A Scriptable Tool for Photorealistic Image Generation	May 28, 2021	Image Generation, Optical Flow Estimation	Code Available	1	5
End-to-End Diffusion Latent Optimization Improves Classifier Guidance	Mar 23, 2023	Denoising, Image Generation	Code Available	1	5
Object-Centric Slot Diffusion	Mar 20, 2023	Image Generation, Image Segmentation	Code Available	1	5
ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis	Nov 11, 2024	Image Generation	Code Available	1	5
Concept Conductor: Orchestrating Multiple Personalized Concepts in Text-to-Image Synthesis	Aug 7, 2024	Attribute, Image Generation	Code Available	1	5
End-to-End Differentiable Learning to HDR Image Synthesis for Multi-exposure Images	Jun 29, 2020	Image Generation, Image Reconstruction	Code Available	1	5
Enhanced Balancing GAN: Minority-class Image Generation	Oct 31, 2020	Image Generation	Code Available	1	5
OCR-VQGAN: Taming Text-within-Image Generation	Oct 19, 2022	Articles, Decoder	Code Available	1	5
Edge Guided GANs with Multi-Scale Contrastive Learning for Semantic Image Synthesis	Jul 22, 2023	Contrastive Learning, Image Generation	Code Available	1	5
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks	May 24, 2025	Image Generation, Instruction Following	Code Available	1	5
Edge-preserving noise for diffusion models	Oct 2, 2024	Denoising, Image Generation	Code Available	1	5
One-Shot Structure-Aware Stylized Image Synthesis	Feb 27, 2024	Image Generation, Image Stylization	Code Available	1	5
NaturalInversion: Data-Free Image Synthesis Improving Real-World Consistency	Jun 29, 2023	Image Generation, Knowledge Distillation	Code Available	1	5
Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization	Oct 16, 2024	Image Generation	Code Available	1	5
Elucidating the solution space of extended reverse-time SDE for diffusion models	Sep 12, 2023	Image Generation	Code Available	1	5
Elucidating the design space of language models for image generation	Oct 21, 2024	Image Generation, Text Generation	Code Available	1	5
Negative Data Augmentation	Feb 9, 2021	Action Recognition, Anomaly Detection	Code Available	1	5
Elucidating the Exposure Bias in Diffusion Models	Aug 29, 2023	Attribute, Image Generation	Code Available	1	5
No Token Left Behind: Explainability-Aided Image Classification and Generation	Apr 11, 2022	image-classification, Image Classification	Code Available	1	5



Datasets: ImageNet 256x256, CIFAR-10, ImageNet 64x64, ImageNet 512x512, FFHQ 256 x 256, CelebA 64x64, ImageNet 32x32, LSUN Bedroom 256 x 256, STL-10, LSUN Churches 256 x 256, ImageNet 128x128, FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE + CLS	KID	7.95	—	Unverified
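
Most entries in the tables above report FID (Fréchet Inception Distance), where lower values indicate generated images whose feature statistics are closer to those of real images. The sketch below is a simplified illustration only: it computes the Fréchet distance between two Gaussians fitted to feature vectors, assuming diagonal covariances so that no matrix square root is needed. Standard FID uses full covariance matrices of Inception network features, so this code would not reproduce the leaderboard numbers. The function names are hypothetical.

```python
import math


def mean_and_variance(features):
    """Per-dimension mean and population variance of a list of feature vectors."""
    n = len(features)
    dim = len(features[0])
    means = [sum(f[d] for f in features) / n for d in range(dim)]
    variances = [
        sum((f[d] - means[d]) ** 2 for f in features) / n for d in range(dim)
    ]
    return means, variances


def frechet_distance_diagonal(real_features, generated_features):
    """Frechet distance between two Gaussians with diagonal covariance.

    With diagonal covariances the general formula
        ||mu_r - mu_g||^2 + Tr(S_r + S_g - 2 (S_r S_g)^(1/2))
    reduces to a sum over dimensions of
        (mu_r - mu_g)^2 + (sigma_r - sigma_g)^2.
    This is an assumption made for brevity, not how standard FID is computed.
    """
    mu_r, var_r = mean_and_variance(real_features)
    mu_g, var_g = mean_and_variance(generated_features)
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_r, mu_g))
    cov_term = sum(
        (math.sqrt(a) - math.sqrt(b)) ** 2 for a, b in zip(var_r, var_g)
    )
    return mean_term + cov_term


# Made-up 2-D "features"; real FID features come from an Inception network.
real = [[0.0, 0.0], [2.0, 2.0]]
generated = [[1.0, 0.0], [3.0, 2.0]]
print(frechet_distance_diagonal(real, generated))
```

Identical feature sets give a distance of zero, and the distance grows as the means or spreads of the two sets diverge.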