Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5826–5850 of 6689 papers

Title	Date	Tasks	Status
Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning	Oct 17, 2024	Data AugmentationImage Generation	CodeCode Available
Active Generation for Image Classification	Mar 11, 2024	Active LearningClassification	CodeCode Available
Triangle Generative Adversarial Networks	Sep 19, 2017	AttributeGenerative Adversarial Network	CodeCode Available
Learning Modality-Aware Representations: Adaptive Group-wise Interaction Network for Multimodal MRI Synthesis	Nov 22, 2024	Image GenerationImage-to-Image Translation	CodeCode Available
TorchGAN: A Flexible Framework for GAN Training and Evaluation	Sep 8, 2019	Conditional Image GenerationImage Generation	CodeCode Available
Improved Variational Inference with Inverse Autoregressive Flow	Dec 1, 2016	Image GenerationVariational Inference	CodeCode Available
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework	Feb 7, 2022	Image Captioningimage-classification	CodeCode Available
Improving Fine-Grained Control via Aggregation of Multiple Diffusion Models	Oct 2, 2024	Image Generation	CodeCode Available
Phase-Based Frame Interpolation for Video	Jun 1, 2015	global-optimizationImage Generation	CodeCode Available
A Prompt Log Analysis of Text-to-Image Generation Systems	Mar 8, 2023	Image GenerationText to Image Generation	CodeCode Available
Diverse Image Generation with Diffusion Models and Cross Class Label Learning for Polyp Classification	Feb 8, 2025	Image GenerationMedical Procedure	CodeCode Available
Exposing Image Splicing Traces in Scientific Publications via Uncertainty-guided Refinement	Sep 28, 2023	Image ForensicsImage Generation	CodeCode Available
Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation	Nov 18, 2024	Cell SegmentationImage Generation	CodeCode Available
Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination	May 25, 2024	Image Generation	CodeCode Available
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation	Jun 15, 2024	AudioCapsImage Generation	CodeCode Available
MintNet: Building Invertible Neural Networks with Masked Convolutions	Jul 18, 2019	Image Generation	CodeCode Available
MirrorGAN: Learning Text-to-image Generation by Redescription	Mar 14, 2019	DiversityImage Generation	CodeCode Available
Improving Compositional Generation with Diffusion Models Using Lift Scores	May 19, 2025	Image GenerationPosition	CodeCode Available
Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network	Feb 26, 2018	Image GenerationSemantic Similarity	CodeCode Available
MISFIT-V: Misaligned Image Synthesis and Fusion using Information from Thermal and Visual	Sep 22, 2023	Generative Adversarial NetworkImage Generation	CodeCode Available
FaceForensics++: Learning to Detect Manipulated Facial Images	Oct 1, 2019	Face SwappingImage Generation	CodeCode Available
Diffusion-HMC: Parameter Inference with Diffusion-model-driven Hamiltonian Monte Carlo	May 8, 2024	Image Generation	CodeCode Available
Analyzing the Feature Extractor Networks for Face Image Synthesis	Jun 4, 2024	BenchmarkingImage Generation	CodeCode Available
Improving Diffusion-Based Generative Models via Approximated Optimal Transport	Mar 8, 2024	Image Generation	CodeCode Available
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding	Aug 29, 2024	Data AugmentationImage Generation	CodeCode Available

Show:10 25 50

← PrevPage 234 of 268Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified