Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4426–4450 of 6689 papers

Title	Date	Tasks	Status	Hype
Synthetic ID Card Image Generation for Improving Presentation Attack Detection	Oct 31, 2022	Fraud DetectionImage Generation	—Unverified	0
gCoRF: Generative Compositional Radiance Fields	Oct 31, 2022	Image Generation	—Unverified	0
Recursive Reasoning in Minimax Games: A Level k Gradient Play Method	Oct 29, 2022	GPUImage Generation	CodeCode Available	0
Few-shot Image Generation via Adaptation-Aware Kernel Modulation	Oct 29, 2022	10-shot image generationDomain Adaptation	CodeCode Available	1
UPainting: Unified Text-to-Image Diffusion Generation with Cross-modal Guidance	Oct 28, 2022	Image GenerationImage-text matching	—Unverified	0
Latent Space is Feature Space: Regularization Term for GANs Training on Limited Dataset	Oct 28, 2022	Data AugmentationDiversity	CodeCode Available	0
Accelerating Diffusion Models via Pre-segmentation Diffusion Sampling for Medical Image Segmentation	Oct 27, 2022	Conditional Image GenerationDenoising	—Unverified	0
A Generic Shared Attention Mechanism for Various Backbone Neural Networks	Oct 27, 2022	Data Augmentationimage-classification	—Unverified	0
ScoreMix: A Scalable Augmentation Strategy for Training GANs with Limited Data	Oct 27, 2022	Data AugmentationDiversity	—Unverified	0
SSD: Towards Better Text-Image Consistency Metric in Text-to-Image Generation	Oct 27, 2022	Image GenerationSemantic Similarity	CodeCode Available	0
Deep Generative Models on 3D Representations: A Survey	Oct 27, 2022	3D-Aware Image Synthesis3D Shape Generation	CodeCode Available	3
ERNIE-ViLG 2.0: Improving Text-to-Image Diffusion Model with Knowledge-Enhanced Mixture-of-Denoising-Experts	Oct 27, 2022	DenoisingImage Generation	CodeCode Available	0
Few-shot Image Generation via Masked Discrimination	Oct 27, 2022	DiversityImage Generation	CodeCode Available	0
How well can Text-to-Image Generative Models understand Ethical Natural Language Interventions?	Oct 27, 2022	Cultural Vocal Bursts Intensity PredictionDiversity	CodeCode Available	1
Towards Practicality of Sketch-Based Visual Understanding	Oct 27, 2022	Image GenerationImage Retrieval	—Unverified	0
Towards the Detection of Diffusion Model Deepfakes	Oct 26, 2022	AttributeImage Generation	CodeCode Available	1
TPFNet: A Novel Text In-painting Transformer for Text Removal	Oct 26, 2022	Image GenerationSegmentation	CodeCode Available	1
A Survey on Deep Generative 3D-aware Image Synthesis	Oct 25, 2022	3D-Aware Image SynthesisImage Generation	CodeCode Available	1
Lafite2: Few-shot Text-to-Image Generation	Oct 25, 2022	Image GenerationRetrieval	—Unverified	0
Learning Latent Structural Causal Models	Oct 24, 2022	Bayesian InferenceImage Generation	—Unverified	0
High-Resolution Image Editing via Multi-Stage Blended Diffusion	Oct 24, 2022	Image GenerationImage Inpainting	CodeCode Available	1
Photo-realistic Neural Domain Randomization	Oct 23, 2022	Depth EstimationImage Generation	—Unverified	0
A Visual Tour Of Current Challenges In Multimodal Language Models	Oct 22, 2022	Image GenerationText to Image Generation	—Unverified	0
Instance-Aware Image Completion	Oct 22, 2022	Image Generationobject-detection	—Unverified	0
Efficient Hair Style Transfer with Generative Adversarial Networks	Oct 22, 2022	Image GenerationStyle Transfer	—Unverified	0

Show:10 25 50

← PrevPage 178 of 268Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified