Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2176–2200 of 6689 papers

Title	Date	Tasks	Status	Hype
Amortizing intractable inference in diffusion models for vision, language, and control	May 31, 2024	continuous-controlContinuous Control	CodeCode Available	1
Enhancing Counterfactual Image Generation Using Mahalanobis Distance with Distribution Preferences in Feature Space	May 31, 2024	counterfactualCounterfactual Explanation	—Unverified	0
Learning 3D Robotics Perception using Inductive Priors	May 30, 2024	3D ReconstructionImage Generation	—Unverified	0
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow	May 30, 2024	DiversityImage Generation	CodeCode Available	1
RIGID: A Training-free and Model-Agnostic Framework for Robust AI-Generated Image Detection	May 30, 2024	Image Generation	—Unverified	0
Boost Your Human Image Generation Model via Direct Preference Optimization	May 30, 2024	AnatomyImage Generation	—Unverified	0
Improving the Training of Rectified Flows	May 30, 2024	Image GenerationKnowledge Distillation	CodeCode Available	2
ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections	May 30, 2024	Image Generation	CodeCode Available	1
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture	May 29, 2024	Image GenerationVideo Generation	CodeCode Available	7
SketchDeco: Decorating B&W Sketches with Colour	May 29, 2024	Image ColorizationImage Generation	CodeCode Available	2
Topological Perspectives on Optimal Multimodal Embedding Spaces	May 29, 2024	Image GenerationText to Image Generation	—Unverified	0
Inpaint Biases: A Pathway to Accurate and Unbiased Image Generation	May 29, 2024	Image Generation	—Unverified	0
Zero-to-Hero: Enhancing Zero-Shot Novel View Synthesis via Attention Map Filtering	May 29, 2024	DenoisingImage Generation	—Unverified	0
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching	May 29, 2024	compressed sensingDeblurring	CodeCode Available	2
MemControl: Mitigating Memorization in Diffusion Models via Automated Parameter Selection	May 29, 2024	Image GenerationMedical Image Generation	CodeCode Available	0
SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation	May 29, 2024	Image GenerationImage Retrieval	—Unverified	0
Quo Vadis ChatGPT? From Large Language Models to Large Knowledge Models	May 29, 2024	Image GenerationText Summarization	—Unverified	0
The ethical situation of DALL-E 2	May 29, 2024	Image Generation	—Unverified	0
Patch-enhanced Mask Encoder Prompt Image Generation	May 29, 2024	Image Generation	—Unverified	0
Going beyond Compositions, DDPMs Can Produce Zero-Shot Interpolations	May 29, 2024	DenoisingImage Generation	CodeCode Available	0
ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning	May 29, 2024	Image GenerationText to Image Generation	CodeCode Available	1
Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers?	May 28, 2024	DiagnosticImage Generation	—Unverified	0
Phased Consistency Models	May 28, 2024	Image GenerationVideo Generation	CodeCode Available	4
Multi-modal Generation via Cross-Modal In-Context Learning	May 28, 2024	Image GenerationIn-Context Learning	CodeCode Available	0
MindFormer: Semantic Alignment of Multi-Subject fMRI for Brain Decoding	May 28, 2024	Brain DecodingImage Generation	—Unverified	0

Show:10 25 50

← PrevPage 88 of 268Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified