Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4151–4175 of 6689 papers

Title	Date	Tasks	Status
Score Distillation Sampling with Learned Manifold Corrective	Jan 10, 2024	DenoisingImage Generation	—Unverified
AI Art is Theft: Labour, Extraction, and Exploitation, Or, On the Dangers of Stochastic Pollocks	Jan 10, 2024	Image Generation	—Unverified
Content-Conditioned Generation of Stylized Free hand Sketches	Jan 9, 2024	Data AugmentationImage Generation	—Unverified
EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models	Jan 9, 2024	AttributeDiversity	—Unverified
Vision Reimagined: AI-Powered Breakthroughs in WiFi Indoor Imaging	Jan 9, 2024	Image Generation	—Unverified
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding	Jan 9, 2024	Image Captioningimage-classification	—Unverified
3D-SSGAN: Lifting 2D Semantics for 3D-Aware Compositional Portrait Synthesis	Jan 8, 2024	DisentanglementImage Generation	—Unverified
Detecting Face Synthesis Using a Concealed Fusion Model	Jan 8, 2024	Computer SecurityFace Generation	—Unverified
Data-Agnostic Face Image Synthesis Detection Using Bayesian CNNs	Jan 8, 2024	Anomaly DetectionImage Generation	—Unverified
Controllable Image Synthesis of Industrial Data Using Stable Diffusion	Jan 6, 2024	Defect DetectionImage Generation	—Unverified
A Dataset and Benchmark for Copyright Infringement Unlearning from Text-to-Image Diffusion Models	Jan 4, 2024	Image GenerationText to Image Generation	CodeCode Available
Improving Diffusion-Based Image Synthesis with Context Prediction	Jan 4, 2024	DecoderDenoising	—Unverified
A Vision Check-up for Language Models	Jan 3, 2024	Image GenerationRepresentation Learning	—Unverified
DDPM based X-ray Image Synthesizer	Jan 3, 2024	DenoisingImage Generation	—Unverified
Instruct-Imagen: Image Generation with Multi-modal Instruction	Jan 3, 2024	Image GenerationRetrieval	—Unverified
Few-shot Image Generation via Information Transfer from the Built Geodesic Surface	Jan 3, 2024	DiversityImage Generation	—Unverified
GeoPos: A Minimal Positional Encoding for Enhanced Fine-Grained Details in Image Synthesis Using Convolutional Neural Networks	Jan 3, 2024	DenoisingImage Generation	—Unverified
SSP: A Simple and Safe automatic Prompt engineering method towards realistic image synthesis on LVM	Jan 2, 2024	Image GenerationInstruction Following	—Unverified
Joint Generative Modeling of Scene Graphs and Images via Diffusion Models	Jan 2, 2024	Graph GenerationImage Generation	—Unverified
Adversarial Text to Continuous Image Generation	Jan 1, 2024	Adversarial TextImage Generation	—Unverified
Unmixing Before Fusion: A Generalized Paradigm for Multi-Source-based Hyperspectral Image Synthesis	Jan 1, 2024	Image Generation	—Unverified
PaReNeRF: Toward Fast Large-scale Dynamic NeRF with Patch-based Reference	Jan 1, 2024	Autonomous DrivingDecoder	—Unverified
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action	Jan 1, 2024	Image GenerationInstruction Following	—Unverified
Learning Coupled Dictionaries from Unpaired Data for Image Super-Resolution	Jan 1, 2024	DiversityImage Generation	—Unverified
Video Prediction by Modeling Videos as Continuous Multi-Dimensional Processes	Jan 1, 2024	Image GenerationPrediction	—Unverified

Show:10 25 50

← PrevPage 167 of 268Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified