Image Generation

Image generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

Unconditional generation refers to sampling new images from the data distribution without any conditioning input, i.e. $p(y)$, where $y$ is the image.
Conditional image generation (a subtask) refers to sampling images given a conditioning input such as a label $x$, i.e. $p(y|x)$.
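
The difference between $p(y)$ and $p(y|x)$ can be sketched with a toy model. The example below is only an illustration under stated assumptions: each "image" $y$ is a single number, the labels and values are made up, and a per-label Gaussian stands in for a real generative model. All function names are hypothetical.

```python
import random
import statistics

# Toy "dataset": each item is (label x, one-pixel "image" y). Values are made up.
DATA = [("dark", 0.10), ("dark", 0.15), ("dark", 0.20),
        ("bright", 0.80), ("bright", 0.85), ("bright", 0.90)]

def fit(data):
    """Fit one Gaussian per label; returns {label: (mean, std, weight)}."""
    model = {}
    for label in sorted({x for x, _ in data}):
        ys = [y for x, y in data if x == label]
        weight = len(ys) / len(data)  # empirical p(x)
        model[label] = (statistics.mean(ys), statistics.stdev(ys), weight)
    return model

def sample_conditional(model, label, rng):
    """Draw y ~ p(y | x = label)."""
    mean, std, _ = model[label]
    return rng.gauss(mean, std)

def sample_unconditional(model, rng):
    """Draw y ~ p(y) = sum over x of p(x) * p(y | x).

    Pick a label according to its weight, then sample from that label's Gaussian.
    """
    labels = list(model)
    weights = [model[name][2] for name in labels]
    label = rng.choices(labels, weights=weights, k=1)[0]
    return sample_conditional(model, label, rng)

rng = random.Random(0)  # fixed seed so runs are repeatable
model = fit(DATA)
bright_samples = [sample_conditional(model, "bright", rng) for _ in range(1000)]
any_samples = [sample_unconditional(model, rng) for _ in range(1000)]
```

Conditional samples cluster around the values seen for the requested label, while unconditional samples cover the whole dataset in proportion to how often each label occurs.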

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers


Showing 1451–1500 of 6689 papers

Title	Date	Tasks	Status	Hype	Score
Deep Image Spatial Transformation for Person Image Generation	Mar 2, 2020	Image Generation	Code Available	1	5
Edge Guided GANs with Multi-Scale Contrastive Learning for Semantic Image Synthesis	Jul 22, 2023	Contrastive Learning, Image Generation	Code Available	1	5
Freestyle Layout-to-Image Synthesis	Mar 25, 2023	image-classification, Image Classification	Code Available	1	5
Freeze the Discriminator: a Simple Baseline for Fine-Tuning GANs	Feb 25, 2020	10-shot image generation, Image Generation	Code Available	1	5
LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation	Aug 9, 2023	Image Generation, In-Context Learning	Code Available	1	5
Learning Flow-based Feature Warping for Face Frontalization with Illumination Inconsistent Supervision	Aug 16, 2020	Face Generation, Image Generation	Code Available	1	5
Learning Semantic-aware Normalization for Generative Adversarial Networks	Dec 1, 2020	Image Generation, Image Inpainting	Code Available	1	5
Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network	Jan 16, 2024	Image Generation, Image Restoration	Code Available	1	5
LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation	Jun 30, 2024	Attribute, Image Generation	Code Available	1	5
DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation	Apr 9, 2025	Image Generation, Text to Image Generation	Code Available	1	5
Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection	Mar 18, 2024	Anomaly Detection, Denoising	Code Available	1	5
From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution	Oct 3, 2022	Image Generation, Image Super-Resolution	Code Available	1	5
Dual Pyramid Generative Adversarial Networks for Semantic Image Synthesis	Oct 8, 2022	Generative Adversarial Network, Image Generation	Code Available	1	5
Latent diffusion models for generative precipitation nowcasting with accurate uncertainty quantification	Apr 25, 2023	Image Generation, Uncertainty Quantification	Code Available	1	5
Deep Novel View Synthesis from Colored 3D Point Clouds	Aug 1, 2020	Image Generation, Novel View Synthesis	Code Available	1	5
Latent Normalizing Flows for Many-to-Many Cross-Domain Mappings	Feb 16, 2020	Image Captioning, Image Generation	Code Available	1	5
Dual-Domain Image Synthesis using Segmentation-Guided GAN	Apr 19, 2022	Caricature, Image Generation	Code Available	1	5
Latent Denoising Diffusion GAN: Faster sampling, Higher image quality	Jun 17, 2024	Denoising, Diversity	Code Available	1	5
Aligning Text to Image in Diffusion Models is Easier Than You Think	Mar 11, 2025	Contrastive Learning, Image Generation	Code Available	1	5
CookGAN: Meal Image Synthesis from Ingredients	Feb 25, 2020	Image Generation	Code Available	1	5
Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging	Apr 11, 2025	Attribute, Computational Efficiency	Code Available	1	5
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models	Mar 24, 2025	Image Generation, Super-Resolution	Code Available	1	5
Image Shape Manipulation from a Single Augmented Training Sample	Jul 2, 2020	Image Generation, Image Manipulation	Code Available	1	5
Bipartite Graph Reasoning GANs for Person Image Generation	Aug 10, 2020	Image Generation, Pose Transfer	Code Available	1	5
Deep Sketch-guided Cartoon Video Inbetweening	Aug 10, 2020	Image Generation, Occlusion Estimation	Code Available	1	5
Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation	Aug 27, 2020	Image Animation, Image Generation	Code Available	1	5
Fuzz-Testing Meets LLM-Based Agents: An Automated and Efficient Framework for Jailbreaking Text-To-Image Generation Models	Aug 1, 2024	Image Generation, In-Context Learning	Code Available	1	5
Deep Unrolled Low-Rank Tensor Completion for High Dynamic Range Imaging	Sep 1, 2022	HDR Reconstruction, Image Generation	Code Available	1	5
DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning	Oct 13, 2021	Font Generation, Image Generation	Code Available	1	5
AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars	Oct 12, 2022	Disentanglement, Face Model	Code Available	1	5
BIVA: A Very Deep Hierarchy of Latent Variables for Generative Modeling	Feb 6, 2019	Anomaly Detection, Attribute	Code Available	1	5
AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration	Sep 19, 2023	Image Generation, single-image-generation	Code Available	1	5
CooGAN: A Memory-Efficient Framework for High-Resolution Facial Attribute Editing	Nov 3, 2020	Attribute, Image Generation	Code Available	1	5
Large Scale GAN Training for High Fidelity Natural Image Synthesis	Sep 28, 2018	Conditional Image Generation, Image Generation	Code Available	1	5
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models	Dec 5, 2023	Image Generation, Model Selection	Code Available	1	5
Dual Contrastive Loss and Attention for GANs	Mar 31, 2021	Image Generation, Unconditional Image Generation	Code Available	1	5
Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction	Dec 5, 2020	Image Generation	Code Available	1	5
LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis	May 19, 2023	Conditional Image Generation, Conditional Text-to-Image Synthesis	Code Available	1	5
Artificial Fingerprinting for Generative Models: Rooting Deepfake Attribution in Training Data	Jul 16, 2020	DeepFake Detection, Face Swapping	Code Available	1	5
GAN Path Finder: Preliminary results	Aug 5, 2019	Heuristic Search, Image Generation	Code Available	1	5
An Image is Worth Multiple Words: Discovering Object Level Concepts using Multi-Concept Prompt Learning	Oct 18, 2023	Image Generation, Prompt Learning	Code Available	1	5
GANs Can Play Lottery Tickets Too	May 31, 2021	Image Generation, Image-to-Image Translation	Code Available	1	5
Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis	Jul 10, 2024	Data Augmentation, Few-Shot Learning	Code Available	1	5
Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces	May 18, 2023	Image Generation	Code Available	1	5
GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium	Jun 26, 2017	Emotion Recognition in Conversation, Image Generation	Code Available	1	5
DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design	Jul 22, 2024	Image Generation, Language Modelling	Code Available	1	5
Drop the GAN: In Defense of Patches Nearest Neighbors as Single Image Generative Models	Mar 29, 2021	Image Generation, Image Manipulation	Code Available	1	5
Dual Attention GANs for Semantic Image Synthesis	Aug 29, 2020	Image Generation, Position	Code Available	1	5
Dual-Diffusion: Dual Conditional Denoising Diffusion Probabilistic Models for Blind Super-Resolution Reconstruction in RSIs	May 20, 2023	Blind Super-Resolution, Denoising	Code Available	1	5
LatentCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions	Apr 2, 2021	Contrastive Learning, Image Generation	Code Available	1	5



Datasets: ImageNet 256x256, CIFAR-10, ImageNet 64x64, ImageNet 512x512, FFHQ 256 x 256, CelebA 64x64, ImageNet 32x32, LSUN Bedroom 256 x 256, STL-10, LSUN Churches 256 x 256, ImageNet 128x128, FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE + CLS	KID	7.95	—	Unverified
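
For reference, the FID (Fréchet Inception Distance) reported in these tables is conventionally defined as below, where lower values are better. This is the standard definition rather than anything stated on this page: $\mu_r, \Sigma_r$ and $\mu_g, \Sigma_g$ are assumed to be the mean and covariance of Inception-network features for real and generated images, and the individual entries may differ in feature extractor or number of samples used.

$$
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2 + \operatorname{Tr}\!\left( \Sigma_r + \Sigma_g - 2 \left( \Sigma_r \Sigma_g \right)^{1/2} \right)
$$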