Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 701–750 of 6689 papers

Title	Date	Tasks	Status	Hype	Score
Diffusion Facial Forgery Detection	Jan 29, 2024	Image Generation	CodeCode Available	2	5
Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks	May 5, 2021	image-classificationImage Classification	CodeCode Available	2	5
BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion	Jul 20, 2023	Conditional Text-to-Image SynthesisDenoising	CodeCode Available	2	5
A Style-Based Generator Architecture for Generative Adversarial Networks	Dec 12, 2018	DisentanglementImage Generation	CodeCode Available	2	5
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in Videos	Jul 23, 2024	Image GenerationPoint Tracking	CodeCode Available	2	5
Flux Already Knows -- Activating Subject-Driven Image Generation without Training	Apr 12, 2025	Image GenerationVirtual Try-on	CodeCode Available	2	5
Diffusion-GAN: Training GANs with Diffusion	Jun 5, 2022	Image Generation	CodeCode Available	2	5
Diffusion Enhancement for Cloud Removal in Ultra-Resolution Remote Sensing Imagery	Jan 25, 2024	Cloud RemovalImage Generation	CodeCode Available	2	5
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation	Mar 25, 2024	DenoisingImage Generation	CodeCode Available	2	5
Planting a SEED of Vision in Large Language Model	Jul 16, 2023	Image GenerationImage to text	CodeCode Available	2	5
Diffusion Explainer: Visual Explanation for Text-to-image Stable Diffusion	May 4, 2023	Image Generation	CodeCode Available	2	5
Hybrid Fourier Score Distillation for Efficient One Image to 3D Object Generation	May 31, 2024	3D GenerationImage Generation	CodeCode Available	2	5
Cross-view Masked Diffusion Transformers for Person Image Synthesis	Feb 2, 2024	DenoisingImage Generation	CodeCode Available	2	5
Pose-Normalized Image Generation for Person Re-identification	Dec 6, 2017	Generative Adversarial NetworkImage Generation	CodeCode Available	2	5
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition	May 22, 2024	Image Generation	CodeCode Available	2	5
Progressive Distillation for Fast Sampling of Diffusion Models	Feb 1, 2022	Density EstimationImage Generation	CodeCode Available	2	5
Boosting Flow-based Generative Super-Resolution Models via Learned Prior	Mar 16, 2024	Image GenerationImage Super-Resolution	CodeCode Available	2	5
Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching	May 29, 2024	compressed sensingDeblurring	CodeCode Available	2	5
Flow Matching for Medical Image Synthesis: Bridging the Gap Between Speed and Quality	Mar 1, 2025	Image EnhancementImage Generation	CodeCode Available	2	5
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance	Apr 8, 2025	Image Generation	CodeCode Available	2	5
Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers	Jun 25, 2024	Image GenerationModel Compression	CodeCode Available	2	5
Flow Matching in Latent Space	Jul 17, 2023	Computational EfficiencyImage Generation	CodeCode Available	2	5
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner	Sep 26, 2024	Image GenerationText to Image Generation	CodeCode Available	2	5
RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models	Feb 20, 2024	DenoisingImage Generation	CodeCode Available	2	5
CTR-Driven Advertising Image Generation with Multimodal Large Language Models	Feb 5, 2025	Image GenerationReinforcement Learning (RL)	CodeCode Available	2	5
FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching	Dec 19, 2024	Image GenerationPrediction	CodeCode Available	2	5
CapHuman: Capture Your Moments in Parallel Universes	Feb 1, 2024	Image Generation	CodeCode Available	2	5
Boosting Latent Diffusion with Flow Matching	Dec 12, 2023	DecoderDiversity	CodeCode Available	2	5
Diffusion-Enhanced Test-time Adaptation with Text and Image Augmentation	Dec 12, 2024	Image AugmentationImage Generation	CodeCode Available	2	5
Flow-Guided Diffusion for Video Inpainting	Nov 26, 2023	DenoisingImage Generation	CodeCode Available	2	5
Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens	Oct 17, 2024	Image GenerationText to Image Generation	CodeCode Available	2	5
Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach	May 23, 2023	GPUImage Generation	CodeCode Available	2	5
RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation	Nov 17, 2022	3D Generation3D Reconstruction	CodeCode Available	2	5
From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation	Apr 23, 2024	Image Generation	CodeCode Available	2	5
Geometry-Complete Diffusion for 3D Molecule Generation and Optimization	Feb 8, 2023	3D Molecule GenerationDenoising	CodeCode Available	2	5
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer	Mar 25, 2023	Image Generation	CodeCode Available	2	5
Semantic Image Synthesis via Diffusion Models	Jun 30, 2022	DecoderDenoising	CodeCode Available	2	5
DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation	Oct 6, 2021	AttributeImage Generation	CodeCode Available	1	5
Diffusion Cocktail: Mixing Domain-Specific Diffusion Models for Diversified Image Generations	Dec 12, 2023	Image GenerationImage Manipulation	CodeCode Available	1	5
A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models	Aug 12, 2024	Image Generation	CodeCode Available	1	5
FLAME Diffuser: Wildfire Image Synthesis using Mask Guided Diffusion	Mar 6, 2024	Fire DetectionImage Generation	CodeCode Available	1	5
Diffusion-based Image Generation for In-distribution Data Augmentation in Surface Defect Detection	Jun 1, 2024	Data AugmentationDefect Detection	CodeCode Available	1	5
Diffusion-based Data Augmentation for Nuclei Image Segmentation	Oct 22, 2023	Data AugmentationImage Generation	CodeCode Available	1	5
Diffusion-Based Scene Graph to Image Generation with Masked Contrastive Pre-Training	Nov 21, 2022	Image Generation	CodeCode Available	1	5
A Simple and Robust Framework for Cross-Modality Medical Image Segmentation applied to Vision Transformers	Oct 9, 2023	Heart SegmentationImage Generation	CodeCode Available	1	5
A Simple and Effective Baseline for Attentional Generative Adversarial Networks	Jun 26, 2023	Image Generation	CodeCode Available	1	5
FlexDiT: Dynamic Token Density Control for Diffusion Transformer	Dec 8, 2024	Computational EfficiencyDenoising	CodeCode Available	1	5
Diffusion Autoencoders: Toward a Meaningful and Decodable Representation	Nov 30, 2021	AttributeDecoder	CodeCode Available	1	5
A Shared Representation for Photorealistic Driving Simulators	Dec 9, 2021	Autonomous VehiclesImage Generation	CodeCode Available	1	5
Diffusion-based Blind Text Image Super-Resolution	Dec 13, 2023	Image GenerationImage Super-Resolution	CodeCode Available	1	5

Show:10 25 50

← PrevPage 15 of 134Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified