Image Generation

Image Generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

Unconditional generation refers to sampling from the data distribution without any conditioning input, i.e. $p(y)$, where $y$ is an image.
Conditional image generation (a subtask) refers to generating samples conditioned on a label $x$, i.e. $p(y|x)$.
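
The difference between the two settings can be sketched with a toy example. This is only an illustration, not how any model on this page works: it resamples from a tiny hand-made dataset instead of using a learned generative model, and the names `DATASET`, `sample_unconditional`, and `sample_conditional` are hypothetical.

```python
import random

# Hypothetical toy dataset of (label x, sample y) pairs; strings stand in for images.
DATASET = [
    ("cat", "cat_img_0"), ("cat", "cat_img_1"),
    ("dog", "dog_img_0"), ("dog", "dog_img_1"), ("dog", "dog_img_2"),
]


def sample_unconditional(rng):
    """Draw y ~ p(y): pick any sample, ignoring its label."""
    _, y = rng.choice(DATASET)
    return y


def sample_conditional(rng, label):
    """Draw y ~ p(y | x = label): pick only among samples with that label."""
    candidates = [y for x, y in DATASET if x == label]
    if not candidates:
        raise ValueError(f"no samples with label {label!r}")
    return rng.choice(candidates)


rng = random.Random(0)
print(sample_unconditional(rng))       # may be a cat or a dog sample
print(sample_conditional(rng, "cat"))  # always a cat sample
```

A real generator replaces the lookup with a learned model, but the interface is the same: the conditional sampler takes the label $x$ as an extra input.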

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers


Showing 51–100 of 6689 papers

Title	Date	Tasks	Status	Hype
Video Perception Models for 3D Scene Synthesis	Jun 25, 2025	3D Reconstruction, Image Generation	Unverified	0
EAR: Erasing Concepts from Unified Autoregressive Models	Jun 25, 2025	Image Generation	Code Available	0
Angio-Diff: Learning a Self-Supervised Adversarial Diffusion Model for Angiographic Geometry Generation	Jun 24, 2025	Image Generation	Code Available	0
SoK: Can Synthetic Images Replace Real Data? A Survey of Utility and Privacy of Synthetic Image Generation	Jun 24, 2025	Image Generation, Privacy Preserving	Unverified	0
OmniGen2: Exploration to Advanced Multimodal Generation	Jun 23, 2025	Image Generation, multimodal generation	Code Available	7
Morse: Dual-Sampling for Lossless Acceleration of Diffusion Models	Jun 23, 2025	Image Generation	Code Available	1
ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation	Jun 22, 2025	GPU, Image Generation	Code Available	3
Programmable-Room: Interactive Textured 3D Room Meshes Generation Empowered by Large Language Models	Jun 21, 2025	Attribute, Image Generation	Unverified	0
AI's Blind Spots: Geographic Knowledge and Diversity Deficit in Generated Urban Scenario	Jun 20, 2025	Diversity, Entity Disambiguation	Unverified	0
DreamCube: 3D Panorama Generation via Multi-plane Synchronization	Jun 20, 2025	Depth Estimation, Image Generation	Unverified	0
The Hidden Cost of an Image: Quantifying the Energy Consumption of AI Image Generation	Jun 20, 2025	Image Generation, Quantization	Unverified	0
Beyond Blur: A Fluid Perspective on Generative Diffusion Models	Jun 20, 2025	Diversity, GPU	Unverified	0
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens	Jun 20, 2025	Image Generation, Multimodal Reasoning	Code Available	3
Category-based Galaxy Image Generation via Diffusion Models	Jun 19, 2025	Astronomy, Image Generation	Unverified	0
PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models	Jun 19, 2025	Image Generation, Quantization	Unverified	0
Watermarking Autoregressive Image Generation	Jun 19, 2025	Image Generation, Language Modeling	Code Available	2
Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model	Jun 18, 2025	Image Generation	Code Available	1
DiffusionBlocks: Blockwise Training for Generative Models via Score-Based Diffusion	Jun 17, 2025	Denoising, Image Generation	Unverified	0
Risk Estimation of Knee Osteoarthritis Progression via Predictive Multi-task Modelling from Efficient Diffusion Model using X-ray Images	Jun 17, 2025	Image Generation, Interpretable Machine Learning	Unverified	0
Cost-Aware Routing for Efficient Text-To-Image Generation	Jun 17, 2025	Denoising, Image Generation	Unverified	0
Align Your Flow: Scaling Continuous-Time Flow Map Distillation	Jun 17, 2025	Image Generation	Unverified	0
Decoupled Classifier-Free Guidance for Counterfactual Diffusion Models	Jun 17, 2025	Attribute, counterfactual	Unverified	0
VideoMAR: Autoregressive Video Generation with Continuous Tokens	Jun 17, 2025	GPU, Image Generation	Unverified	0
Fair Generation without Unfair Distortions: Debiasing Text-to-Image Generation with Entanglement-Free Attention	Jun 16, 2025	Attribute, Image Generation	Unverified	0
Quantitative Comparison of Fine-Tuning Techniques for Pretrained Latent Diffusion Models in the Generation of Unseen SAR Image Concepts	Jun 16, 2025	Image Generation	Unverified	0
Deep Diffusion Models and Unsupervised Hyperspectral Unmixing for Realistic Abundance Map Synthesis	Jun 16, 2025	Benchmarking, Data Augmentation	Unverified	0
Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation	Jun 13, 2025	Image Generation, Novel View Synthesis	Unverified	0
Exploring the Effectiveness of Deep Features from Domain-Specific Foundation Models in Retinal Image Synthesis	Jun 13, 2025	Edge Detection, Fairness	Unverified	0
A Watermark for Auto-Regressive Image Generation Models	Jun 13, 2025	Face Swapping, Image Generation	Unverified	0
Edit360: 2D Image Edits to 3D Assets from Any Angle	Jun 12, 2025	Image Generation	Unverified	0
Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation	Jun 12, 2025	Image Generation, multimodal generation	Unverified	0
Anatomy-Grounded Weakly Supervised Prompt Tuning for Chest X-ray Latent Diffusion Models	Jun 12, 2025	Anatomy, Image Generation	Unverified	0
High-resolution efficient image generation from WiFi CSI using a pretrained latent diffusion model	Jun 12, 2025	Computational Efficiency, Denoising	Unverified	0
Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models	Jun 12, 2025	Image Generation, Segmentation	Unverified	0
The Role of Generative AI in Facilitating Social Interactions: A Scoping Review	Jun 12, 2025	Image Generation	Unverified	0
Text to Image for Multi-Label Image Recognition with Joint Prompt-Adapter Learning	Jun 12, 2025	Contrastive Learning, Image Generation	Unverified	0
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning	Jun 12, 2025	Image Generation, Multimodal Reasoning	Unverified	0
Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression	Jun 11, 2025	Image Generation	Code Available	2
SAGE: Exploring the Boundaries of Unsafe Concept Domain with Semantic-Augment Erasing	Jun 11, 2025	Image Generation, Text to Image Generation	Code Available	0
SPARKE: Scalable Prompt-Aware Diversity Guidance in Diffusion Models via RKE Score	Jun 11, 2025	Diversity, Image Generation	Unverified	0
Prompt-Guided Latent Diffusion with Predictive Class Conditioning for 3D Prostate MRI Generation	Jun 11, 2025	Image Generation, Language Modeling	Unverified	0
Only-Style: Stylistic Consistency in Image Generation without Content Leakage	Jun 11, 2025	Image Generation	Unverified	0
HadaNorm: Diffusion Transformer Quantization through Mean-Centered Transformations	Jun 11, 2025	Image Generation, Quantization	Unverified	0
Ming-Omni: A Unified Multimodal Model for Perception and Generation	Jun 11, 2025	Image Generation, text-to-speech	Code Available	4
ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models	Jun 11, 2025	Image Generation, Image Segmentation	Unverified	0
Consistent Story Generation with Asymmetry Zigzag Sampling	Jun 11, 2025	Image Generation, Story Generation	Code Available	0
Geometric Regularity in Deterministic Sampling of Diffusion-based Generative Models	Jun 11, 2025	Image Generation	Unverified	0
Noise Conditional Variational Score Distillation	Jun 11, 2025	Conditional Image Generation, Denoising	Code Available	1
Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better	Jun 10, 2025	Image Generation	Code Available	2
Diffuse and Disperse: Image Generation with Representation Regularization	Jun 10, 2025	Image Generation, regression	Unverified	0


Datasets: ImageNet 256x256, CIFAR-10, ImageNet 64x64, ImageNet 512x512, FFHQ 256 x 256, CelebA 64x64, ImageNet 32x32, LSUN Bedroom 256 x 256, STL-10, LSUN Churches 256 x 256, ImageNet 128x128, FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE + CLS	KID	7.95	—	Unverified
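
Most entries above report FID (Fréchet Inception Distance), which compares the feature statistics of real and generated images; lower is better. The sketch below shows only the distance computation, under a simplifying assumption that is not part of standard FID: each feature dimension is treated as independent (diagonal covariance), so the distance reduces to a sum of squared differences of means and of standard deviations. Standard FID uses Inception network features and full covariance matrices. The function names and the tiny feature lists are hypothetical.

```python
import math


def mean_and_std(features):
    """Per-dimension mean and population standard deviation of feature vectors."""
    n = len(features)
    dims = len(features[0])
    means = [sum(f[d] for f in features) / n for d in range(dims)]
    stds = [
        math.sqrt(sum((f[d] - means[d]) ** 2 for f in features) / n)
        for d in range(dims)
    ]
    return means, stds


def frechet_distance_diag(real_features, generated_features):
    """Frechet distance between two Gaussians with diagonal covariance.

    Simplified stand-in for FID: sum over dimensions of
    (mu_r - mu_g)^2 + (sigma_r - sigma_g)^2.
    """
    mu_r, sd_r = mean_and_std(real_features)
    mu_g, sd_g = mean_and_std(generated_features)
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_r, mu_g))
    spread_term = sum((a - b) ** 2 for a, b in zip(sd_r, sd_g))
    return mean_term + spread_term


# Toy 2-D "features"; real pipelines would extract these from images.
real = [[0.0, 1.0], [2.0, 3.0]]
shifted = [[3.0, 1.0], [5.0, 3.0]]  # same spread, first dimension shifted by 3

print(frechet_distance_diag(real, real))     # identical statistics give 0.0
print(frechet_distance_diag(real, shifted))  # mean shift of 3 gives 9.0
```

Because scores depend on the feature extractor, sample count, and implementation details, values reported by different papers are not always directly comparable.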