Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3601–3650 of 6689 papers

Title	Date	Tasks	Status	Hype
Diffusion idea exploration for art generation	Jul 11, 2023	Generative Adversarial NetworkImage Generation	—Unverified	0
Automatic Generation of Semantic Parts for Face Image Synthesis	Jul 11, 2023	DecoderImage Generation	CodeCode Available	0
Cross-Modality Fourier Feature for Medical Image Synthesis	Jul 10, 2023	Image Generation	CodeCode Available	0
Exact Diffusion Inversion via Bi-directional Integration Approximation	Jul 10, 2023	Image GenerationImage Reconstruction	CodeCode Available	1
Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback	Jul 10, 2023	Image GenerationVisual Question Answering (VQA)	—Unverified	0
K-Space-Aware Cross-Modality Score for Synthesized Neuroimage Quality Assessment	Jul 10, 2023	Image GenerationSSIM	—Unverified	0
DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer	Jul 9, 2023	Image GenerationStyle Transfer	—Unverified	0
Score-based Conditional Generation with Fewer Labeled Data by Self-calibrating Classifier Guidance	Jul 9, 2023	Image Generation	—Unverified	0
Measuring the Success of Diffusion Models at Imitating Human Artists	Jul 8, 2023	image-classificationImage Classification	—Unverified	0
Censored Sampling of Diffusion Models Using 3 Minutes of Human Feedback	Jul 6, 2023	Image Generation	CodeCode Available	1
On the Cultural Gap in Text-to-Image Generation	Jul 6, 2023	Image GenerationText to Image Generation	CodeCode Available	0
SoK: Privacy-Preserving Data Synthesis	Jul 5, 2023	Image GenerationPrivacy Preserving	—Unverified	0
On the Adversarial Robustness of Generative Autoencoders in the Latent Space	Jul 5, 2023	Adversarial Robustnesscompressed sensing	—Unverified	0
Prompting Diffusion Representations for Cross-Domain Semantic Segmentation	Jul 5, 2023	Domain AdaptationDomain Generalization	CodeCode Available	0
Interpretable Computer Vision Models through Adversarial Training: Unveiling the Robustness-Interpretability Connection	Jul 4, 2023	Feature ImportanceImage Generation	CodeCode Available	0
AdAM: Few-Shot Image Generation via Adaptation-Aware Kernel Modulation	Jul 4, 2023	Domain AdaptationImage Generation	—Unverified	0
Disentanglement in a GAN for Unconditional Speech Synthesis	Jul 4, 2023	DisentanglementGenerative Adversarial Network	CodeCode Available	1
Training Energy-Based Models with Diffusion Contrastive Divergences	Jul 4, 2023	DenoisingImage Denoising	—Unverified	0
ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep Learning-Based Electrocardiogram Digitization	Jul 4, 2023	Data AugmentationDecision Making	CodeCode Available	1
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis	Jul 4, 2023	Image Generation	CodeCode Available	2
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion	Jul 3, 2023	Image Generation	CodeCode Available	2
DifFSS: Diffusion Model for Few-Shot Semantic Segmentation	Jul 3, 2023	Few-Shot Semantic SegmentationImage Generation	CodeCode Available	1
Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis	Jul 3, 2023	Contrastive LearningImage Generation	—Unverified	0
Squeezing Large-Scale Diffusion Models for Mobile	Jul 3, 2023	Image Generation	—Unverified	0
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance	Jul 2, 2023	Image Generation	—Unverified	0
Facial Image Generation from Bangla Textual Description using DCGAN and Bangla FastText	Jul 1, 2023	Generative Adversarial NetworkImage Generation	CodeCode Available	0
DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation	Jul 1, 2023	Image Generation	—Unverified	0
AIGCIQA2023: A Large-scale Image Quality Assessment Database for AI Generated Images: from the Perspectives of Quality, Authenticity and Correspondence	Jul 1, 2023	Image GenerationImage Quality Assessment	CodeCode Available	1
Practical and Asymptotically Exact Conditional Sampling in Diffusion Models	Jun 30, 2023	Computational EfficiencyConditional Image Generation	CodeCode Available	1
Counting Guidance for High Fidelity Text-to-Image Synthesis	Jun 30, 2023	DenoisingImage Generation	—Unverified	0
Stay on topic with Classifier-Free Guidance	Jun 30, 2023	Code GenerationCommon Sense Reasoning	—Unverified	0
DreamDiffusion: Generating High-Quality Images from Brain EEG Signals	Jun 29, 2023	EEGElectroencephalogram (EEG)	CodeCode Available	2
CLIPAG: Towards Generator-Free Text-to-Image Generation	Jun 29, 2023	image-classificationImage Classification	—Unverified	0
NaturalInversion: Data-Free Image Synthesis Improving Real-World Consistency	Jun 29, 2023	Image GenerationKnowledge Distillation	CodeCode Available	1
Semi-supervised Multimodal Representation Learning through a Global Workspace	Jun 27, 2023	Image CaptioningImage Generation	CodeCode Available	0
Approximated Prompt Tuning for Vision-Language Pre-trained Models	Jun 27, 2023	image-classificationImage Classification	—Unverified	0
Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models	Jun 26, 2023	Image Generation	CodeCode Available	1
A-STAR: Test-time Attention Segregation and Retention for Text-to-image Synthesis	Jun 26, 2023	AllDenoising	—Unverified	0
Localized Text-to-Image Generation for Free via Cross Attention Control	Jun 26, 2023	Image GenerationSemantic Segmentation	—Unverified	0
Fuzzy-Conditioned Diffusion and Diffusion Projection Attention Applied to Facial Image Correction	Jun 26, 2023	Image Generation	CodeCode Available	1
A Simple and Effective Baseline for Attentional Generative Adversarial Networks	Jun 26, 2023	Image Generation	CodeCode Available	1
DomainStudio: Fine-Tuning Diffusion Models for Domain-Driven Image Generation using Limited Data	Jun 25, 2023	DenoisingDiversity	CodeCode Available	0
UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation	Jun 24, 2023	Image GenerationImage Segmentation	CodeCode Available	0
Zero-shot spatial layout conditioning for text-to-image diffusion models	Jun 23, 2023	Image GenerationSegmentation	—Unverified	0
Simultaneous Image-to-Zero and Zero-to-Noise: Diffusion Models with Analytical Image Attenuation	Jun 23, 2023	DenoisingEdge Detection	CodeCode Available	1
A New Paradigm for Generative Adversarial Networks based on Randomized Decision Rules	Jun 23, 2023	ClusteringGenerative Adversarial Network	CodeCode Available	0
DiffInfinite: Large Mask-Image Synthesis via Parallel Random Patch Diffusion in Histopathology	Jun 23, 2023	Image GenerationSegmentation	CodeCode Available	1
Directional diffusion models for graph representation learning	Jun 22, 2023	3D Molecule GenerationDenoising	—Unverified	0
DreamEdit: Subject-driven Image Editing	Jun 22, 2023	Image GenerationPosition	—Unverified	0
Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase	Jun 21, 2023	3D-Aware Image SynthesisBenchmarking	CodeCode Available	1

Show:10 25 50

← PrevPage 73 of 134Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified