Image Generation

Image generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

Unconditional generation refers to generating samples from the dataset distribution without any conditioning input, i.e. $p(y)$.
Conditional image generation (a subtask) refers to generating samples conditioned on a label $x$, i.e. $p(y|x)$.
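
The difference between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. It is not a generative model: it only resamples a toy dataset, and the labels, sample names, and the helpers `sample_unconditional` and `sample_conditional` are hypothetical names introduced for illustration.

```python
import random

# Toy dataset of (label x, sample y) pairs; all values are made up for illustration.
DATASET = [
    ("cat", "cat_img_0"), ("cat", "cat_img_1"),
    ("dog", "dog_img_0"), ("dog", "dog_img_1"), ("dog", "dog_img_2"),
]

def sample_unconditional(rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, y = rng.choice(DATASET)
    return y

def sample_conditional(rng, x):
    """Draw y ~ p(y | x): only samples whose label equals x are eligible."""
    candidates = [y for label, y in DATASET if label == x]
    if not candidates:
        raise ValueError(f"no samples with label {x!r}")
    return rng.choice(candidates)

rng = random.Random(0)
print(sample_unconditional(rng))       # any sample from the dataset
print(sample_conditional(rng, "cat"))  # always a sample labelled "cat"
```

A real generator would replace the lookup with a learned model, but the interface is the same: the conditional sampler takes the label as an extra input.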

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers


Showing 4051–4100 of 6689 papers

Title	Date	Tasks	Status
Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks	Mar 1, 2024	Image Generation	Unverified
Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived Dataset	Mar 1, 2024	Image Captioning, Image Generation	Code Available
Rethinking cluster-conditioned diffusion models for label-free image synthesis	Mar 1, 2024	Clustering, Conditional Image Generation	Code Available
Disentangling representations of retinal images with generative models	Feb 29, 2024	Disentanglement, Image Generation	Code Available
Learning to Find Missing Video Frames with Synthetic Data Augmentation: A General Framework and Application in Generating Thermal Images Using RGB Cameras	Feb 29, 2024	Data Augmentation, Image Generation	Unverified
A Quantitative Evaluation of Score Distillation Sampling Based Text-to-3D	Feb 29, 2024	Image Generation, Text to 3D	Unverified
Balancing Act: Distribution-Guided Debiasing in Diffusion Models	Feb 28, 2024	Attribute, Data Augmentation	Unverified
Block and Detail: Scaffolding Sketch-to-Image Generation	Feb 28, 2024	Blocking, Dataset Generation	Unverified
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models	Feb 27, 2024	Image Generation, parameter-efficient fine-tuning	Unverified
Advancing Generative Model Evaluation: A Novel Algorithm for Realistic Image Synthesis and Comparison in OCR System	Feb 27, 2024	Image Generation, Optical Character Recognition (OCR)	Unverified
Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation	Feb 27, 2024	Image Generation	Unverified
Structure-Guided Adversarial Training of Diffusion Models	Feb 27, 2024	Conditional Image Generation, Denoising	Unverified
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing	Feb 27, 2024	Image Generation	Unverified
T-HITL Effectively Addresses Problematic Associations in Image Generation and Maintains Overall Visual Quality	Feb 27, 2024	Image Generation	Unverified
Understanding Subjectivity through the Lens of Motivational Context in Model-Generated Image Satisfaction	Feb 27, 2024	Image Generation	Unverified
Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion	Feb 26, 2024	Image Generation, Text to Image Generation	Unverified
Multi-LoRA Composition for Image Generation	Feb 26, 2024	Denoising, Image Generation	Unverified
The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling	Feb 23, 2024	Decoder, Image Generation	Unverified
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis	Feb 22, 2024	Image Generation, Text-to-Video Generation	Unverified
Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening	Feb 22, 2024	Cell Detection, Data Augmentation	Unverified
Semantic Image Synthesis with Unconditional Generator	Feb 22, 2024	Image Generation, Semantic Segmentation	Unverified
Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models	Feb 21, 2024	Disentanglement, Image Generation	Unverified
SDXL-Lightning: Progressive Adversarial Diffusion Distillation	Feb 21, 2024	Image Generation, Text to Image Generation	Unverified
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image Synthesis	Feb 20, 2024	Image Generation, Prompt Engineering	Code Available
Layout-to-Image Generation with Localized Descriptions using ControlNet with Cross-Attention Control	Feb 20, 2024	Image Generation, Layout-to-Image Generation	Unverified
Revisiting registration-based synthesis: A focus on unsupervised MR image synthesis	Feb 19, 2024	Anatomy, Image Generation	Unverified
On the Semantic Latent Space of Diffusion-Based Text-to-Speech Models	Feb 19, 2024	Denoising, Image Generation	Unverified
Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability	Feb 19, 2024	3D Generation, 3D Shape Generation	Unverified
Statistical Test on Diffusion Model-based Anomaly Detection by Selective Inference	Feb 19, 2024	Anomaly Detection, Decision Making	Unverified
Visual Concept-driven Image Generation with Text-to-Image Diffusion Model	Feb 18, 2024	Image Generation	Unverified
SDiT: Spiking Diffusion Model with Transformer	Feb 18, 2024	Image Generation, model	Unverified
Universal Prompt Optimizer for Safe Text-to-Image Generation	Feb 16, 2024	Blocking, Image Generation	Code Available
Training Class-Imbalanced Diffusion Model Via Overlap Optimization	Feb 16, 2024	Contrastive Learning, Image Generation	Code Available
UMAIR-FPS: User-aware Multi-modal Animation Illustration Recommendation Fusion with Painting Style	Feb 16, 2024	Image Generation, Multi-modal Recommendation	Code Available
Exploring Precision and Recall to assess the quality and diversity of LLMs	Feb 16, 2024	Diversity, Image Generation	Code Available
The Male CEO and the Female Assistant: Evaluation and Mitigation of Gender Biases in Text-To-Image Generation of Dual Subjects	Feb 16, 2024	Fairness, Image Generation	Unverified
Generative AI and Process Systems Engineering: The Next Frontier	Feb 15, 2024	Decision Making, Image Generation	Unverified
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation	Feb 15, 2024	Image Generation, Reinforcement Learning (RL)	Unverified
Classification Diffusion Models: Revitalizing Density Ratio Estimation	Feb 15, 2024	Audio Generation, Classification	Unverified
L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects	Feb 14, 2024	Image Generation, Text to 3D	Unverified
Towards the Detection of AI-Synthesized Human Face Images	Feb 13, 2024	Face Swapping, Image Generation	Unverified
Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian probability distributions	Feb 12, 2024	Image Generation	Unverified
Model Collapse Demystified: The Case of Regression	Feb 12, 2024	Image Generation, model	Unverified
Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback	Feb 12, 2024	Image Generation, Image Super-Resolution	Unverified
Disentangled Latent Energy-Based Style Translation: An Image-Level Structural MRI Harmonization Framework	Feb 10, 2024	Image Generation, Translation	Unverified
Cardiac ultrasound simulation for autonomous ultrasound navigation	Feb 9, 2024	Diagnostic, GPU	Unverified
Collaborative Control for Geometry-Conditioned PBR Image Generation	Feb 8, 2024	Image Generation	Unverified
CTGAN: Semantic-guided Conditional Texture Generator for 3D Shapes	Feb 8, 2024	Image Generation, Texture Synthesis	Unverified
Minecraft-ify: Minecraft Style Image Generation with Text-guided Image Editing for In-Game Application	Feb 8, 2024	Image Generation, Minecraft	Unverified
ChatScratch: An AI-Augmented System Toward Autonomous Visual Programming Learning for Children Aged 6-12	Feb 7, 2024	Image Generation	Unverified


Datasets: ImageNet 256x256, CIFAR-10, ImageNet 64x64, ImageNet 512x512, FFHQ 256 x 256, CelebA 64x64, ImageNet 32x32, LSUN Bedroom 256 x 256, STL-10, LSUN Churches 256 x 256, ImageNet 128x128, FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified
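
Most rows above report FID (Fréchet Inception Distance), where lower is better. FID fits a Gaussian to feature vectors of real images and another to feature vectors of generated images, then takes the Fréchet distance between the two: $\lVert\mu_r-\mu_g\rVert^2 + \mathrm{Tr}(\Sigma_r + \Sigma_g - 2(\Sigma_r\Sigma_g)^{1/2})$. The sketch below computes that distance with NumPy. It assumes the features have already been extracted (in practice from an Inception-v3 network, which is not reproduced here), and the function names `frechet_distance` and `fid_from_features` are hypothetical. It is not the evaluation code behind any entry in the tables.

```python
import numpy as np

def frechet_distance(mu1, sigma1, mu2, sigma2):
    """Frechet distance between the Gaussians N(mu1, sigma1) and N(mu2, sigma2)."""
    mu1 = np.asarray(mu1, dtype=float)
    mu2 = np.asarray(mu2, dtype=float)
    sigma1 = np.atleast_2d(np.asarray(sigma1, dtype=float))
    sigma2 = np.atleast_2d(np.asarray(sigma2, dtype=float))
    diff = mu1 - mu2
    # Tr((S1 S2)^{1/2}) equals the sum of the square roots of the eigenvalues
    # of S1 S2, which are real and non-negative for PSD covariance matrices.
    eigvals = np.linalg.eigvals(sigma1 @ sigma2)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    return float(diff @ diff + np.trace(sigma1) + np.trace(sigma2) - 2.0 * tr_sqrt)

def fid_from_features(real_feats, gen_feats):
    """FID-style score from two (n_samples, n_features) arrays of features."""
    real_feats = np.asarray(real_feats, dtype=float)
    gen_feats = np.asarray(gen_feats, dtype=float)
    mu_r, mu_g = real_feats.mean(axis=0), gen_feats.mean(axis=0)
    sigma_r = np.cov(real_feats, rowvar=False)
    sigma_g = np.cov(gen_feats, rowvar=False)
    return frechet_distance(mu_r, sigma_r, mu_g, sigma_g)

# Usage with random stand-in features (not real image features).
rng = np.random.default_rng(0)
real = rng.normal(size=(500, 8))
fake = rng.normal(loc=0.5, size=(500, 8))
print(fid_from_features(real, fake))
```

Scores computed this way are only comparable when the feature extractor, image preprocessing, and sample counts match, which is one reason the table entries are marked as unverified claims.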