Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4151–4200 of 6689 papers

Title	Date	Tasks	Status
Score Distillation Sampling with Learned Manifold Corrective	Jan 10, 2024	DenoisingImage Generation	—Unverified
AI Art is Theft: Labour, Extraction, and Exploitation, Or, On the Dangers of Stochastic Pollocks	Jan 10, 2024	Image Generation	—Unverified
Content-Conditioned Generation of Stylized Free hand Sketches	Jan 9, 2024	Data AugmentationImage Generation	—Unverified
EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models	Jan 9, 2024	AttributeDiversity	—Unverified
Vision Reimagined: AI-Powered Breakthroughs in WiFi Indoor Imaging	Jan 9, 2024	Image Generation	—Unverified
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding	Jan 9, 2024	Image Captioningimage-classification	—Unverified
3D-SSGAN: Lifting 2D Semantics for 3D-Aware Compositional Portrait Synthesis	Jan 8, 2024	DisentanglementImage Generation	—Unverified
Detecting Face Synthesis Using a Concealed Fusion Model	Jan 8, 2024	Computer SecurityFace Generation	—Unverified
Data-Agnostic Face Image Synthesis Detection Using Bayesian CNNs	Jan 8, 2024	Anomaly DetectionImage Generation	—Unverified
Controllable Image Synthesis of Industrial Data Using Stable Diffusion	Jan 6, 2024	Defect DetectionImage Generation	—Unverified
A Dataset and Benchmark for Copyright Infringement Unlearning from Text-to-Image Diffusion Models	Jan 4, 2024	Image GenerationText to Image Generation	CodeCode Available
Improving Diffusion-Based Image Synthesis with Context Prediction	Jan 4, 2024	DecoderDenoising	—Unverified
A Vision Check-up for Language Models	Jan 3, 2024	Image GenerationRepresentation Learning	—Unverified
DDPM based X-ray Image Synthesizer	Jan 3, 2024	DenoisingImage Generation	—Unverified
Instruct-Imagen: Image Generation with Multi-modal Instruction	Jan 3, 2024	Image GenerationRetrieval	—Unverified
Few-shot Image Generation via Information Transfer from the Built Geodesic Surface	Jan 3, 2024	DiversityImage Generation	—Unverified
GeoPos: A Minimal Positional Encoding for Enhanced Fine-Grained Details in Image Synthesis Using Convolutional Neural Networks	Jan 3, 2024	DenoisingImage Generation	—Unverified
SSP: A Simple and Safe automatic Prompt engineering method towards realistic image synthesis on LVM	Jan 2, 2024	Image GenerationInstruction Following	—Unverified
Joint Generative Modeling of Scene Graphs and Images via Diffusion Models	Jan 2, 2024	Graph GenerationImage Generation	—Unverified
Adversarial Text to Continuous Image Generation	Jan 1, 2024	Adversarial TextImage Generation	—Unverified
Unmixing Before Fusion: A Generalized Paradigm for Multi-Source-based Hyperspectral Image Synthesis	Jan 1, 2024	Image Generation	—Unverified
PaReNeRF: Toward Fast Large-scale Dynamic NeRF with Patch-based Reference	Jan 1, 2024	Autonomous DrivingDecoder	—Unverified
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action	Jan 1, 2024	Image GenerationInstruction Following	—Unverified
Learning Coupled Dictionaries from Unpaired Data for Image Super-Resolution	Jan 1, 2024	DiversityImage Generation	—Unverified
Video Prediction by Modeling Videos as Continuous Multi-Dimensional Processes	Jan 1, 2024	Image GenerationPrediction	—Unverified
Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning	Jan 1, 2024	Decision MakingDiversity	—Unverified
Learning Diffusion Texture Priors for Image Restoration	Jan 1, 2024	Image GenerationImage Restoration	—Unverified
Generate Subgoal Images before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts	Jan 1, 2024	Image GenerationInstruction Following	—Unverified
Vector Graphics Generation via Mutually Impulsed Dual-domain Diffusion	Jan 1, 2024	Image GenerationVector Graphics	—Unverified
AnyScene: Customized Image Synthesis with Composited Foreground	Jan 1, 2024	Image Generation	—Unverified
MAGICK: A Large-scale Captioned Dataset from Matting Generated Images using Chroma Keying	Jan 1, 2024	Image GenerationImage Matting	—Unverified
Exact Fusion via Feature Distribution Matching for Few-shot Image Generation	Jan 1, 2024	Data AugmentationDiversity	CodeCode Available
Countering Personalized Text-to-Image Generation with Influence Watermarks	Jan 1, 2024	Image CompressionImage Generation	—Unverified
One-dimensional Adapter to Rule Them All: Concepts Diffusion Models and Erasing Applications	Jan 1, 2024	AllImage Generation	—Unverified
ZeroRF: Fast Sparse View 360deg Reconstruction with Zero Pretraining	Jan 1, 2024	Image GenerationNeRF	—Unverified
Taming Stable Diffusion for Text to 360 Panorama Image Generation	Jan 1, 2024	DenoisingImage Generation	—Unverified
TextNeRF: A Novel Scene-Text Image Synthesis Method based on Neural Radiance Fields	Jan 1, 2024	Image GenerationNeRF	CodeCode Available
Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation	Jan 1, 2024	Image GenerationText to Image Generation	—Unverified
Text-conditional Attribute Alignment across Latent Spaces for 3D Controllable Face Image Synthesis	Jan 1, 2024	AttributeImage Generation	—Unverified
DiffMorph: Text-less Image Morphing with Diffusion Models	Jan 1, 2024	Image GenerationImage Morphing	—Unverified
CoDi-2: In-Context Interleaved and Interactive Any-to-Any Generation	Jan 1, 2024	Image GenerationLanguage Modeling	—Unverified
RainSD: Rain Style Diversification Module for Image Synthesis Enhancement using Feature-Level Style Distribution	Dec 31, 2023	Autonomous DrivingAutonomous Vehicles	—Unverified
Generative Model-Driven Synthetic Training Image Generation: An Approach to Cognition in Rail Defect Detection	Dec 31, 2023	Data AugmentationDefect Detection	CodeCode Available
GAN-GA: A Generative Model based on Genetic Algorithm for Medical Image Generation	Dec 30, 2023	Data AugmentationDiversity	CodeCode Available
GazeCLIP: Towards Enhancing Gaze Estimation via Text Guidance	Dec 30, 2023	Gaze EstimationImage Generation	—Unverified
CycleGAN Models for MRI Image Translation	Dec 28, 2023	Image GenerationImage-to-Image Translation	—Unverified
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion	Dec 27, 2023	Computational EfficiencyDenoising	—Unverified
RefineNet: Enhancing Text-to-Image Conversion with High-Resolution and Detail Accuracy through Hierarchical Transformers and Progressive Refinement	Dec 27, 2023	Computational EfficiencyImage Generation	—Unverified
Prompt Expansion for Adaptive Text-to-Image Generation	Dec 27, 2023	Image GenerationText to Image Generation	—Unverified
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos	Dec 25, 2023	Image GenerationText to Image Generation	—Unverified

Show:10 25 50

← PrevPage 84 of 134Next →

All datasets ImageNet 256x256 CIFAR-10 ImageNet 64x64 ImageNet 512x512 FFHQ 256 x 256 CelebA 64x64 ImageNet 32x32 LSUN Bedroom 256 x 256 STL-10 LSUN Churches 256 x 256 ImageNet 128x128 FFHQ 1024 x 1024

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Improved DDPM	FID	12.3	—	Unverified
2	ADM	FID	11.84	—	Unverified
3	BigGAN-deep	FID	8.1	—	Unverified
4	Polarity-BigGAN	FID	6.82	—	Unverified
5	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	FID	6.59	—	Unverified
6	MaskGIT	FID	6.18	—	Unverified
7	VQGAN+Transformer (k=600, p=1.0, a=0.05)	FID	5.2	—	Unverified
8	CDM	FID	4.88	—	Unverified
9	ADM-G	FID	4.59	—	Unverified
10	RIN	FID	4.51	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PresGAN	FID	52.2	—	Unverified
2	RESFLOW	FID	48.29	—	Unverified
3	Residual Flow	FID	46.37	—	Unverified
4	GLF+perceptual loss (ours)	FID	44.6	—	Unverified
5	ProdPoly no activation functions	FID	40.45	—	Unverified
6	ProdPoly no activation functions	FID	36.77	—	Unverified
7	ACGAN	FID	35.47	—	Unverified
8	DenseFlow-74-10	FID	34.9	—	Unverified
9	NVAE w/ flow	FID	32.53	—	Unverified
10	QSNGAN	FID	31.97	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GLIDE + CLS	FID	30.87	—	Unverified
2	GLIDE + CLIP	FID	30.46	—	Unverified
3	GLIDE + CLS-FREE	FID	29.22	—	Unverified
4	GLIDE + CLIP + CLS + CLS-FREE	FID	29.18	—	Unverified
5	PGMGAN	FID	21.73	—	Unverified
6	CLR-GAN	FID	20.27	—	Unverified
7	FM	FID	14.45	—	Unverified
8	CT (Direct Generation, NFE=1)	FID	13	—	Unverified
9	CT (Direct Generation, NFE=2)	FID	11.1	—	Unverified
10	GLIDE +CLS	KID	7.95	—	Unverified