SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 38263850 of 6689 papers

TitleStatusHype
Holistic Evaluation for Interleaved Text-and-Image Generation0
The State of the Art when using GPUs in Devising Image Generation Methods Using Deep Learning0
The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation0
How Animals Dance (When You're Not Looking)0
How do Minimum-Norm Shallow Denoisers Look in Function Space?0
The Surprising Effectiveness of Skip-Tuning in Diffusion Sampling0
How Generative Adversarial Networks and Their Variants Work: An Overview0
How Image Generation Helps Visible-to-Infrared Person Re-Identification?0
How Real Is Real? A Human Evaluation Framework for Unrestricted Adversarial Examples0
How Stable is Stable Diffusion under Recursive InPainting (RIP)?0
How to build a consistency model: Learning flow maps via self-distillation0
How to Construct Energy for Images? Denoising Autoencoder Can Be Energy Based Model0
How to Fill the Optimum Set? Population Gradient Descent with Harmless Diversity0
The Swiss Army Knife for Image-to-Image Translation: Multi-Task Diffusion Models0
How to make words with vectors: Phrase generation in distributional semantics0
EvolvED: Evolutionary Embeddings to Understand the Generation Process of Diffusion Models0
The Value of AI Guidance in Human Examination of Synthetically-Generated Faces0
Things not Written in Text: Exploring Spatial Commonsense from Visual Signals0
Zero-1-to-G: Taming Pretrained 2D Diffusion Model for Direct 3D Generation0
A Unified Framework for Diffusion Bridge Problems: Flow Matching and Schrödinger Matching into One0
Human Appearance Transfer0
HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation0
Human Evaluation of Text-to-Image Models on a Multi-Task Benchmark0
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning0
HumanGAN: A Generative Model of Humans Images0
Show:102550
← PrevPage 154 of 268Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified