SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 23012350 of 6689 papers

TitleStatusHype
Training Generative Adversarial Networks via stochastic Nash games0
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models0
Classification under strategic adversary manipulation using pessimistic bilevel optimisation0
DreamArtist++: Controllable One-Shot Text-to-Image Generation via Positive-Negative Adapter0
Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models0
Classification Diffusion Models: Revitalizing Density Ratio Estimation0
A Robust Pose Transformational GAN for Pose Guided Person Image Synthesis0
DRDM: A Disentangled Representations Diffusion Model for Synthesizing Realistic Person Images0
DRC: Enhancing Personalized Image Generation via Disentangled Representation Composition0
Adversarial Pixel-Level Generation of Semantic Images0
DrawingInStyles: Portrait Image Generation and Editing with Spatially Conditioned StyleGAN0
Generative Adversarial Networks and Adversarial Autoencoders: Tutorial and Survey0
Generative Adversarial Networks Conditioned by Brain Signals0
DragTraffic: Interactive and Controllable Traffic Scene Generation for Autonomous Driving0
Clarifying MCMC-based training of modern EBMs : Contrastive Divergence versus Maximum Likelihood0
Drag-guided diffusion models for vehicle image generation0
Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer0
Circuit Complexity Bounds for Visual Autoregressive Model0
ARMOR v0.1: Empowering Autoregressive Multimodal Understanding Model with Interleaved Multimodal Generation via Asymmetric Synergy0
Dr.3D: Adapting 3D GANs to Artistic Drawings0
CipherDM: Secure Three-Party Inference for Diffusion Model Sampling0
Adversarial nets with perceptual losses for text-to-image synthesis0
CIMGEN: Controlled Image Manipulation by Finetuning Pretrained Generative Models on Limited Data0
DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing0
Chinese Typeface Transformation with Hierarchical Adversarial Network0
DPCL-Diff: The Temporal Knowledge Graph Reasoning Based on Graph Node Diffusion Model with Dual-Domain Periodic Contrastive Learning0
DPAF: Image Synthesis via Differentially Private Aggregation in Forward Phase0
Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models0
DOTE: Dual cOnvolutional filTer lEarning for Super-Resolution and Cross-Modality Synthesis in MRI0
DoodleFormer: Creative Sketch Drawing with Transformers0
Don't Forget your Inverse DDIM for Image Editing0
Do Not Mask What You Do Not Need to Mask: a Parser-Free Virtual Try-On0
Chop & Learn: Recognizing and Generating Object-State Compositions0
Generative Adversarial Data Programming0
Generative Adversarial Image Synthesis with Decision Tree Latent Controller0
Generation and Simulation of Yeast Microscopy Imagery with Deep Learning0
Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis0
Chili Pepper Disease Diagnosis via Image Reconstruction Using GrabCut and Generative Adversarial Serial Autoencoder0
Generation of Anonymous Chest Radiographs Using Latent Diffusion Models for Training Thoracic Abnormality Classification Systems0
Chest-Diffusion: A Light-Weight Text-to-Image Model for Report-to-CXR Generation0
Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models0
Data synthesis and adversarial networks: A review and meta-analysis in cancer imaging0
Domain Adaptation Using Adversarial Learning for Autonomous Navigation0
Accelerating Diffusion Models with One-to-Many Knowledge Distillation0
Generation of Synthetic Images for Pedestrian Detection Using a Sequence of GANs0
Do I look like a `cat.n.01` to you? A Taxonomy Image Generation Benchmark0
Does CLIP perceive art the same way we do?0
Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation0
Do Distributed Semantic Models Dream of Electric Sheep? Visualizing Word Representations through Image Synthesis0
On Error Propagation of Diffusion Models0
Show:102550
← PrevPage 47 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified