SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to sampling from the data distribution without any conditioning input, i.e. from $p(y)$.
  • Conditional image generation (subtask) refers to sampling conditioned on an input such as a class label $x$, i.e. from $p(y|x)$.
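The distinction between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. It uses a hypothetical toy dataset of (label, sample) pairs and simply resamples from it; a real generative model would learn these distributions rather than look samples up. The names `DATASET`, `sample_unconditional`, and `sample_conditional` are illustrative assumptions, not part of any library.

```python
import random

# Hypothetical toy "dataset": (label x, sample y) pairs standing in for labelled images.
DATASET = [
    ("cat", "cat_img_0"), ("cat", "cat_img_1"),
    ("dog", "dog_img_0"), ("dog", "dog_img_1"), ("dog", "dog_img_2"),
]

def sample_unconditional(rng):
    """Draw y ~ p(y): labels are ignored entirely."""
    _, y = rng.choice(DATASET)
    return y

def sample_conditional(rng, x):
    """Draw y ~ p(y | x): only samples whose label equals x are eligible."""
    candidates = [y for label, y in DATASET if label == x]
    if not candidates:
        raise ValueError(f"no samples with label {x!r}")
    return rng.choice(candidates)

rng = random.Random(0)
print(sample_unconditional(rng))       # any sample, cat or dog
print(sample_conditional(rng, "cat"))  # always a cat sample
```

In this sketch the unconditional sampler can return any item, while the conditional sampler is restricted to the subset that matches the requested label.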

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 6601-6650 of 6689 papers

Title | Status | Hype
Collaborative Sampling in Generative Adversarial Networks | Code | 0
What can Discriminator do? Towards Box-free Ownership Verification of Generative Adversarial Network | Code | 0
Detail-aware multi-view stereo network for depth estimation | Code | 0
Redistribute Ensemble Training for Mitigating Memorization in Diffusion Models | Code | 0
Collaborative Interactive Evolution of Art in the Latent Space of Deep Generative Models | Code | 0
Whitening and Coloring batch transform for GANs | Code | 0
A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks | Code | 0
Training Generative Adversarial Networks from Incomplete Observations using Factorised Discriminators | Code | 0
What can Discriminator do? Towards Box-free Ownership Verification of Generative Adversarial Networks | Code | 0
Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis | Code | 0
Reducing Spatial Fitting Error in Distillation of Denoising Diffusion Models | Code | 0
Human-Guided Image Generation for Expanding Small-Scale Training Image Datasets | Code | 0
Training language GANs from Scratch | Code | 0
DPGEN: Differentially Private Generative Energy-Guided Network for Natural Image Synthesis | Code | 0
Adversarial Out-domain Examples for Generative Models | Code | 0
Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images | Code | 0
A Unified Approach of Multi-scale Deep and Hand-crafted Features for Defocus Estimation | Code | 0
AugStatic - A Light-Weight Image Augmentation Library | Code | 0
CollaFuse: Collaborative Diffusion Models | Code | 0
ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation | Code | 0
Decoupled Learning for Conditional Adversarial Networks | Code | 0
Overcoming the Disentanglement vs Reconstruction Trade-off via Jacobian Supervision | Code | 0
Mapping Instructions to Actions in 3D Environments with Visual Goal Prediction | Code | 0
DRAN: Detailed Region-Adaptive Normalization for Conditional Image Synthesis | Code | 0
Spatially Controllable Image Synthesis with Internal Representation Collaging | Code | 0
semantic image synthesis of anime characters based on conditional generative adversarial networks | Code | 0
P^2-GAN: Efficient Style Transfer Using Single Style Image | Code | 0
Referenceless User Controllable Semantic Image Synthesis | Code | 0
Array Camera Image Fusion using Physics-Aware Transformers | Code | 0
AR-RAG: Autoregressive Retrieval Augmentation for Image Generation | Code | 0
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models | Code | 0
Mask and Restore: Blind Backdoor Defense at Test Time with Masked Autoencoder | Code | 0
Efficient generative adversarial networks using linear additive-attention Transformers | Code | 0
Progressive Augmentation of GANs | Code | 0
Semantic Image Synthesis via Class-Adaptive Cross-Attention | Code | 0
PAGER: Progressive Attribute-Guided Extendable Robust Image Generation | Code | 0
Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy | Code | 0
Hyperparameter-Free Medical Image Synthesis for Sharing Data and Improving Site-Specific Segmentation | Code | 0
Visual Object Networks: Image Generation with Disentangled 3D Representations | Code | 0
Hyperspectral Blind Unmixing using a Double Deep Image Prior | Code | 0
Color Agnostic Cross-Spectral Disparity Estimation | Code | 0
An Inversion-based Measure of Memorization for Diffusion Models | Code | 0
Synthesis of Positron Emission Tomography (PET) Images via Multi-channel Generative Adversarial Networks (GANs) | Code | 0
A Robust Attentional Framework for License Plate Recognition in the Wild | Code | 0
SUSTechGAN: Image Generation for Object Detection in Adverse Conditions of Autonomous Driving | Code | 0
EAR: Erasing Concepts from Unified Autoregressive Models | Code | 0
Mask Embedding in conditional GAN for Guided Synthesis of High Resolution Images | Code | 0
Diffusion-based image inpainting with internal learning | Code | 0
Combiner: Full Attention Transformer with Sparse Computation Cost | Code | 0
Unsupervised Learning of Object Landmarks through Conditional Image Generation | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Improved DDPM | FID | 12.3 | | Unverified
2 | ADM | FID | 11.84 | | Unverified
3 | BigGAN-deep | FID | 8.1 | | Unverified
4 | Polarity-BigGAN | FID | 6.82 | | Unverified
5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified
6 | MaskGIT | FID | 6.18 | | Unverified
7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified
8 | CDM | FID | 4.88 | | Unverified
9 | ADM-G | FID | 4.59 | | Unverified
10 | RIN | FID | 4.51 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PresGAN | FID | 52.2 | | Unverified
2 | RESFLOW | FID | 48.29 | | Unverified
3 | Residual Flow | FID | 46.37 | | Unverified
4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified
5 | ProdPoly no activation functions | FID | 40.45 | | Unverified
6 | ProdPoly no activation functions | FID | 36.77 | | Unverified
7 | ACGAN | FID | 35.47 | | Unverified
8 | DenseFlow-74-10 | FID | 34.9 | | Unverified
9 | NVAE w/ flow | FID | 32.53 | | Unverified
10 | QSNGAN | FID | 31.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GLIDE + CLS | FID | 30.87 | | Unverified
2 | GLIDE + CLIP | FID | 30.46 | | Unverified
3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified
4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified
5 | PGMGAN | FID | 21.73 | | Unverified
6 | CLR-GAN | FID | 20.27 | | Unverified
7 | FM | FID | 14.45 | | Unverified
8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified
9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified
10 | GLIDE +CLS | KID | 7.95 | | Unverified
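
Most rows above report FID (Fréchet Inception Distance), where lower values are better. As a rough sketch of the underlying calculation, the Fréchet distance between two Gaussians fitted to feature vectors is $\|\mu_1-\mu_2\|^2 + \mathrm{Tr}(\Sigma_1+\Sigma_2-2(\Sigma_1\Sigma_2)^{1/2})$. The code below implements only that formula with NumPy. It assumes the features have already been extracted (in practice from an Inception network, which is not shown here), and the function names `frechet_distance` and `fid_from_features` are illustrative, not taken from any leaderboard entry or library.

```python
import numpy as np

def frechet_distance(mu1, sigma1, mu2, sigma2):
    """Fréchet distance between N(mu1, sigma1) and N(mu2, sigma2)."""
    mu1, mu2 = np.asarray(mu1, dtype=float), np.asarray(mu2, dtype=float)
    sigma1 = np.asarray(sigma1, dtype=float)
    sigma2 = np.asarray(sigma2, dtype=float)
    diff = mu1 - mu2
    # Tr((S1 S2)^{1/2}) equals the sum of the square roots of the eigenvalues
    # of S1 S2, which are real and non-negative for PSD covariance matrices.
    eigvals = np.linalg.eigvals(sigma1 @ sigma2)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    return float(diff @ diff + np.trace(sigma1) + np.trace(sigma2) - 2.0 * tr_sqrt)

def fid_from_features(feats_a, feats_b):
    """Fit a Gaussian to each (n_samples, n_features) array and compare them."""
    feats_a = np.asarray(feats_a, dtype=float)
    feats_b = np.asarray(feats_b, dtype=float)
    return frechet_distance(
        feats_a.mean(axis=0), np.cov(feats_a, rowvar=False),
        feats_b.mean(axis=0), np.cov(feats_b, rowvar=False),
    )

# Hypothetical stand-in features; real use would pass network activations.
rng = np.random.default_rng(0)
real = rng.normal(0.0, 1.0, size=(500, 4))
fake = rng.normal(0.5, 1.0, size=(500, 4))
print(fid_from_features(real, fake))
```

Scores computed this way are only comparable when the feature extractor, sample counts, and preprocessing match, which is one reason the tables list claimed values as unverified.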