SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 58015850 of 6689 papers

TitleStatusHype
Discriminator Contrastive Divergence: Semi-Amortized Generative Modeling by Exploring Energy of the Discriminator0
CellCycleGAN: Spatiotemporal Microscopy Image Synthesis of Cell Populations using Statistical Shape Models and Conditional GANs0
LT-GAN: Self-Supervised GAN with Latent Transformation Detection0
Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational AutoencodersCode0
Training Generative Adversarial Networks via stochastic Nash games0
Semantic Editing On Segmentation Map Via Multi-Expansion Loss0
New Ideas and Trends in Deep Multimodal Content Understanding: A Review0
A Human Eye-based Text Color Scheme Generation Method for Image Synthesis0
The power of pictures: using ML assisted image generation to engage the crowd in complex socioscientific problems0
Random Network Distillation as a Diversity Metric for Both Image and Text Generation0
Experimental Quantum Generative Adversarial Networks for Image GenerationCode0
Omni-Directional Image Generation from Single Snapshot ImageCode0
Memory-efficient GAN-based Domain Translation of High Resolution 3D Medical Images0
Arbitrary Style Transfer using Graph Instance Normalization0
A Contrastive Learning Approach for Training Variational Autoencoder Priors0
Image Translation for Medical Image Generation -- Ischemic Stroke LesionsCode0
Generating Gameplay-Relevant Art Assets with Transfer LearningCode0
MGD-GAN: Text-to-Pedestrian generation through Multi-Grained Discrimination0
Tabular GANs for uneven distributionCode0
EEG to fMRI Synthesis: Is Deep Learning a candidate?0
Simplifying Models with Unlabeled Output Data0
Data Instance Prior for Transfer Learning in GANs0
Scene Graph to Image Generation with Contextualized Object Layout Refinement0
Triple-GAN with Variable Fractional Order Gradient Descent Method and Mish Activation Function0
Adversarial Rain Attack and Defensive Deraining for DNN Perception0
Dodging DeepFake Detection via Implicit Spatial-Domain Notch Filtering0
Synthetic Convolutional Features for Improved Semantic Segmentation0
Conditional Image Generation with One-Vs-All Classifier0
TreeGAN: Incorporating Class Hierarchy into Image Generation0
Deep Sinogram Completion with Image Prior for Metal Artifact Reduction in CT Images0
Conditional Spoken Digit Generation with StyleGAN0
not-so-BigGAN: Generating High-Fidelity Images on Small Compute with Wavelet-based Super-Resolution0
Improved Modeling of 3D Shapes with Multi-view Depth MapsCode0
TiVGAN: Text to Image to Video Generation with Step-by-Step Evolutionary Generator0
Simulation of an Elevator Group Control Using Generative Adversarial Networks and Related AI Tools0
Decontextualized learning for interpretable hierarchical representations of visual patternsCode0
On the Intrinsic Robustness of NVM Crossbars Against Adversarial Attacks0
Causal Adversarial Network for Learning Conditional and Interventional Distributions0
CA-GAN: Weakly Supervised Color Aware GAN for Controllable Makeup Transfer0
Perceptual underwater image enhancement with deep learning and physical priors0
Causal Future Prediction in a Minkowski Space-Time0
Improving Text to Image Generation using Mode-seeking Function0
Self-Supervised Ultrasound to MRI Fetal Brain Image SynthesisCode0
A New Perspective on Stabilizing GANs training: Direct Adversarial TrainingCode0
Person image generation with semantic attention network for person re-identification0
Category Level Object Pose Estimation via Neural Analysis-by-Synthesis0
Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder0
Uncertainty Quantification using Variational Inference for Biomedical Image SegmentationCode0
Implanting Synthetic Lesions for Improving Liver Lesion Segmentation in CT Exams0
Multimodal Image-to-Image Translation via Mutual Information Estimation and Maximization0
Show:102550
← PrevPage 117 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified