SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 52015250 of 6689 papers

TitleStatusHype
LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems0
Deformation equivariant cross-modality image synthesis with paired non-aligned training dataCode0
Adaptively-Realistic Image Generation from Stroke and Sketch with Diffusion Model0
Understanding Diffusion Models: A Unified Perspective0
Unsupervised Structure-Consistent Image-to-Image Translation0
DepthFake: a depth-based strategy for detecting Deepfake videos0
The Value of AI Guidance in Human Examination of Synthetically-Generated Faces0
FurryGAN: High Quality Foreground-aware Image Synthesis0
Vision-Language Matching for Text-to-Image Synthesis via Generative Adversarial Networks0
Text to Image Generation: Leaving no Language Behind0
Discovering Bugs in Vision Models using Off-the-shelf Image Generation and Captioning0
T-Person-GAN: Text-to-Person Image Generation with Identity-Consistency and Manifold Mix-UpCode0
Enhancing Diffusion-Based Image Synthesis with Robust Classifier GuidanceCode0
Time flies by: Analyzing the Impact of Face Ageing on the Recognition Performance with Synthetic Data0
Understanding Attention for Vision-and-Language TasksCode0
Text-to-Image Generation via Implicit Visual Guidance and Hypernetwork0
Novel Deep Learning Approach to Derive Cytokeratin Expression and Epithelium Segmentation from DAPI0
Riemannian Diffusion Models0
SGM-Net: Semantic Guided Matting Net0
Memory-Driven Text-to-Image Generation0
Layout-Bridging Text-to-Image Synthesis0
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion DesignCode0
Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics Driven Reinforcement Learning0
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and GenerationCode0
Inflating 2D Convolution Weights for Efficient Generation of 3D Medical Images0
Seamless Iterative Semi-Supervised Correction of Imperfect Labels in Microscopy ImagesCode0
Artificial Image Tampering Distorts Spatial Distribution of Texture Landmarks and Quality Characteristics0
Adversarial Attacks on Image Generation With Made-Up Words0
Pyramidal Denoising Diffusion Probabilistic Models0
Character Generation through Self-Supervised Vectorization0
Counterfactual Image Synthesis for Discovery of Personalized Predictive Image Markers0
AdaWCT: Adaptive Whitening and Coloring Style Injection0
Momentum Transformer: Closing the Performance Gap Between Self-attention and Its Linearization0
Testing Relational Understanding in Text-Guided Image Generation0
Conservative Generator, Progressive Discriminator: Coordination of Adversaries in Few-shot Incremental Image Synthesis0
Generative Steganography Network0
Automated Classification of Nanoparticles with Various Ultrastructures and Sizes0
Leveraging GAN Priors for Few-Shot Part SegmentationCode0
PASTA-GAN++: A Versatile Framework for High-Resolution Unpaired Virtual Try-on0
Vector Quantized Image-to-Image Translation0
Contrastive Image Synthesis and Self-supervised Feature Adaptation for Cross-Modality Biomedical Image SegmentationCode0
Multi-Forgery Detection Challenge 2022: Push the Frontier of Unconstrained and Diverse Forgery Detection0
Few-shot Image Generation Using Discrete Content Representation0
Auto-regressive Image Synthesis with Integrated Quantization0
Image Generation Network for Covert Transmission in Online Social Network0
Everything is There in Latent Space: Attribute Editing and Attribute Style Manipulation by StyleGAN Latent Space Exploration0
Cross-Modal Contrastive Representation Learning for Audio-to-Image Generation0
2D GANs Meet Unsupervised Single-view 3D Reconstruction0
Image Synthesis with Disentangled Attributes for Chest X-Ray Nodule Augmentation and Detection0
On the Study of Sample Complexity for Polynomial Neural Networks0
Show:102550
← PrevPage 105 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified