SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 27012750 of 6689 papers

TitleStatusHype
Long Tail Image Generation Through Feature Space Augmentation and Iterated LearningCode0
Covid-19 chest x-ray image generation using resnet-dcgan modelCode0
LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-TuningCode0
Active Generation for Image ClassificationCode0
LOGAN: Latent Optimisation for Generative Adversarial NetworksCode0
Auto-painter: Cartoon Image Generation from Sketch by Using Conditional Generative Adversarial NetworksCode0
Flow Plugin Network for conditional generationCode0
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through OptionsCode0
LoFT: LoRA-fused Training Dataset Generation with Few-shot GuidanceCode0
Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture DesignCode0
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention MapsCode0
Automatic Semantic Style Transfer using Deep Convolutional Neural Networks and Soft MasksCode0
LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and QuantizationCode0
An Inversion-based Measure of Memorization for Diffusion ModelsCode0
Automatic Generation of Semantic Parts for Face Image SynthesisCode0
Lemotif: An Affective Visual Journal Using Deep Neural NetworksCode0
Leveraging GAN Priors for Few-Shot Part SegmentationCode0
Learning to Rearrange Voxels in Binary Segmentation Masks for Smooth Manifold TriangulationCode0
Learning with Stochastic OrdersCode0
FlexControl: Computation-Aware ControlNet with Differentiable Router for Text-to-Image GenerationCode0
Learning to Generate with MemoryCode0
Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image SynthesisCode0
Learn to synthesize and synthesize to learnCode0
Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit AssignmentCode0
Learning to Decouple and Generate Seismic Random Noise via Invertible Neural NetworkCode0
Learning Texture Manifolds with the Periodic Spatial GANCode0
Learning Stationary Markov Processes with Contrastive AdjustmentCode0
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion ModelingCode0
Neural Wireframe Renderer: Learning Wireframe to Image TranslationsCode0
FissionVAE: Federated Non-IID Image Generation with Latent Space and Decoder DecompositionCode0
Learning Portrait Style RepresentationsCode0
ACT-Diffusion: Efficient Adversarial Consistency Training for One-step Diffusion ModelsCode0
Generative Latent FlowCode0
Generative Max-Mahalanobis Classifiers for Image Classification, Generation and MoreCode0
Learning Partonomic 3D Reconstruction from Image CollectionsCode0
First Order Generative Adversarial NetworksCode0
Generative Modeling by Inclusive Neural Random Fields with Applications in Image Generation and Anomaly DetectionCode0
AutoLoss: Learning Discrete Schedules for Alternate OptimizationCode0
Loss-Sensitive Generative Adversarial Networks on Lipschitz DensitiesCode0
Generative modeling of seismic data using score-based generative modelsCode0
Fine Tuning Text-to-Image Diffusion Models for Correcting Anomalous ImagesCode0
Generative Modeling with Explicit MemoryCode0
Learning Hierarchical Semantic Image Manipulation through Structured RepresentationsCode0
CoPE: Conditional image generation using Polynomial ExpansionsCode0
Learning Generative Models of Tissue Organization with Supervised GANsCode0
AutoGAN: Neural Architecture Search for Generative Adversarial NetworksCode0
Multivariate rank via entropic optimal transport: sample efficiency and generative modelingCode0
Learning Graph Representation of Agent DiffusersCode0
Learning Spatially-Adaptive Squeeze-Excitation Networks for Image Synthesis and Image RecognitionCode0
Learning Disentangled Representations via Mutual Information EstimationCode0
Show:102550
← PrevPage 55 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified