SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 32513300 of 6689 papers

TitleStatusHype
Diffscaler: Enhancing the Generative Prowess of Diffusion Transformers0
Improving Diffusion-Based Image Editing Faithfulness via Guidance and Scheduling0
Improving Diffusion-Based Image Synthesis with Context Prediction0
Improving Editability in Image Generation with Layer-wise Memory0
An Analysis on Quantizing Diffusion Transformers0
Generative Steganography Network0
Generative Steganography Diffusion0
Adaptive Multi-stage Density Ratio Estimation for Learning Latent Space Energy-based Model0
Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation0
Improving GANs with A Dynamic Discriminator0
Latent Space Consistency for Sparse-View CT Reconstruction0
Generative Steganographic Flow0
DDRF: Denoising Diffusion Model for Remote Sensing Image Fusion0
Beware of Aliases -- Signal Preservation is Crucial for Robust Image Restoration0
Generative Sensing: Transforming Unreliable Sensor Data for Reliable Recognition0
Generative Self-training for Cross-domain Unsupervised Tagged-to-Cine MRI Synthesis0
DDPM based X-ray Image Synthesizer0
An Analysis of Human Alignment of Latent Diffusion Models0
Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models0
Generative Probabilistic Image Colorization0
Generative Portrait Shadow Removal0
DDMM-Synth: A Denoising Diffusion Model for Cross-modal Medical Image Synthesis with Sparse-view Measurement Embedding0
Latent Space Bayesian Optimization with Latent Data Augmentation for Enhanced Exploration0
Generative OpenMax for Multi-Class Open Set Classification0
Analyzing the Latent Space of GAN through Local Dimension Estimation0
DiffuseGAE: Controllable and High-fidelity Image Manipulation from Disentangled Representation0
DDIM-Driven Coverless Steganography Scheme with Real Key0
3D-LDM: Neural Implicit 3D Shape Generation with Latent Diffusion Models0
Cross Modality Medical Image Synthesis for Improving Liver Segmentation0
Generative networks as inverse problems with fractional wavelet scattering networks0
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens0
Beta Sampling is All You Need: Efficient Image Generation Strategy for Diffusion Models using Stepwise Spectral Analysis0
Adaptive Multiplane Image Generation from a Single Internet Picture0
LatentQGAN: A Hybrid QGAN with Classical Convolutional Autoencoder0
Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation0
Latent Space Disentanglement in Diffusion Transformers Enables Precise Zero-shot Semantic Editing0
LatexBlend: Scaling Multi-concept Customized Generation with Latent Textual Blending0
Latent Diffusion Models for Structural Component Design0
Latent Dirichlet Allocation in Generative Adversarial Networks0
Generative models for visualising abstract social processes: Guiding streetview image synthesis of StyleGAN2 with indices of deprivation0
DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models0
Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models0
Latent Drifting in Diffusion Models for Counterfactual Medical Image Synthesis0
In-Domain GAN Inversion for Faithful Reconstruction and Editability0
Generative Modelling with High-Order Langevin Dynamics0
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer0
Benefiting Deep Latent Variable Models via Learning the Prior and Removing Latent Regularization0
Inference-time Scaling of Diffusion Models through Classical Search0
Adaptive Multi-modal Fusion of Spatially Variant Kernel Refinement with Diffusion Model for Blind Image Super-Resolution0
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation0
Show:102550
← PrevPage 66 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified