SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 41014150 of 6689 papers

TitleStatusHype
Interactive Visual Assessment for Text-to-Image Generation Models0
Interleaved Scene Graphs for Interleaved Text-and-Image Generation Assessment0
Watch Your Steps: Local Image and Scene Editing by Text Instructions0
Towards GAN Benchmarks Which Require Generalization0
Interpretable ODE-style Generative Diffusion Model via Force Field Construction0
Interpreting Generative Adversarial Networks for Interactive Image Generation0
A Survey on Pre-Trained Diffusion Model Distillations0
Interpreting the Latent Space of GANs via Correlation Analysis for Controllable Concept Manipulation0
Intriguing Properties of Diffusion Models: An Empirical Study of the Natural Attack Capability in Text-to-Image Generative Models0
Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search0
Intrinsic Autoencoders for Joint Neural Rendering and Intrinsic Image Decomposition0
Introducing SDICE: An Index for Assessing Diversity of Synthetic Medical Datasets0
Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter0
Invariant Representation via Decoupling Style and Spurious Features from Images0
Invariant Representations for Noisy Speech Recognition0
Inverse Painting: Reconstructing The Painting Process0
A Survey on Personalized Content Synthesis with Diffusion Models0
Towards Generalization of Tactile Image Generation: Reference-Free Evaluation in a Leakage-Free Setting0
Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps0
Watermark-embedded Adversarial Examples for Copyright Protection against Diffusion Models0
Towards Generative Latent Variable Models for Speech0
Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis0
Investigating GANsformer: A Replication Study of a State-of-the-Art Image Generation Model0
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images0
Investigating the Potential of Auxiliary-Classifier GANs for Image Classification in Low Data Regimes0
Investigation of the Impact of Synthetic Training Data in the Industrial Application of Terminal Strip Object Detection0
Towards High-quality HDR Deghosting with Conditional Diffusion Models0
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation0
IP-Composer: Semantic Composition of Visual Concepts0
180-degree Outpainting from a Single Image0
IPT-V2: Efficient Image Processing Transformer using Hierarchical Attentions0
Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text0
IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning0
TrISec: Training Data-Unaware Imperceptible Security Attacks on Deep Neural Networks0
Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration0
Towards Interpretable Counterfactual Generation via Multimodal Autoregression0
Is Deep Learning Network Necessary for Image Generation?0
Inducing Hierarchical Compositional Model by Sparsifying Generator Network0
Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data0
Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise0
Isolated Diffusion: Optimizing Multi-Concept Text-to-Image Generation Training-Freely with Isolated Diffusion Guidance0
A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond0
Is Perturbation-Based Image Protection Disruptive to Image Editing?0
Towards Language-Free Training for Text-to-Image Generation0
Is There Mode Collapse? A Case Study on Face Generation and Its Black-box Calibration0
Towards Learning a Self-inverse Network for Bidirectional Image-to-image Translation0
Iterative Flow Matching -- Path Correction and Gradual Refinement for Enhanced Generative Modeling0
Iterative Multi-granular Image Editing using Diffusion Models0
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models0
Iterative Online Image Synthesis via Diffusion Model for Imbalanced Classification0
Show:102550
← PrevPage 83 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified