SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 57515800 of 6689 papers

TitleStatusHype
Example-Guided Image Synthesis using Masked Spatial-Channel Attention and Self-Supervision0
Example-Guided Scene Image Synthesis using Masked Spatial-Channel Attention and Patch-Based Self-Supervision0
Example-Guided Style-Consistent Image Synthesis From Semantic Labeling0
TCFG: Tangential Damping Classifier-free Guidance0
Exemplar-based Generative Facial Editing0
Exocentric to Egocentric Image Generation via Parallel Generative Adversarial Network0
Experiments on Generative AI-Powered Parametric Modeling and BIM for Architectural Design0
Explainable Concept Generation through Vision-Language Preference Learning0
TcGAN: Semantic-Aware and Structure-Preserved GANs with Individual Vision Transformer for Fast Arbitrary One-Shot Image Generation0
TCIG: Two-Stage Controlled Image Generation with Quality Enhancement through Diffusion0
Explicit Gradient Learning0
Explicit Gradient Learning for Black-Box Optimization0
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model0
Exploiting Knowledge Distillation for Few-Shot Image Generation0
Exploiting Relationship for Complex-scene Image Generation0
Exploiting Watermark-Based Defense Mechanisms in Text-to-Image Diffusion Models for Unauthorized Data Usage0
Explore the Power of Synthetic Data on Few-shot Object Detection0
Explore the vulnerability of black-box models via diffusion models0
Exploring Cellular Protein Localization Through Semantic Image Synthesis0
Exploring Compositional Visual Generation with Latent Classifier Guidance0
Visual Relationship Detection using Scene Graphs: A Survey0
TDRI: Two-Phase Dialogue Refinement and Co-Adaptation for Interactive Image Generation0
Exploring Disentangled and Controllable Human Image Synthesis: From End-to-End to Stage-by-Stage0
VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL0
Teaching Metric Distance to Autoregressive Multimodal Foundational Models0
Exploring Intra-Class Variation Factors With Learnable Cluster Prompts for Semi-Supervised Image Synthesis0
Exploring Language Patterns of Prompts in Text-to-Image Generation and Their Impact on Visual Diversity0
Teaching Text-to-Image Models to Communicate in Dialog0
Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation0
Exploring Social Bias in Downstream Applications of Text-to-Image Foundation Models0
Exploring StyleGAN Latent Space for Face Alignment with Limited Training Data0
BitMark for Infinity: Watermarking Bitwise Autoregressive Image Generative Models0
Exploring the Effectiveness of Deep Features from Domain-Specific Foundation Models in Retinal Image Synthesis0
Exploring the Effectiveness of Mask-Guided Feature Modulation as a Mechanism for Localized Style Editing of Real Images0
Exploring the Potential of Generative AI for the World Wide Web0
Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models0
Exploring Transformer Backbones for Image Diffusion Models0
Exploring Variational Autoencoders for Medical Image Generation: A Comprehensive Study0
Exploring Visual Vulnerabilities via Multi-Loss Adversarial Search for Jailbreaking Vision-Language Models0
Bi-parametric prostate MR image synthesis using pathology and sequence-conditioned stable diffusion0
Exposing GAN-synthesized Faces Using Landmark Locations0
Exposing the Fake: Effective Diffusion-Generated Images Detection0
Bracket Diffusion: HDR Image Generation by Consistent LDR Denoising0
BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys0
ExSinGAN: Learning an Explainable Generative Model from a Single Image0
TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis0
SkyDiffusion: Ground-to-Aerial Image Synthesis with Diffusion Models and BEV Paradigm0
Extracting Semantic Knowledge from GANs with Unsupervised Learning0
Extrapolating Prospective Glaucoma Fundus Images through Diffusion Model in Irregular Longitudinal Sequences0
ExtremeAIGC: Benchmarking LMM Vulnerability to AI-Generated Extremist Content0
Show:102550
← PrevPage 116 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified