SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 44514500 of 6689 papers

TitleStatusHype
A Framework For Image Synthesis Using Supervised Contrastive Learning0
Using LLMs as prompt modifier to avoid biases in AI image generators0
SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP0
SAR Image Synthesis with Diffusion Models0
SAR-NeRF: Neural Radiance Fields for Synthetic Aperture Radar Multi-View Representation0
SAR Target Image Generation Method Using Azimuth-Controllable Generative Adversarial Network0
Using Photorealistic Face Synthesis and Domain Adaptation to Improve Facial Expression Analysis0
Scaffolding Creativity: Integrating Generative AI Tools and Real-world Experiences in Business Education0
A Framework and Dataset for Abstract Art Generation via CalligraphyGAN0
A Flexible Measurement of Diversity in Datasets with Random Network Distillation0
Affordance Diffusion: Synthesizing Hand-Object Interactions0
Affective Image Filter: Reflecting Emotions from Text to Images0
Scalable Ranked Preference Optimization for Text-to-Image Generation0
Scalable, Tokenization-Free Diffusion Model Architectures with Efficient Initial Convolution and Fixed-Size Reusable Structures for On-Device Image Generation0
AffectGAN: Affect-Based Generative Art Driven by Semantics0
Scaling Diffusion Mamba with Bidirectional SSMs for Efficient Image and Video Generation0
AFAN: Augmented Feature Alignment Network for Cross-Domain Object Detection0
AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production0
Scaling Inference Time Compute for Diffusion Models0
Scaling Laws For Diffusion Transformers0
Adversarial Training For Sketch Retrieval0
Using Physics Informed Generative Adversarial Networks to Model 3D porous media0
Using Scene Graph Context to Improve Image Generation0
Using Text-to-Image Generation for Architectural Design Ideation0
Adversarial Text-to-Image Synthesis: A Review0
Scene Aware Person Image Generation through Global Contextual Conditioning0
SceneBooth: Diffusion-based Framework for Subject-preserved Text-to-Image Generation0
SceneComposer: Any-Level Semantic Image Synthesis0
Adversarial Text to Continuous Image Generation0
SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis0
Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation0
Scene Graph to Image Generation with Contextualized Object Layout Refinement0
Image Synthesis with Graph Conditioning: CLIP-Guided Diffusion Models for Scene Graphs0
Layout Agnostic Scene Text Image Synthesis with Diffusion Models0
U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models0
Scene Text Magnifier0
Scene Text Synthesis for Efficient and Effective Deep Network Training0
Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation0
Science-T2I: Addressing Scientific Illusions in Image Synthesis0
FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency0
SciRE-Solver: Accelerating Diffusion Models Sampling by Score-integrand Solver with Recursive Difference0
SCoFT: Self-Contrastive Fine-Tuning for Equitable Image Generation0
SCONE-GAN: Semantic Contrastive learning-based Generative Adversarial Network for an end-to-end image translation0
Score-based Conditional Generation with Fewer Labeled Data by Self-calibrating Classifier Guidance0
Utilizing Generative Adversarial Networks for Image Data Augmentation and Classification of Semiconductor Wafer Dicing Induced Defects0
UU-Nets Connecting Discriminator and Generator for Image to Image Translation0
XOGAN: One-to-Many Unsupervised Image-to-Image Translation0
Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian probability distributions0
Score Distillation Sampling with Learned Manifold Corrective0
Adversarial Pseudo Healthy Synthesis Needs Pathology Factorization0
Show:102550
← PrevPage 90 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified