SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 54515500 of 6689 papers

TitleStatusHype
A Generative Adversarial Network for AI-Aided Chair Design0
SakugaFlow: A Stagewise Illustration Framework Emulating the Human Drawing Process and Providing Interactive Tutoring for Novice Drawing Skills0
Arbitrary Handwriting Image Style Transfer0
Saliency Guided Optimization of Diffusion Latents0
SALSA-TEXT : self attentive latent space based adversarial text generation0
Using GANs to Synthesise Minimum Training Data for Deepfake Generation0
SAM-GAN: Self-Attention supporting Multi-stage Generative Adversarial Networks for text-to-image synthesis0
SAM.MD: Zero-shot medical image segmentation capabilities of the Segment Anything Model0
Sampling From Autoencoders' Latent Space via Quantization And Probability Mass Function Concepts0
AGAN: Towards Automated Design of Generative Adversarial Networks0
A Framework For Image Synthesis Using Supervised Contrastive Learning0
Using LLMs as prompt modifier to avoid biases in AI image generators0
SANER: Annotation-free Societal Attribute Neutralizer for Debiasing CLIP0
SAR Image Synthesis with Diffusion Models0
SAR-NeRF: Neural Radiance Fields for Synthetic Aperture Radar Multi-View Representation0
SAR Target Image Generation Method Using Azimuth-Controllable Generative Adversarial Network0
Using Photorealistic Face Synthesis and Domain Adaptation to Improve Facial Expression Analysis0
Scaffolding Creativity: Integrating Generative AI Tools and Real-world Experiences in Business Education0
A Framework and Dataset for Abstract Art Generation via CalligraphyGAN0
A Flexible Measurement of Diversity in Datasets with Random Network Distillation0
Affordance Diffusion: Synthesizing Hand-Object Interactions0
Affective Image Filter: Reflecting Emotions from Text to Images0
Scalable Ranked Preference Optimization for Text-to-Image Generation0
Scalable, Tokenization-Free Diffusion Model Architectures with Efficient Initial Convolution and Fixed-Size Reusable Structures for On-Device Image Generation0
AffectGAN: Affect-Based Generative Art Driven by Semantics0
Scaling Diffusion Mamba with Bidirectional SSMs for Efficient Image and Video Generation0
AFAN: Augmented Feature Alignment Network for Cross-Domain Object Detection0
AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production0
Scaling Inference Time Compute for Diffusion Models0
Scaling Laws For Diffusion Transformers0
Adversarial Training For Sketch Retrieval0
Using Physics Informed Generative Adversarial Networks to Model 3D porous media0
Using Scene Graph Context to Improve Image Generation0
Using Text-to-Image Generation for Architectural Design Ideation0
Adversarial Text-to-Image Synthesis: A Review0
Scene Aware Person Image Generation through Global Contextual Conditioning0
SceneBooth: Diffusion-based Framework for Subject-preserved Text-to-Image Generation0
SceneComposer: Any-Level Semantic Image Synthesis0
Adversarial Text to Continuous Image Generation0
SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis0
Scene Graph Disentanglement and Composition for Generalizable Complex Image Generation0
Scene Graph to Image Generation with Contextualized Object Layout Refinement0
Image Synthesis with Graph Conditioning: CLIP-Guided Diffusion Models for Scene Graphs0
Layout Agnostic Scene Text Image Synthesis with Diffusion Models0
U-Sketch: An Efficient Approach for Sketch to Image Diffusion Models0
Scene Text Magnifier0
Scene Text Synthesis for Efficient and Effective Deep Network Training0
Schedule On the Fly: Diffusion Time Prediction for Faster and Better Image Generation0
Science-T2I: Addressing Scientific Illusions in Image Synthesis0
FreqPolicy: Efficient Flow-based Visuomotor Policy via Frequency Consistency0
Show:102550
← PrevPage 110 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified