SOTAVerified

Image Generation

Image generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

  • Unconditional generation refers to sampling images without any conditioning input, i.e. from $p(y)$.
  • Conditional image generation (subtask) refers to sampling images conditioned on a label $x$, i.e. from $p(y|x)$.
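
The difference between the two settings can be illustrated with a toy sketch. Everything in it is a hypothetical stand-in rather than a real image model: each "image" $y$ is a single number, the labels `cat` and `dog` and their Gaussian parameters are invented for illustration, and unconditional sampling is implemented by marginalising over the label, $p(y) = \sum_x p(x)\,p(y|x)$.

```python
import random
import statistics

# Hypothetical toy setup: each label x has a 1-D Gaussian over a scalar "image" y.
CLASS_PRIOR = {"cat": 0.5, "dog": 0.5}                  # p(x), assumed for illustration
CLASS_PARAMS = {"cat": (-2.0, 0.5), "dog": (2.0, 0.5)}  # (mean, std) of p(y | x)

def sample_conditional(label, rng):
    """Draw y ~ p(y | x = label)."""
    mean, std = CLASS_PARAMS[label]
    return rng.gauss(mean, std)

def sample_unconditional(rng):
    """Draw y ~ p(y) by first drawing a label from p(x), then y from p(y | x)."""
    labels = list(CLASS_PRIOR)
    weights = [CLASS_PRIOR[name] for name in labels]
    label = rng.choices(labels, weights=weights, k=1)[0]
    return sample_conditional(label, rng)

rng = random.Random(0)
cond_samples = [sample_conditional("cat", rng) for _ in range(2000)]
uncond_samples = [sample_unconditional(rng) for _ in range(2000)]

# Conditional samples cluster around the "cat" mode; unconditional ones cover both modes.
print("conditional mean:  ", statistics.mean(cond_samples))
print("unconditional mean:", statistics.mean(uncond_samples))
```

In this sketch the conditional samples stay near the chosen label's mode, while the unconditional samples spread across every mode of the data. A real unconditional generator would typically learn $p(y)$ directly rather than sampling a label first.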

This section lists state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 3351-3400 of 6689 papers

| Title | Status | Hype |
|---|---|---|
| LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis | | 0 |
| Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding | | 0 |
| LG-VQ: Language-Guided Codebook Learning | | 0 |
| LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models | | 0 |
| Weakly-Supervised Stitching Network for Real-World Panoramic Image Generation | | 0 |
| Lifelong GAN: Continual Learning for Conditional Image Generation | | 0 |
| A self-attention-based differentially private tabular GAN with high data utility | | 0 |
| LightIt: Illumination Modeling and Control for Diffusion Models | | 0 |
| Lightweight Long-Range Generative Adversarial Networks | | 0 |
| Training Data Protection with Compositional Diffusion Models | | 0 |
| LIME: Localized Image Editing via Attention Regularization in Diffusion Models | | 0 |
| LineArt: A Knowledge-guided Training-free High-quality Appearance Transfer for Design Drawing with Diffusion Model | | 0 |
| Line Artist: A Multiple Style Sketch to Painting Synthesis Scheme | | 0 |
| DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust Classifiers | | 0 |
| LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis | | 0 |
| AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation | | 0 |
| LiT: Delving into a Simplified Linear Diffusion Transformer for Image Generation | | 0 |
| LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models | | 0 |
| LMFusion: Adapting Pretrained Language Models for Multimodal Generation | | 0 |
| A Scaling Law for Syn-to-Real Transfer: How Much Is Your Pre-training Effective? | | 0 |
| Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning | | 0 |
| Training Diffusion Models with Federated Learning | | 0 |
| LLM as an Art Director (LaDi): Using LLMs to improve Text-to-Media Generators | | 0 |
| Training End-to-end Single Image Generators without GANs | | 0 |
| Artworks Reimagined: Exploring Human-AI Co-Creation through Body Prompting | | 0 |
| Training Energy-Based Models with Diffusion Contrastive Divergences | | 0 |
| ARTIST: Improving the Generation of Text-rich Images with Disentangled Diffusion Models and Large Language Models | | 0 |
| Training face verification models from generated face identity data | | 0 |
| Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis | | 0 |
| LM-IGTD: a 2D image generator for low-dimensional and mixed-type tabular data to leverage the potential of convolutional neural networks | | 0 |
| Load Balanced GANs for Multi-view Face Image Synthesis | | 0 |
| Abnormal Colon Polyp Image Synthesis Using Conditional Adversarial Networks for Improved Detection Performance | | 0 |
| Local Attention Pyramid for Scene Image Generation | | 0 |
| Local Conditional Controlling for Text-to-Image Diffusion Models | | 0 |
| Local Curvature Smoothing with Stein's Identity for Efficient Score Matching | | 0 |
| Local Differential Privacy Image Generation Using Flow-based Deep Generative Models | | 0 |
| Locality-Aware Generalizable Implicit Neural Representation | | 0 |
| Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation | | 0 |
| Training-free Diffusion Acceleration with Bottleneck Sampling | | 0 |
| Localized Text-to-Image Generation for Free via Cross Attention Control | | 0 |
| Localizing and Editing Knowledge in Text-to-Image Generative Models | | 0 |
| Locally-Focused Face Representation for Sketch-to-Image Generation Using Noise-Induced Refinement | | 0 |
| WEM-GAN: Wavelet transform based facial expression manipulation | | 0 |
| Local Statistics for Generative Image Detection | | 0 |
| LoCo: Locally Constrained Training-Free Layout-to-Image Synthesis | | 0 |
| Training-free Editioning of Text-to-Image Models | | 0 |
| LogicRank: Logic Induced Reranking for Generative Text-to-Image Systems | | 0 |
| ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary | | 0 |
| Training-Free Identity Preservation in Stylized Image Generation Using Diffusion Models | | 0 |
| LoomNet: Enhancing Multi-View Image Generation via Latent Space Weaving | | 0 |

Benchmark Results
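
Most entries in the tables below are ranked by FID (Fréchet Inception Distance), where lower is better. As a reminder, in its standard definition FID fits one Gaussian to Inception-v3 features of real images and another to features of generated images, then reports the Fréchet distance between the two. The symbols $(\mu_r, \Sigma_r)$ and $(\mu_g, \Sigma_g)$ for the real and generated feature means and covariances are notation introduced for this note, not taken from the leaderboard:

$$
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2 + \operatorname{Tr}\!\left( \Sigma_r + \Sigma_g - 2\,(\Sigma_r \Sigma_g)^{1/2} \right)
$$

Each table appears to correspond to a separate benchmark, so scores are best compared only within a table.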

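| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Improved DDPM | FID | 12.3 | | Unverified |
| 2 | ADM | FID | 11.84 | | Unverified |
| 3 | BigGAN-deep | FID | 8.1 | | Unverified |
| 4 | Polarity-BigGAN | FID | 6.82 | | Unverified |
| 5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified |
| 6 | MaskGIT | FID | 6.18 | | Unverified |
| 7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified |
| 8 | CDM | FID | 4.88 | | Unverified |
| 9 | ADM-G | FID | 4.59 | | Unverified |
| 10 | RIN | FID | 4.51 | | Unverified |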
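
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PresGAN | FID | 52.2 | | Unverified |
| 2 | RESFLOW | FID | 48.29 | | Unverified |
| 3 | Residual Flow | FID | 46.37 | | Unverified |
| 4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified |
| 5 | ProdPoly no activation functions | FID | 40.45 | | Unverified |
| 6 | ProdPoly no activation functions | FID | 36.77 | | Unverified |
| 7 | ACGAN | FID | 35.47 | | Unverified |
| 8 | DenseFlow-74-10 | FID | 34.9 | | Unverified |
| 9 | NVAE w/ flow | FID | 32.53 | | Unverified |
| 10 | QSNGAN | FID | 31.97 | | Unverified |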
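
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GLIDE + CLS | FID | 30.87 | | Unverified |
| 2 | GLIDE + CLIP | FID | 30.46 | | Unverified |
| 3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified |
| 4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified |
| 5 | PGMGAN | FID | 21.73 | | Unverified |
| 6 | CLR-GAN | FID | 20.27 | | Unverified |
| 7 | FM | FID | 14.45 | | Unverified |
| 8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified |
| 9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified |
| 10 | GLIDE + CLS | KID | 7.95 | | Unverified |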