SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 65016550 of 6689 papers

TitleStatusHype
Image-Guided Microstructure Optimization using Diffusion Models: Validated with Li-Mn-rich Cathode Precursors0
Image-guided Neural Object Rendering0
Image Inpainting Models are Effective Tools for Instruction-guided Image Editing0
A Two Stage GAN for High Resolution Retinal Image Generation and Segmentation0
Imagen Video: High Definition Video Generation with Diffusion Models0
ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation0
Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models0
Accelerating Score-based Generative Models for High-Resolution Image Synthesis0
ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image0
Accelerating Mobile Edge Generation (MEG) by Constrained Learning0
Image Super-Resolution With Deep Variational Autoencoders0
TiVGAN: Text to Image to Video Generation with Step-by-Step Evolutionary Generator0
Image Synthesis as a Pretext for Unsupervised Histopathological Diagnosis0
Image Synthesis-based Late Stage Cancer Augmentation and Semi-Supervised Segmentation for MRI Rectal Cancer Staging0
Image Synthesis for Data Augmentation in Medical CT using Deep Reinforcement Learning0
TL-GAN: Improving Traffic Light Recognition via Data Synthesis for Autonomous Driving0
TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP0
ToddlerDiffusion: Interactive Structured Image Generation with Cascaded Schrödinger Bridge0
Image Synthesis via Semantic Composition0
Toffee: Efficient Million-Scale Dataset Construction for Subject-Driven Text-to-Image Generation0
Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation0
Image Synthesis with Disentangled Attributes for Chest X-Ray Nodule Augmentation and Detection0
Attribute Regularized Soft Introspective VAE: Towards Cardiac Attribute Regularization Through MRI Domains0
Image-to-Image Translation: Methods and Applications0
Image Transformer0
Attribute Controllable Beautiful Caucasian Face Generation by Aesthetics Driven Reinforcement Learning0
Accelerating Image Generation with Sub-path Linear Approximation Model0
Token Fusion: Bridging the Gap between Token Pruning and Token Merging0
IMAGINE: Image Synthesis by Image-Guided Model Inversion0
Imagine yourself: Tuning-Free Personalized Image Generation0
Attribute-Centric Compositional Text-to-Image Generation0
Imperceptible Protection against Style Imitation from Diffusion Models0
Implanting Synthetic Lesions for Improving Liver Lesion Segmentation in CT Exams0
Implicit and Explicit Language Guidance for Diffusion-based Visual Perception0
Accelerating Diffusion Models with One-to-Many Knowledge Distillation0
Implicit Priors Editing in Stable Diffusion via Targeted Token Adjustment0
Accelerating Diffusion Models via Pre-segmentation Diffusion Sampling for Medical Image Segmentation0
Importance-Based Token Merging for Efficient Image and Video Generation0
Accelerate High-Quality Diffusion Models with Inner Loop Feedback0
Impression Space from Deep Template Network0
Improved Consistency Regularization for GANs0
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation0
Improved Image Generation via Sparse Modeling0
Improved Image Generation via Sparsity0
Improved Masked Image Generation with Token-Critic0
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models0
Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks0
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models0
TopicGAN: Unsupervised Text Generation from Explainable Latent Topics0
AttentionRNN: A Structured Spatial Attention Mechanism0
Show:102550
← PrevPage 131 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified