SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

  • Unconditional generation draws samples from the data distribution without any conditioning input, i.e. $p(y)$.
  • Conditional image generation (subtask) draws samples conditioned on a label $x$, i.e. $p(y|x)$.
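
The distinction between $p(y)$ and $p(y|x)$ can be illustrated with a toy sketch. It is not a generative model: it only resamples from a small labelled collection, whereas a real model learns an approximation of these distributions. The dataset contents and the function names `sample_unconditional` and `sample_conditional` are hypothetical and chosen purely for illustration.

```python
import random
from collections import defaultdict

# Toy "dataset" of (label x, sample y) pairs; strings stand in for images.
# All names and values here are illustrative assumptions.
DATASET = [
    ("cat", "cat_img_0"),
    ("cat", "cat_img_1"),
    ("dog", "dog_img_0"),
    ("dog", "dog_img_1"),
    ("dog", "dog_img_2"),
]


def sample_unconditional(dataset, rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, y = rng.choice(dataset)
    return y


def sample_conditional(dataset, x, rng):
    """Draw y ~ p(y | x): only samples whose label equals x are eligible."""
    by_label = defaultdict(list)
    for label, y in dataset:
        by_label[label].append(y)
    if x not in by_label:
        raise KeyError(f"no samples with label {x!r}")
    return rng.choice(by_label[x])


rng = random.Random(0)  # fixed seed so repeated runs give the same draws
unconditional = [sample_unconditional(DATASET, rng) for _ in range(5)]
conditional = [sample_conditional(DATASET, "cat", rng) for _ in range(5)]
```

In this sketch, `unconditional` can contain samples from either class, while every entry of `conditional` comes from the `"cat"` subset.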

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 4551–4600 of 6689 papers

| Title | Status | Hype |
|---|---|---|
| Is Deep Learning Network Necessary for Image Generation? |  | 0 |
| A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions |  | 0 |
| CoC-GAN: Employing Context Cluster for Unveiling a New Pathway in Image Generation |  | 0 |
| Learning to Decouple and Generate Seismic Random Noise via Invertible Neural Network | Code | 0 |
| Efficient Transfer Learning in Diffusion Models via Adversarial Noise |  | 0 |
| Augmenting medical image classifiers with synthetic data from latent diffusion models |  | 0 |
| DISGAN: Wavelet-informed Discriminator Guides GAN to MRI Super-resolution with Noise Cleaning | Code | 0 |
| Open Set Synthetic Image Source Attribution |  | 0 |
| Semantic RGB-D Image Synthesis |  | 0 |
| MosaiQ: Quantum Generative Adversarial Networks for Image Generation on NISQ Computers |  | 0 |
| Sampling From Autoencoders' Latent Space via Quantization And Probability Mass Function Concepts |  | 0 |
| Debiasing Counterfactuals In the Presence of Spurious Correlations |  | 0 |
| Hyperspectral Blind Unmixing using a Double Deep Image Prior | Code | 0 |
| SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation |  | 0 |
| I3: Intent-Introspective Retrieval Conditioned on Instructions |  | 0 |
| ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations | Code | 0 |
| Guide3D: Create 3D Avatars from Text and Image Guidance |  | 0 |
| RFDforFin: Robust Deep Forgery Detection for GAN-generated Fingerprint Images |  | 0 |
| DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability |  | 0 |
| Watch Your Steps: Local Image and Scene Editing by Text Instructions |  | 0 |
| Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit Assignment | Code | 0 |
| Diff-CAPTCHA: An Image-based CAPTCHA with Security Enhanced by Denoising Diffusion Model |  | 0 |
| Painter: Teaching Auto-regressive Language Models to Draw Sketches |  | 0 |
| SciRE-Solver: Accelerating Diffusion Models Sampling by Score-integrand Solver with Recursive Difference |  | 0 |
| Semantic-aware Network for Aerial-to-Ground Image Synthesis | Code | 0 |
| MarkovGen: Structured Prediction for Efficient Text-to-Image Generation |  | 0 |
| LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts |  | 0 |
| Camouflaged Image Synthesis Is All You Need to Boost Camouflaged Detection |  | 0 |
| BATINet: Background-Aware Text to Image Synthesis and Manipulation Network |  | 0 |
| SAR Target Image Generation Method Using Azimuth-Controllable Generative Adversarial Network |  | 0 |
| On Error Propagation of Diffusion Models |  | 0 |
| Vector quantization loss analysis in VQGANs: a single-GPU ablation study for image-to-image synthesis | Code | 0 |
| PromptPaint: Steering Text-to-Image Generation Through Paint Medium-like Interactions | Code | 0 |
| TextPainter: Multimodal Text Image Generation with Visual-harmony and Text-comprehension for Poster Design |  | 0 |
| DiffSynth: Latent In-Iteration Deflickering for Realistic Video Synthesis |  | 0 |
| Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation |  | 0 |
| SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation |  | 0 |
| Large-Scale Public Data Improves Differentially Private Image Generation Quality |  | 0 |
| Focus on Content not Noise: Improving Image Generation for Nuclei Segmentation by Suppressing Steganography in CycleGAN |  | 0 |
| Partial Label Supervision for Agnostic Generative Noisy Label Learning | Code | 0 |
| Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation | Code | 0 |
| Reverse Stable Diffusion: What prompt was used to generate this image? | Code | 0 |
| Training Data Protection with Compositional Diffusion Models |  | 0 |
| Experiments on Generative AI-Powered Parametric Modeling and BIM for Architectural Design |  | 0 |
| Metrics to Quantify Global Consistency in Synthetic Medical Images |  | 0 |
| Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models |  | 0 |
| The Bias Amplification Paradox in Text-to-Image Generation | Code | 0 |
| DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation |  | 0 |
| HiREN: Towards Higher Supervision Quality for Better Scene Text Image Super-Resolution |  | 0 |
| Stylized Projected GAN: A Novel Architecture for Fast and Realistic Image Generation |  | 0 |

Benchmark Results

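| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Improved DDPM | FID | 12.3 |  | Unverified |
| 2 | ADM | FID | 11.84 |  | Unverified |
| 3 | BigGAN-deep | FID | 8.1 |  | Unverified |
| 4 | Polarity-BigGAN | FID | 6.82 |  | Unverified |
| 5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 |  | Unverified |
| 6 | MaskGIT | FID | 6.18 |  | Unverified |
| 7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 |  | Unverified |
| 8 | CDM | FID | 4.88 |  | Unverified |
| 9 | ADM-G | FID | 4.59 |  | Unverified |
| 10 | RIN | FID | 4.51 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PresGAN | FID | 52.2 |  | Unverified |
| 2 | RESFLOW | FID | 48.29 |  | Unverified |
| 3 | Residual Flow | FID | 46.37 |  | Unverified |
| 4 | GLF+perceptual loss (ours) | FID | 44.6 |  | Unverified |
| 5 | ProdPoly no activation functions | FID | 40.45 |  | Unverified |
| 6 | ProdPoly no activation functions | FID | 36.77 |  | Unverified |
| 7 | ACGAN | FID | 35.47 |  | Unverified |
| 8 | DenseFlow-74-10 | FID | 34.9 |  | Unverified |
| 9 | NVAE w/ flow | FID | 32.53 |  | Unverified |
| 10 | QSNGAN | FID | 31.97 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GLIDE + CLS | FID | 30.87 |  | Unverified |
| 2 | GLIDE + CLIP | FID | 30.46 |  | Unverified |
| 3 | GLIDE + CLS-FREE | FID | 29.22 |  | Unverified |
| 4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 |  | Unverified |
| 5 | PGMGAN | FID | 21.73 |  | Unverified |
| 6 | CLR-GAN | FID | 20.27 |  | Unverified |
| 7 | FM | FID | 14.45 |  | Unverified |
| 8 | CT (Direct Generation, NFE=1) | FID | 13 |  | Unverified |
| 9 | CT (Direct Generation, NFE=2) | FID | 11.1 |  | Unverified |
| 10 | GLIDE +CLS | KID | 7.95 |  | Unverified |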