SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 65016550 of 6689 papers

TitleStatusHype
On Learning 3D Face Morphable Model from In-the-wild ImagesCode0
ρ-Diffusion: A diffusion-based density estimation framework for computational physicsCode0
ConvGeN: Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasetsCode0
HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image GenerationCode0
mFI-PSO: A Flexible and Effective Method in Adversarial Image Generation for Deep Neural NetworksCode0
Video Generation from Single Semantic Label MapCode0
WeditGAN: Few-Shot Image Generation via Latent Space RelocationCode0
cGANs with Projection DiscriminatorCode0
Co-occurrence Based Texture SynthesisCode0
Emerging Convolutions for Generative Normalizing FlowsCode0
On the Cultural Gap in Text-to-Image GenerationCode0
Unmasking Bias in Diffusion Model TrainingCode0
AutoGAN: Neural Architecture Search for Generative Adversarial NetworksCode0
Hi-CMD: Hierarchical Cross-Modality Disentanglement for Visible-Infrared Person Re-IdentificationCode0
LoFT: LoRA-fused Training Dataset Generation with Few-shot GuidanceCode0
Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form SolutionsCode0
LOGAN: Latent Optimisation for Generative Adversarial NetworksCode0
Which Models have Perceptually-Aligned Gradients? An Explanation via Off-Manifold RobustnessCode0
LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-TuningCode0
ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation ModelsCode0
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine TranslationCode0
Longitudinal Causal Image SynthesisCode0
Long Tail Image Generation Through Feature Space Augmentation and Iterated LearningCode0
On the Diversity of Realistic Image SynthesisCode0
Boosting Alignment for Post-Unlearning Text-to-Image Generative ModelsCode0
Hierarchical Residual Learning Based Vector Quantized Variational Autoencoder for Image Reconstruction and GenerationCode0
Auto-Embedding Generative Adversarial Networks for High Resolution Image SynthesisCode0
CoCoG-2: Controllable generation of visual stimuli for understanding human concept representationCode0
Hierarchical VAE with a Diffusion-based VampPriorCode0
On the Issues of TrueDepth Sensor Data for Computer Vision Tasks Across Different iPad GenerationsCode0
DRMC: A Generalist Model with Dynamic Routing for Multi-Center PET Image SynthesisCode0
High-Dynamic-Range Imaging for Cloud SegmentationCode0
Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion modelCode0
COCO-GAN: Generation by Parts via Conditional CoordinatingCode0
Coordinate-based Texture Inpainting for Pose-Guided Image GenerationCode0
A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative ModelsCode0
Loss-Sensitive Generative Adversarial Networks on Lipschitz DensitiesCode0
DeshuffleGAN: A Self-Supervised GAN to Improve Structure LearningCode0
CoPE: Conditional image generation using Polynomial ExpansionsCode0
Theory and Experiments on Vector Quantized AutoencodersCode0
Self-supervised GAN: Analysis and Improvement with Multi-class Minimax GameCode0
Real-Time Person Image Synthesis Using a Flow Matching ModelCode0
CODE: Confident Ordinary Differential EditingCode0
High-Frequency Anti-DreamBooth: Robust Defense against Personalized Image SynthesisCode0
The relativistic discriminator: a key element missing from standard GANCode0
LR-GAN: Layered Recursive Generative Adversarial Networks for Image GenerationCode0
Self-Supervised GANs via Auxiliary Rotation LossCode0
A Layer-Based Sequential Framework for Scene Generation with GANsCode0
Self-Supervised Image-to-Text and Text-to-Image SynthesisCode0
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image SynthesisCode0
Show:102550
← PrevPage 131 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified