SOTAVerified

Image Generation

Image generation (synthesis) is the task of generating new images that follow the distribution of an existing dataset.

  • Unconditional generation samples new images $y$ from the data distribution without any conditioning input, i.e. $p(y)$.
  • Conditional image generation (subtask) samples new images $y$ given a conditioning input $x$, such as a class label, i.e. $p(y|x)$.
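
The difference between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. This is not a generative model: it only resamples a toy labelled dataset, and the names `DATASET`, `sample_unconditional` and `sample_conditional` are hypothetical, chosen for illustration.

```python
import random

# Toy labelled dataset of (label x, image y) pairs.
# Strings stand in for images; a real model would synthesize new pixels.
DATASET = [
    ("cat", "cat_0"), ("cat", "cat_1"), ("cat", "cat_2"),
    ("dog", "dog_0"),
]


def sample_unconditional(rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, image = rng.choice(DATASET)
    return image


def sample_conditional(rng, label):
    """Draw y ~ p(y | x = label): only images with that label are eligible."""
    candidates = [image for x, image in DATASET if x == label]
    if not candidates:
        raise ValueError(f"no images with label {label!r}")
    return rng.choice(candidates)


rng = random.Random(0)
unconditional = [sample_unconditional(rng) for _ in range(1000)]
conditional = [sample_conditional(rng, "dog") for _ in range(1000)]
```

Unconditional draws mix cats and dogs in proportion to the dataset, while every conditional draw for the label `"dog"` is a dog image.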

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.


Papers

Showing 1451–1500 of 6689 papers

Title | Status | Hype
Deep Image Spatial Transformation for Person Image Generation | Code | 1
Edge Guided GANs with Multi-Scale Contrastive Learning for Semantic Image Synthesis | Code | 1
Freestyle Layout-to-Image Synthesis | Code | 1
Freeze the Discriminator: a Simple Baseline for Fine-Tuning GANs | Code | 1
LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation | Code | 1
Learning Flow-based Feature Warping for Face Frontalization with Illumination Inconsistent Supervision | Code | 1
Learning Semantic-aware Normalization for Generative Adversarial Networks | Code | 1
Deep Linear Array Pushbroom Image Restoration: A Degradation Pipeline and Jitter-Aware Restoration Network | Code | 1
LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation | Code | 1
DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation | Code | 1
Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection | Code | 1
From Face to Natural Image: Learning Real Degradation for Blind Image Super-Resolution | Code | 1
Dual Pyramid Generative Adversarial Networks for Semantic Image Synthesis | Code | 1
Latent diffusion models for generative precipitation nowcasting with accurate uncertainty quantification | Code | 1
Deep Novel View Synthesis from Colored 3D Point Clouds | Code | 1
Latent Normalizing Flows for Many-to-Many Cross-Domain Mappings | Code | 1
Dual-Domain Image Synthesis using Segmentation-Guided GAN | Code | 1
Latent Denoising Diffusion GAN: Faster sampling, Higher image quality | Code | 1
Aligning Text to Image in Diffusion Models is Easier Than You Think | Code | 1
CookGAN: Meal Image Synthesis from Ingredients | Code | 1
Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging | Code | 1
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models | Code | 1
Image Shape Manipulation from a Single Augmented Training Sample | Code | 1
Bipartite Graph Reasoning GANs for Person Image Generation | Code | 1
Deep Sketch-guided Cartoon Video Inbetweening | Code | 1
Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation | Code | 1
Fuzz-Testing Meets LLM-Based Agents: An Automated and Efficient Framework for Jailbreaking Text-To-Image Generation Models | Code | 1
Deep Unrolled Low-Rank Tensor Completion for High Dynamic Range Imaging | Code | 1
DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning | Code | 1
AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars | Code | 1
BIVA: A Very Deep Hierarchy of Latent Variables for Generative Modeling | Code | 1
AutoDiffusion: Training-Free Optimization of Time Steps and Architectures for Automated Diffusion Model Acceleration | Code | 1
CooGAN: A Memory-Efficient Framework for High-Resolution Facial Attribute Editing | Code | 1
Large Scale GAN Training for High Fidelity Natural Image Synthesis | Code | 1
BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models | Code | 1
Dual Contrastive Loss and Attention for GANs | Code | 1
Dynamic Neural Radiance Fields for Monocular 4D Facial Avatar Reconstruction | Code | 1
LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis | Code | 1
Artificial Fingerprinting for Generative Models: Rooting Deepfake Attribution in Training Data | Code | 1
GAN Path Finder: Preliminary results | Code | 1
An Image is Worth Multiple Words: Discovering Object Level Concepts using Multi-Concept Prompt Learning | Code | 1
GANs Can Play Lottery Tickets Too | Code | 1
Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and Synthesis | Code | 1
Blackout Diffusion: Generative Diffusion Models in Discrete-State Spaces | Code | 1
GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium | Code | 1
DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design | Code | 1
Drop the GAN: In Defense of Patches Nearest Neighbors as Single Image Generative Models | Code | 1
Dual Attention GANs for Semantic Image Synthesis | Code | 1
Dual-Diffusion: Dual Conditional Denoising Diffusion Probabilistic Models for Blind Super-Resolution Reconstruction in RSIs | Code | 1
LatentCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Improved DDPM | FID | 12.3 | | Unverified
2 | ADM | FID | 11.84 | | Unverified
3 | BigGAN-deep | FID | 8.1 | | Unverified
4 | Polarity-BigGAN | FID | 6.82 | | Unverified
5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified
6 | MaskGIT | FID | 6.18 | | Unverified
7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified
8 | CDM | FID | 4.88 | | Unverified
9 | ADM-G | FID | 4.59 | | Unverified
10 | RIN | FID | 4.51 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PresGAN | FID | 52.2 | | Unverified
2 | RESFLOW | FID | 48.29 | | Unverified
3 | Residual Flow | FID | 46.37 | | Unverified
4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified
5 | ProdPoly no activation functions | FID | 40.45 | | Unverified
6 | ProdPoly no activation functions | FID | 36.77 | | Unverified
7 | ACGAN | FID | 35.47 | | Unverified
8 | DenseFlow-74-10 | FID | 34.9 | | Unverified
9 | NVAE w/ flow | FID | 32.53 | | Unverified
10 | QSNGAN | FID | 31.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GLIDE + CLS | FID | 30.87 | | Unverified
2 | GLIDE + CLIP | FID | 30.46 | | Unverified
3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified
4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified
5 | PGMGAN | FID | 21.73 | | Unverified
6 | CLR-GAN | FID | 20.27 | | Unverified
7 | FM | FID | 14.45 | | Unverified
8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified
9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified
10 | GLIDE +CLS | KID | 7.95 | | Unverified
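
Most entries above report FID (Fréchet Inception Distance), where lower is better. FID compares the mean and covariance of feature vectors extracted from real and generated images: $\|\mu_r - \mu_g\|^2 + \mathrm{Tr}(\Sigma_r + \Sigma_g - 2(\Sigma_r \Sigma_g)^{1/2})$. The sketch below is a simplification, not the procedure behind the numbers in these tables: it assumes diagonal covariances (so the matrix square root reduces to per-dimension square roots) and takes the feature vectors as given rather than extracting them with an Inception network. The function names `feature_stats` and `fid_diagonal` are hypothetical.

```python
import math


def feature_stats(features):
    """Per-dimension mean and unbiased variance of equal-length feature vectors."""
    n = len(features)
    dim = len(features[0])
    means = [sum(f[d] for f in features) / n for d in range(dim)]
    variances = [
        sum((f[d] - means[d]) ** 2 for f in features) / (n - 1)
        for d in range(dim)
    ]
    return means, variances


def fid_diagonal(mu_r, var_r, mu_g, var_g):
    """Frechet distance between two Gaussians, ASSUMING diagonal covariances.

    Full FID uses complete covariance matrices and a matrix square root;
    with diagonal covariances the trace term reduces to a per-dimension sum.
    """
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_r, mu_g))
    cov_term = sum(a + b - 2.0 * math.sqrt(a * b) for a, b in zip(var_r, var_g))
    return mean_term + cov_term


# Toy 2-D "features"; real pipelines use high-dimensional network activations.
real_features = [[0.0, 0.0], [2.0, 2.0]]
generated_features = [[1.0, 1.0], [3.0, 3.0]]

mu_r, var_r = feature_stats(real_features)
mu_g, var_g = feature_stats(generated_features)
score = fid_diagonal(mu_r, var_r, mu_g, var_g)
```

In this toy case the two sets have identical variances and means that differ by 1 in each of the two dimensions, so only the mean term contributes to the score.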