SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to sampling images from the dataset distribution without any conditioning input, i.e. from $p(y)$.
  • Conditional image generation (subtask) refers to sampling images conditioned on a label $x$, i.e. from $p(y|x)$.
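The distinction between $p(y)$ and $p(y|x)$ can be illustrated with a toy sketch. Everything here is hypothetical: the `DATASET` of (label, image) pairs, the string placeholders standing in for images, and the two helper functions are illustrative only. An empirical distribution over the dataset stands in for a trained generative model, which would learn to produce new images rather than resample existing ones.

```python
import random

# Hypothetical toy dataset of (label x, image y) pairs.
# Strings stand in for actual image arrays.
DATASET = [
    ("cat", "cat_img_0"),
    ("cat", "cat_img_1"),
    ("dog", "dog_img_0"),
    ("dog", "dog_img_1"),
    ("dog", "dog_img_2"),
]


def sample_unconditional(rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, image = rng.choice(DATASET)
    return image


def sample_conditional(label, rng):
    """Draw y ~ p(y | x = label): only images carrying that label are eligible."""
    candidates = [image for x, image in DATASET if x == label]
    if not candidates:
        raise ValueError(f"no images with label {label!r}")
    return rng.choice(candidates)


rng = random.Random(0)  # seeded so runs are repeatable
unconditional = [sample_unconditional(rng) for _ in range(5)]
conditional = [sample_conditional("cat", rng) for _ in range(5)]
```

In this sketch, `unconditional` may contain images of either class, while every element of `conditional` is a cat image.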

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 2451–2475 of 6689 papers

| Title | Status | Hype |
|---|---|---|
| Denoising and Regularization via Exploiting the Structural Bias of Convolutional Generators | Code | 0 |
| Modeling Emotions and Ethics with Large Language Models | Code | 0 |
| Modular StoryGAN with Background and Theme Awareness for Story Visualization | Code | 0 |
| More Expressive Attention with Negative Weights | Code | 0 |
| Mobile Edge Generation-Enabled Digital Twin: Architecture Design and Research Opportunities | Code | 0 |
| DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data | Code | 0 |
| BLADERUNNER: Rapid Countermeasure for Synthetic (AI-Generated) StyleGAN Faces | Code | 0 |
| MMaDA: Multimodal Large Diffusion Language Models | Code | 0 |
| Deformation equivariant cross-modality image synthesis with paired non-aligned training data | Code | 0 |
| Mixture-of-Subspaces in Low-Rank Adaptation | Code | 0 |
| Deformable GANs for Pose-based Human Image Generation | Code | 0 |
| MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation | Code | 0 |
| DEff-GAN: Diverse Attribute Transfer for Few-Shot Image Synthesis | Code | 0 |
| Deferred Neural Rendering: Image Synthesis using Neural Textures | Code | 0 |
| Black-Box Diagnosis and Calibration on GAN Intra-Mode Collapse: A Pilot Study | Code | 0 |
| Model-based Cleaning of the QUILT-1M Pathology Dataset for Text-Conditional Image Synthesis | Code | 0 |
| Defending Neural Backdoors via Generative Distribution Modeling | Code | 0 |
| MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation | Code | 0 |
| MintNet: Building Invertible Neural Networks with Masked Convolutions | Code | 0 |
| MirrorGAN: Learning Text-to-image Generation by Redescription | Code | 0 |
| MISFIT-V: Misaligned Image Synthesis and Fusion using Information from Thermal and Visual | Code | 0 |
| MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO | Code | 0 |
| MIGS: Meta Image Generation from Scene Graphs | Code | 0 |
| Multi-Agent Multimodal Models for Multicultural Text to Image Generation | Code | 0 |
| Deep residual inception encoder–decoder network for medical imaging synthesis | Code | 0 |

Benchmark Results

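| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Improved DDPM | FID | 12.3 | | Unverified |
| 2 | ADM | FID | 11.84 | | Unverified |
| 3 | BigGAN-deep | FID | 8.1 | | Unverified |
| 4 | Polarity-BigGAN | FID | 6.82 | | Unverified |
| 5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified |
| 6 | MaskGIT | FID | 6.18 | | Unverified |
| 7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified |
| 8 | CDM | FID | 4.88 | | Unverified |
| 9 | ADM-G | FID | 4.59 | | Unverified |
| 10 | RIN | FID | 4.51 | | Unverified |
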
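| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PresGAN | FID | 52.2 | | Unverified |
| 2 | RESFLOW | FID | 48.29 | | Unverified |
| 3 | Residual Flow | FID | 46.37 | | Unverified |
| 4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified |
| 5 | ProdPoly no activation functions | FID | 40.45 | | Unverified |
| 6 | ProdPoly no activation functions | FID | 36.77 | | Unverified |
| 7 | ACGAN | FID | 35.47 | | Unverified |
| 8 | DenseFlow-74-10 | FID | 34.9 | | Unverified |
| 9 | NVAE w/ flow | FID | 32.53 | | Unverified |
| 10 | QSNGAN | FID | 31.97 | | Unverified |
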
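| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GLIDE + CLS | FID | 30.87 | | Unverified |
| 2 | GLIDE + CLIP | FID | 30.46 | | Unverified |
| 3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified |
| 4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified |
| 5 | PGMGAN | FID | 21.73 | | Unverified |
| 6 | CLR-GAN | FID | 20.27 | | Unverified |
| 7 | FM | FID | 14.45 | | Unverified |
| 8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified |
| 9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified |
| 10 | GLIDE +CLS | KID | 7.95 | | Unverified |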
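
Most rows above report FID (Fréchet Inception Distance), for which lower is better. FID fits a Gaussian to Inception-network features of real images and another to features of generated images, then measures the Fréchet distance between them: $\lVert \mu_r - \mu_g \rVert^2 + \mathrm{Tr}\left(\Sigma_r + \Sigma_g - 2(\Sigma_r \Sigma_g)^{1/2}\right)$. The sketch below is a deliberate simplification, not the full metric: it assumes diagonal covariances (so no matrix square root is needed) and takes precomputed feature vectors as plain lists. The function names are hypothetical, and no Inception network is involved.

```python
import math


def mean_and_variance(features):
    """Per-dimension mean and population variance of a list of feature vectors."""
    n = len(features)
    dim = len(features[0])
    means = [sum(f[d] for f in features) / n for d in range(dim)]
    variances = [
        sum((f[d] - means[d]) ** 2 for f in features) / n for d in range(dim)
    ]
    return means, variances


def frechet_distance_diagonal(real_features, generated_features):
    """Frechet distance between two Gaussians, ASSUMING diagonal covariances.

    With diagonal covariances the trace term reduces to a per-dimension sum
    of var_r + var_g - 2 * sqrt(var_r * var_g).
    """
    mu_r, var_r = mean_and_variance(real_features)
    mu_g, var_g = mean_and_variance(generated_features)
    mean_term = sum((a - b) ** 2 for a, b in zip(mu_r, mu_g))
    cov_term = sum(
        vr + vg - 2.0 * math.sqrt(vr * vg) for vr, vg in zip(var_r, var_g)
    )
    return mean_term + cov_term


# Toy 2-D "features": the generated set is the real set shifted by 1 in dim 0.
real = [[0.0, 0.0], [2.0, 2.0]]
generated = [[1.0, 0.0], [3.0, 2.0]]
distance = frechet_distance_diagonal(real, generated)
```

Here the two sets have identical per-dimension variances and means that differ by 1 in a single dimension, so only the mean term contributes to `distance`. The full metric uses complete covariance matrices estimated from Inception features.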