SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 50515100 of 6689 papers

TitleStatusHype
CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models0
CustomSketching: Sketch Concept Extraction for Sketch-based Image Synthesis and Editing0
CustomText: Customized Textual Image Generation using Diffusion Models0
CustomVideoX: 3D Reference Attention Driven Dynamic Adaptation for Zero-Shot Customized Video Diffusion Transformers0
Anomaly Anything: Promptable Unseen Visual Anomaly Generation0
Cut and Continuous Paste towards Real-time Deep Fall Detection0
View Extrapolation of Human Body from a Single Image0
Cycle-Consistent Inverse GAN for Text-to-Image Synthesis0
CycleGAN Models for MRI Image Translation0
CycleGAN without checkerboard artifacts for counter-forensics of fake-image detection0
Cycle Generative Adversarial Networks Algorithm With Style Transfer For Image Generation0
Stochastic Conditional Generative Networks with Basis Decomposition0
IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts0
StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces0
Stomach 3D Reconstruction Based on Virtual Chromoendoscopic Image Generation0
"Stones from Other Hills can Polish Jade": Zero-shot Anomaly Image Synthesis via Cross-domain Anomaly Injection0
D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens0
A Deep and Tractable Density Estimator0
D3T-GAN: Data-Dependent Domain Transfer GANs for Few-shot Image Generation0
Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling0
DAFT-GAN: Dual Affine Transformation Generative Adversarial Network for Text-Guided Image Inpainting0
DALL-E for Detection: Language-driven Compositional Image Synthesis for Object Detection0
DAM-GAN : Image Inpainting using Dynamic Attention Map based on Fake Texture Detection0
DANBO: Disentangled Articulated Neural Body Representations via Graph Neural Networks0
DANCE: Deep Learning-Assisted Analysis of Protein Sequences Using Chaos Enhanced Kaleidoscopic Images0
DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation0
Data-Agnostic Face Image Synthesis Detection Using Bayesian CNNs0
Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval0
Data Augmentation using Feature Generation for Volumetric Medical Images0
CapHDR2IR: Caption-Driven Transfer from Visible Light to Infrared Domain0
Data-Driven but Privacy-Conscious: Pedestrian Dataset De-identification via Full-Body Person Synthesis0
Photoacoustic image synthesis with generative adversarial networks0
Storynizor: Consistent Story Generation via Inter-Frame Synchronized and Shuffled ID Injection0
Data Forensics in Diffusion Models: A Systematic Analysis of Membership Privacy0
Data-Free Group-Wise Fully Quantized Winograd Convolution via Learnable Scales0
Data Instance Prior for Transfer Learning in GANs0
Data InStance Prior (DISP) in Generative Adversarial Networks0
Data Priming Network for Automatic Check-Out0
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation0
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer0
DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models0
DDIM-Driven Coverless Steganography Scheme with Real Key0
StraIT: Non-autoregressive Generation with Stratified Image Transformer0
DDMM-Synth: A Denoising Diffusion Model for Cross-modal Medical Image Synthesis with Sparse-view Measurement Embedding0
DDPM based X-ray Image Synthesizer0
DDRF: Denoising Diffusion Model for Remote Sensing Image Fusion0
Can We Use Diffusion Probabilistic Models for 3D Motion Prediction?0
DEANet: Decomposition Enhancement and Adjustment Network for Low-Light Image Enhancement0
Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models0
Debiasing Counterfactuals In the Presence of Spurious Correlations0
Show:102550
← PrevPage 102 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified