SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 47014750 of 6689 papers

TitleStatusHype
Elucidating the Design Space of Diffusion-Based Generative ModelsCode4
Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and ReasoningCode1
Improved Vector Quantized Diffusion ModelsCode2
MontageGAN: Generation and Assembly of Multiple Components by GANsCode0
From Keypoints to Object Landmarks via Self-Training Correspondence: A novel approach to Unsupervised Landmark DiscoveryCode0
Text2Human: Text-Driven Controllable Human Image GenerationCode2
IDE-3D: Interactive Disentangled Editing for High-Resolution 3D-aware Portrait SynthesisCode2
Maximum Likelihood Training of Implicit Nonlinear Diffusion ModelsCode1
Learning with Stochastic OrdersCode0
Analytical Interpretation of Latent Codes in InfoGAN with SAR Images0
Analyzing the Latent Space of GAN through Local Dimension Estimation0
Mutual Information Divergence: A Unified Metric for Multimodal Generative ModelsCode1
Structured Uncertainty in the Observation Space of Variational AutoencodersCode0
Accelerating Diffusion Models via Early Stop of the Diffusion ProcessCode1
Text-to-Face Generation with StyleGAN20
Pre-trained Perceptual Features Improve Differentially Private Image GenerationCode0
Unsupervised Misaligned Infrared and Visible Image Fusion via Cross-Modality Image Generation and RegistrationCode2
EBM Life Cycle: MCMC Strategies for Synthesis, Defense, and Density ModelingCode0
M6-Fashion: High-Fidelity Multi-modal Image Generation and Editing0
Improving Human Image Synthesis with Residual Fast Fourier Transformation and Wasserstein Distance0
HessianFR: An Efficient Hessian-based Follow-the-Ridge Algorithm for Minimax Optimization0
GR-GAN: Gradual Refinement Text-to-image GenerationCode0
A SSIM Guided cGAN Architecture For Clinically Driven Generative Image Synthesis of Multiplexed Spatial Proteomics Channels0
Generation of Artificial CT Images using Patch-based Conditional Generative Adversarial NetworksCode0
Covid-19 chest x-ray image generation using resnet-dcgan modelCode0
Computerized Tomography Pulmonary Angiography Image Simulation using Cycle Generative Adversarial Network from Chest CT imaging in Pulmonary Embolism Patients0
BBDM: Image-to-image Translation with Brownian Bridge Diffusion ModelsCode2
Conditional Vector Graphics Generation for Music Cover ImagesCode1
StyLandGAN: A StyleGAN based Landscape Image Synthesis using Depth-map0
The Effectiveness of Temporal Dependency in Deepfake Video Detection0
D3T-GAN: Data-Dependent Domain Transfer GANs for Few-shot Image Generation0
Ray Priors through Reprojection: Improving Neural Radiance Fields for Novel View Extrapolation0
Transformer-based Cross-Modal Recipe Embeddings with Large Batch Training0
A Closer Look at Few-shot Image Generation0
On Conditioning the Input Noise for Controlled Image Generation with Diffusion Models0
CCMB: A Large-scale Chinese Cross-modal BenchmarkCode1
High-Resolution UAV Image Generation for Sorghum Panicle Detection0
GAN-Based Multi-View Video Coding with Spatio-Temporal EPI Reconstruction0
Text to artistic image generationCode0
DeepPortraitDrawing: Generating Human Body Images from Freehand Sketches0
BiOcularGAN: Bimodal Synthesis and Annotation of Ocular ImagesCode0
Subspace Diffusion Generative ModelsCode1
A Comprehensive Survey of Image Augmentation Techniques for Deep Learning0
DANBO: Disentangled Articulated Neural Body Representations via Graph Neural Networks0
Seeding Diversity into AI Art0
Augmentation Techniques Analysis with Removal of Class Imbalance Using PyTorch for Intel Scene Dataset0
Augmented Balanced Image Dataset Generator Using AugStatic LibraryCode0
Improving Model Performance and Removing the Class Imbalance Problem Using Augmentation0
AugStatic - A Light-Weight Image Augmentation LibraryCode0
Deep PCB To COCO ConvertorCode2
Show:102550
← PrevPage 95 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified