SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 46014650 of 6689 papers

TitleStatusHype
Separable Multi-Concept Erasure from Diffusion Models0
Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry0
Sequential Attention GAN for Interactive Image Editing0
Sequential Posterior Sampling with Diffusion Models0
Sequential-Scanning Dual-Energy CT Imaging Using High Temporal Resolution Image Reconstruction and Error-Compensated Material Basis Image Generation0
Sequential Semantic Generative Communication for Progressive Text-to-Image Generation0
SerialGen: Personalized Image Generation by First Standardization Then Personalization0
VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models0
se-Shweshwe Inspired Fashion Generation0
SETGAN: Scale and Energy Trade-off GANs for Image Applications on Mobile Platforms0
VectorFusion: Text-to-SVG by Abstracting Pixel-Based Diffusion Models0
SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance0
Vector Graphics Generation via Mutually Impulsed Dual-domain Diffusion0
SGM-Net: Semantic Guided Matting Net0
Unsupervised Image Decomposition in Vector Layers0
Vector Learning for Cross Domain Representations0
Shape-conditioned Image Generation by Learning Latent Appearance Representation from Unpaired Data0
Shape-Preserving Generation of Food Images for Automatic Dietary Assessment0
Shapes and Context: In-the-Wild Image Synthesis & Manipulation0
ShapeWords: Guiding Text-to-Image Synthesis with 3D Shape-Aware Prompts0
Sharp-GAN: Sharpness Loss Regularized GAN for Histopathology Image Synthesis0
ShieldGemma 2: Robust and Tractable Image Content Moderation0
ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories0
AnonymousNet: Natural Face De-Identification with Measurable Privacy0
ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model0
Adversarial Domain Prompt Tuning and Generation for Single Domain Generalization0
SHYI: Action Support for Contrastive Learning in High-Fidelity Text-to-Image Generation0
SIAN: Style-Guided Instance-Adaptive Normalization for Multi-Organ Histopathology Image Synthesis0
Adversarial cycle-consistent synthesis of cerebral microbleeds for data augmentation0
SideGAN: 3D-Aware Generative Model for Improved Side-View Image Synthesis0
SIDGAN: High-Resolution Dubbed Video Generation via Shift-Invariant Learning0
SIDME: Self-supervised Image Demoiréing via Masked Encoder-Decoder Reconstruction0
SIGAN: A Novel Image Generation Method for Solar Cell Defect Segmentation and Augmentation0
SILMM: Self-Improving Large Multimodal Models for Compositional Text-to-Image Generation0
Yo'Chameleon: Personalized Vision and Language Generation0
SimCVD: Simple Contrastive Voxel-Wise Representation Distillation for Semi-Supervised Medical Image Segmentation0
Similarity and Quality Metrics for MR Image-To-Image Translation0
SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing0
Adversarial Code Learning for Image Generation0
Adversarial Audio Super-Resolution with Unsupervised Feature Losses0
Simple Drop-in LoRA Conditioning on Attention Layers Will Improve Your Diffusion Model0
Simpler Diffusion: 1.5 FID on ImageNet512 with Pixel-space Diffusion0
Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion0
Simple Transparent Adversarial Examples0
Simple U-net Based Synthetic Polyp Image Generation: Polyp to Negative and Negative to Polyp0
Simplex Autoencoders0
Simplifying Models with Unlabeled Output Data0
Simulation of an Elevator Group Control Using Generative Adversarial Networks and Related AI Tools0
Simultaneous Multiple-Prompt Guided Generation Using Differentiable Optimal Transport0
Simultaneous Super-Resolution and Cross-Modality Synthesis of 3D Medical Images using Weakly-Supervised Joint Convolutional Sparse Coding0
Show:102550
← PrevPage 93 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified