SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation draws samples from the data distribution without any conditioning input, i.e. $p(y)$.
  • Conditional image generation (subtask) draws samples conditioned on additional information such as a label $x$, i.e. $p(y|x)$.
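The difference between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. It assumes a toy dataset of labeled scalars standing in for images. The names `DATASET`, `sample_unconditional`, and `sample_conditional` are hypothetical and exist only for illustration. A real generative model would learn the distribution rather than resample stored data.

```python
import random

# Hypothetical toy "dataset" of (label x, sample y) pairs.
# In real image generation, y would be an image, not a float.
DATASET = [
    ("cat", 0.9), ("cat", 1.1), ("cat", 1.0),
    ("dog", 4.8), ("dog", 5.2),
]


def sample_unconditional(rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, y = rng.choice(DATASET)
    return y


def sample_conditional(rng, x):
    """Draw y ~ p(y | x): only samples whose label equals x are eligible."""
    candidates = [y for label, y in DATASET if label == x]
    if not candidates:
        raise ValueError(f"no samples with label {x!r}")
    return rng.choice(candidates)


rng = random.Random(0)  # fixed seed so runs are repeatable
unconditional = [sample_unconditional(rng) for _ in range(1000)]
conditional = [sample_conditional(rng, "dog") for _ in range(1000)]
```

Here `unconditional` mixes values from both labels, while `conditional` contains only values that were stored under the label `"dog"`.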

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 4951-5000 of 6689 papers

| Title | Status | Hype |
| --- | --- | --- |
| Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model | | 0 |
| ParaGAN: A Scalable Distributed Training Framework for Generative Adversarial Networks | | 0 |
| Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields | | 0 |
| Parallel Multiscale Autoregressive Density Estimation | | 0 |
| Parallel Optimal Transport GAN | | 0 |
| Parallel Recurrent Data Augmentation for GAN training with Limited and Diverse Data | | 0 |
| Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection | | 0 |
| Parallel Scheduled Sampling | | 0 |
| Parallel Sequence Modeling via Generalized Spatial Propagation Network | | 0 |
| Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity | | 0 |
| Parametric Implicit Face Representation for Audio-Driven Facial Reenactment | | 0 |
| Parasite: A Steganography-based Backdoor Attack Framework for Diffusion Models | | 0 |
| PARASOL: Parametric Style Control for Diffusion Image Synthesis | | 0 |
| PaReNeRF: Toward Fast Large-scale Dynamic NeRF with Patch-based Reference | | 0 |
| PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models | | 0 |
| Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation | | 0 |
| PartDiff: Image Super-resolution with Partial Diffusion Models | | 0 |
| Partially Conditioned Generative Adversarial Networks | | 0 |
| Partially Conditioned Patch Parallelism for Accelerated Diffusion Model Inference | | 0 |
| Unified framework for diffusion generative models in SO(3): applications in computer vision and astrophysics | | 0 |
| Unified High-binding Watermark for Unconditional Image Generation Models | | 0 |
| PassFlow: Guessing Passwords with Generative Flows | | 0 |
| PASTA-GAN++: A Versatile Framework for High-Resolution Unpaired Virtual Try-on | | 0 |
| Paste, Inpaint and Harmonize via Denoising: Subject-Driven Image Editing with Pre-Trained Diffusion Model | | 0 |
| Patch Correspondences for Interpreting Pixel-level CNNs | | 0 |
| 3DGen-Bench: Comprehensive Benchmark Suite for 3D Generative Models | | 0 |
| Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action | | 0 |
| Patch-enhanced Mask Encoder Prompt Image Generation | | 0 |
| Patch-GAN Transfer Learning with Reconstructive Models for Cloud Removal | | 0 |
| Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception | | 0 |
| Patchwise Generative ConvNet: Training Energy-Based Models From a Single Natural Image for Internal Learning | | 0 |
| PATE-TripleGAN: Privacy-Preserving Image Synthesis with Gaussian Differential Privacy | | 0 |
| Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks | | 0 |
| Pathological Retinal Region Segmentation From OCT Images Using Geometric Relation Based Augmentation | | 0 |
| 3DGEN: A GAN-based approach for generating novel 3D models from image data | | 0 |
| Pattern Detection in the Activation Space for Identifying Synthesized Content | | 0 |
| PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models | | 0 |
| 3DGAUnet: 3D generative adversarial networks with a 3D U-Net based generator to achieve the accurate and effective synthesis of clinical tumor image data for pancreatic cancer | | 0 |
| PeaceGAN: A GAN-based Multi-Task Learning Method for SAR Target Image Generation with a Pose Estimator and an Auxiliary Classifier | | 0 |
| Unified Multi-Modal Image Synthesis for Missing Modality Imputation | | 0 |
| DuelGAN: A Duel Between Two Discriminators Stabilizes the GAN Training | | 0 |
| Peer is Your Pillar: A Data-unbalanced Conditional GANs for Few-shot Image Generation | | 0 |
| Unified Multi-Modal Latent Diffusion for Joint Subject and Text Conditional Image Generation | | 0 |
| PerceptionGAN: Real-world Image Construction from Provided Text through Perceptual Understanding | | 0 |
| Perceptions and Realities of Text-to-Image Generation | | 0 |
| Perceptual Artifacts Localization for Image Synthesis Tasks | | 0 |
| Performance Evaluation of Deep Generative Models for Generating Hand-Written Character Images | | 0 |
| Per Garment Capture and Synthesis for Real-time Virtual Try-on | | 0 |
| An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation | | 0 |
| PersonaCraft: Personalized Full-Body Image Synthesis for Multiple Identities from Single References Using 3D-Model-Conditioned Diffusion | | 0 |
Page 100 of 134

Benchmark Results

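| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Improved DDPM | FID | 12.3 | | Unverified |
| 2 | ADM | FID | 11.84 | | Unverified |
| 3 | BigGAN-deep | FID | 8.1 | | Unverified |
| 4 | Polarity-BigGAN | FID | 6.82 | | Unverified |
| 5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified |
| 6 | MaskGIT | FID | 6.18 | | Unverified |
| 7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified |
| 8 | CDM | FID | 4.88 | | Unverified |
| 9 | ADM-G | FID | 4.59 | | Unverified |
| 10 | RIN | FID | 4.51 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PresGAN | FID | 52.2 | | Unverified |
| 2 | RESFLOW | FID | 48.29 | | Unverified |
| 3 | Residual Flow | FID | 46.37 | | Unverified |
| 4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified |
| 5 | ProdPoly no activation functions | FID | 40.45 | | Unverified |
| 6 | ProdPoly no activation functions | FID | 36.77 | | Unverified |
| 7 | ACGAN | FID | 35.47 | | Unverified |
| 8 | DenseFlow-74-10 | FID | 34.9 | | Unverified |
| 9 | NVAE w/ flow | FID | 32.53 | | Unverified |
| 10 | QSNGAN | FID | 31.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GLIDE + CLS | FID | 30.87 | | Unverified |
| 2 | GLIDE + CLIP | FID | 30.46 | | Unverified |
| 3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified |
| 4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified |
| 5 | PGMGAN | FID | 21.73 | | Unverified |
| 6 | CLR-GAN | FID | 20.27 | | Unverified |
| 7 | FM | FID | 14.45 | | Unverified |
| 8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified |
| 9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified |
| 10 | GLIDE +CLS | KID | 7.95 | | Unverified |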
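
Most entries in these leaderboards report FID (Fréchet Inception Distance), where lower is better. FID is the Fréchet distance between two Gaussians fitted to feature statistics of real and generated images: $\lVert \mu_r - \mu_g \rVert^2 + \mathrm{Tr}(\Sigma_r + \Sigma_g - 2(\Sigma_r \Sigma_g)^{1/2})$. The sketch below shows only the one-dimensional special case, $(\mu_r - \mu_g)^2 + (\sigma_r - \sigma_g)^2$, and assumes scalar features in place of Inception network features. The function name `frechet_distance_1d` is hypothetical. It illustrates the formula and is not the evaluation code behind the numbers listed here.

```python
import statistics


def frechet_distance_1d(real, generated):
    """Squared Frechet distance between 1-D Gaussians fitted to two samples.

    Assumption: each sample is a list of scalar features. Actual FID uses
    high-dimensional Inception features and full covariance matrices.
    """
    mean_r = statistics.fmean(real)
    mean_g = statistics.fmean(generated)
    std_r = statistics.pstdev(real)
    std_g = statistics.pstdev(generated)
    return (mean_r - mean_g) ** 2 + (std_r - std_g) ** 2


real_features = [0.0, 1.0, 2.0]
shifted_features = [1.0, 2.0, 3.0]  # same spread, mean shifted by 1

same = frechet_distance_1d(real_features, real_features)
shifted = frechet_distance_1d(real_features, shifted_features)
```

Identical feature sets give a distance of zero, and the distance grows as the mean or spread of the generated features drifts away from the real ones.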