SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 40014050 of 6689 papers

TitleStatusHype
Person image generation with semantic attention network for person re-identification0
Anatomy and Physiology of Artificial Intelligence in PET Imaging0
UniFL: Improve Latent Diffusion Model via Unified Feedback Learning0
Zero-Shot Object Segmentation through Concept Distillation from Generative Image Foundation Models0
An Analysis on Quantizing Diffusion Transformers0
An Analysis of Human Alignment of Latent Diffusion Models0
PhotoGAN: Generative Adversarial Neural Network Acceleration with Silicon Photonics0
Photographic Image Synthesis with Cascaded Refinement Networks0
Analyzing the Latent Space of GAN through Local Dimension Estimation0
Photorealistic Image Synthesis for Object Instance Detection0
Photo-realistic Neural Domain Randomization0
Unifying conditional and unconditional semantic image synthesis with OCO-GAN0
PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models0
Physically-Based Face Rendering for NIR-VIS Face Recognition0
Analyzing Quality, Bias, and Performance in Text-to-Image Generative Models0
Physics-Inspired Generative Models in Medical Imaging: A Review0
Physics Validation of Novel Convolutional 2D Architectures for Speeding Up High Energy Physics Simulations0
PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion0
Analyzing CLIP's Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study0
Picture that Sketch: Photorealistic Image Generation from Abstract Sketches0
PIDiff: Image Customization for Personalized Identities with Diffusion Models0
Analyzing and Improving Model Collapse in Rectified Flow Models0
3D-Aware Indoor Scene Synthesis with Depth Priors0
Pipeline Enabling Zero-shot Classification for Bangla Handwritten Grapheme0
Pipeline Generative Adversarial Networks for Facial Images Generation with Multiple Attributes0
Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer0
Pisces: An Auto-regressive Foundation Model for Image Understanding and Generation0
WikiArt Emotions: An Annotated Dataset of Emotions Evoked by Art0
PIV/BOS Synthetic Image Generation in Variable Density Environments for Error Analysis and Experiment Design0
Analytical Interpretation of Latent Codes in InfoGAN with SAR Images0
UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation0
Analysing Diffusion Segmentation for Medical Images0
An Adversarial Learning Approach to Medical Image Synthesis for Lesion Detection0
"An Adapt-or-Die Type of Situation": Perception, Adoption, and Use of Text-To-Image-Generation AI by Game Industry Professionals0
Adaptive Gradient Regularization: A Faster and Generalizable Optimization Technique for Deep Neural Networks0
UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis0
PixelCNN Models with Auxiliary Variables for Natural Image Modeling0
An Acceleration Framework for High Resolution Image Synthesis0
3D-aware Image Generation using 2D Diffusion Models0
A Mutual Information Perspective on Multiple Latent Variable Generative Models for Positive View Generation0
Pixel Is Not A Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models0
Pixel-Level BPE for Auto-Regressive Image Generation0
Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images0
PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation0
Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction0
UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation0
Pixel-Space Post-Training of Latent Diffusion Models0
UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion0
PixelVAE++: Improved PixelVAE with Discrete Prior0
Pixel-wise Conditioned Generative Adversarial Networks for Image Synthesis and Completion0
Show:102550
← PrevPage 81 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified