SOTAVerified

Image Generation

Image generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

  • Unconditional generation refers to sampling images from the data distribution without any conditioning input, i.e. $p(y)$.
  • Conditional image generation (subtask) refers to sampling images conditioned on an input such as a label $x$, i.e. $p(y|x)$.
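
The difference between the two settings can be illustrated with a toy sketch. It is not a real image model: a 1-D Gaussian per label stands in for $p(y|x)$, and the names `CLASS_MEANS`, `CLASS_PRIOR`, `sample_conditional`, and `sample_unconditional` are hypothetical, chosen only for this illustration. Unconditional sampling is shown as drawing a label from an assumed prior and then discarding it, so that $p(y) = \sum_x p(x)\,p(y|x)$.

```python
import random

# Hypothetical toy setup: each label x has a 1-D Gaussian standing in for
# the image distribution p(y | x). Real models produce images, not scalars.
CLASS_MEANS = {"cat": -2.0, "dog": 2.0}
CLASS_PRIOR = {"cat": 0.5, "dog": 0.5}  # assumed label prior p(x)
STD = 0.5


def sample_conditional(label, rng):
    """Draw y ~ p(y | x = label)."""
    return rng.gauss(CLASS_MEANS[label], STD)


def sample_unconditional(rng):
    """Draw y ~ p(y) = sum_x p(x) p(y | x): pick a label, then drop it."""
    labels = list(CLASS_PRIOR)
    weights = [CLASS_PRIOR[name] for name in labels]
    label = rng.choices(labels, weights=weights, k=1)[0]
    return sample_conditional(label, rng)


rng = random.Random(0)
cond = [sample_conditional("dog", rng) for _ in range(1000)]
uncond = [sample_unconditional(rng) for _ in range(1000)]

# Conditional samples cluster around the "dog" mean; unconditional samples
# are spread over both classes, so their average sits between the two means.
cond_mean = sum(cond) / len(cond)
uncond_mean = sum(uncond) / len(uncond)
```

In this sketch the conditional samples stay near the mean of the requested label, while the unconditional samples cover every mode of the dataset.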

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.


Papers

Showing 3551–3575 of 6689 papers

| Title | Status | Hype |
|---|---|---|
| Zero-shot Text-guided Infinite Image Synthesis with LLM guidance | | 0 |
| I2AM: Interpreting Image-to-Image Latent Diffusion Models via Attribution Maps | | 0 |
| Towards Understanding Unsafe Video Generation | Code | 0 |
| Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs | Code | 0 |
| Flatfish Disease Detection Based on Part Segmentation Approach and Disease Image Generation | | 0 |
| How Control Information Influences Multilingual Text Image Generation and Editing? | Code | 0 |
| Subject-driven Text-to-Image Generation via Preference-based Reinforcement Learning | Code | 0 |
| Beta Sampling is All You Need: Efficient Image Generation Strategy for Diffusion Models using Stepwise Spectral Analysis | | 0 |
| DiNO-Diffusion. Scaling Medical Diffusion via Self-Supervised Pre-Training | | 0 |
| Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems | Code | 0 |
| Efficient Training with Denoised Neural Weights | | 0 |
| Mask-guided cross-image attention for zero-shot in-silico histopathologic image generation with a diffusion model | | 0 |
| Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis | | 0 |
| OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting | | 0 |
| Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | | 0 |
| Optical Diffusion Models for Image Generation | | 0 |
| Physics-Inspired Generative Models in Medical Imaging: A Review | | 0 |
| A Survey of Defenses against AI-generated Visual Media: Detection, Disruption, and Authentication | | 0 |
| ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline Context | Code | 0 |
| Layout-and-Retouch: A Dual-stage Framework for Improving Diversity in Personalized Image Generation | | 0 |
| FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-3 | | 0 |
| Machine Apophenia: The Kaleidoscopic Generation of Architectural Images | | 0 |
| Surgical Text-to-Image Generation | | 0 |
| E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors | | 0 |
| Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets | Code | 0 |

Benchmark Results

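| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Improved DDPM | FID | 12.3 | | Unverified |
| 2 | ADM | FID | 11.84 | | Unverified |
| 3 | BigGAN-deep | FID | 8.1 | | Unverified |
| 4 | Polarity-BigGAN | FID | 6.82 | | Unverified |
| 5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified |
| 6 | MaskGIT | FID | 6.18 | | Unverified |
| 7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified |
| 8 | CDM | FID | 4.88 | | Unverified |
| 9 | ADM-G | FID | 4.59 | | Unverified |
| 10 | RIN | FID | 4.51 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PresGAN | FID | 52.2 | | Unverified |
| 2 | RESFLOW | FID | 48.29 | | Unverified |
| 3 | Residual Flow | FID | 46.37 | | Unverified |
| 4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified |
| 5 | ProdPoly no activation functions | FID | 40.45 | | Unverified |
| 6 | ProdPoly no activation functions | FID | 36.77 | | Unverified |
| 7 | ACGAN | FID | 35.47 | | Unverified |
| 8 | DenseFlow-74-10 | FID | 34.9 | | Unverified |
| 9 | NVAE w/ flow | FID | 32.53 | | Unverified |
| 10 | QSNGAN | FID | 31.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GLIDE + CLS | FID | 30.87 | | Unverified |
| 2 | GLIDE + CLIP | FID | 30.46 | | Unverified |
| 3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified |
| 4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified |
| 5 | PGMGAN | FID | 21.73 | | Unverified |
| 6 | CLR-GAN | FID | 20.27 | | Unverified |
| 7 | FM | FID | 14.45 | | Unverified |
| 8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified |
| 9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified |
| 10 | GLIDE +CLS | KID | 7.95 | | Unverified |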
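
Most entries above report FID (Fréchet Inception Distance), where lower is better. FID fits a Gaussian to Inception-network features of real images and another to features of generated images, then measures the Fréchet distance between them: $\|\mu_r - \mu_g\|^2 + \mathrm{Tr}\big(\Sigma_r + \Sigma_g - 2(\Sigma_r \Sigma_g)^{1/2}\big)$. The sketch below shows only the distance formula under a simplifying assumption of diagonal covariances, for which the trace term reduces to a per-dimension sum. It does not extract Inception features, and it will not reproduce the leaderboard numbers. The function name `frechet_distance_diag` is hypothetical.

```python
import math


def frechet_distance_diag(mu1, var1, mu2, var2):
    """Fréchet distance between two Gaussians with DIAGONAL covariances.

    Simplification for illustration: real FID uses full covariance matrices
    estimated from Inception features and needs a matrix square root.
    """
    # Squared Euclidean distance between the two mean vectors.
    mean_term = sum((a - b) ** 2 for a, b in zip(mu1, mu2))
    # For diagonal covariances, Tr(S1 + S2 - 2 (S1 S2)^{1/2}) is a
    # per-dimension sum over the variances.
    cov_term = sum(
        v1 + v2 - 2.0 * math.sqrt(v1 * v2) for v1, v2 in zip(var1, var2)
    )
    return mean_term + cov_term


# Identical statistics give a distance of zero.
same = frechet_distance_diag([0.0, 1.0], [1.0, 2.0], [0.0, 1.0], [1.0, 2.0])

# Shifting one mean by 1 and changing one variance from 1 to 4 adds
# 1 (mean term) + 1 (covariance term) = 2.
shifted = frechet_distance_diag([0.0, 1.0], [1.0, 2.0], [1.0, 1.0], [4.0, 2.0])
```

The distance is zero only when both the means and the variances match, which is why a lower FID indicates generated images whose feature statistics are closer to those of the real data.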