SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to sampling from the data distribution without any conditioning input, i.e. from $p(y)$.
  • Conditional image generation (subtask) refers to sampling from the data distribution conditioned on a label $x$, i.e. from $p(y|x)$.
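
The difference between the two settings can be illustrated with a minimal sketch. It assumes a hypothetical toy dataset of (label, sample) pairs standing in for labelled images, and it resamples the dataset directly in place of a learned generative model. The names `DATASET`, `sample_unconditional`, and `sample_conditional` are illustrative only.

```python
import random

# Hypothetical toy dataset: (label x, sample y) pairs standing in for labelled images.
DATASET = [
    ("cat", "cat_img_0"),
    ("cat", "cat_img_1"),
    ("dog", "dog_img_0"),
    ("dog", "dog_img_1"),
    ("dog", "dog_img_2"),
]


def sample_unconditional(rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, y = rng.choice(DATASET)
    return y


def sample_conditional(rng, x):
    """Draw y ~ p(y | x): only samples whose label equals x are eligible."""
    candidates = [y for label, y in DATASET if label == x]
    if not candidates:
        raise ValueError(f"no samples with label {x!r}")
    return rng.choice(candidates)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs draw the same samples
    print("unconditional:", sample_unconditional(rng))
    print("conditional on 'cat':", sample_conditional(rng, "cat"))
```

A real model would learn an approximation of these distributions and produce new images rather than returning stored ones; the sketch only shows what the conditioning input changes.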

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 3101-3150 of 6689 papers

| Title | Status | Hype |
| --- | --- | --- |
| GLoD: Composing Global Contexts and Local Details in Image Generation | | 0 |
| GLocal: Global Graph Reasoning and Local Structure Transfer for Person Image Generation | | 0 |
| Decoupling Training-Free Guided Diffusion by ADMM | | 0 |
| Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation | | 0 |
| Global-Local Image Perceptual Score (GLIPS): Evaluating Photorealistic Quality of AI-Generated Images | | 0 |
| Global Context with Discrete Diffusion in Vector Quantised Modelling for Image Generation | | 0 |
| Semantically Consistent Person Image Generation | | 0 |
| Illustrious: an Open Advanced Illustration Model | | 0 |
| A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images using a GAN | | 0 |
| GL-GAN: Adaptive Global and Local Bilevel Optimization model of Image Generation | | 0 |
| Decouple-Then-Merge: Towards Better Training for Diffusion Models | | 0 |
| Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models | | 0 |
| Image Aesthetic Reasoning: A New Benchmark for Medical Image Screening with MLLMs | | 0 |
| Glauber Generative Model: Discrete Diffusion Models via Binary Classification | | 0 |
| Decouple-Then-Merge: Finetune Diffusion Models as Multi-Task Learning | | 0 |
| Image Augmentations for GAN Training | | 0 |
| GLASS: Guided Latent Slot Diffusion for Object-Centric Learning | | 0 |
| Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform | | 0 |
| Grand Challenge of 106-Point Facial Landmark Localization | | 0 |
| Iterative Flow Matching -- Path Correction and Gradual Refinement for Enhanced Generative Modeling | | 0 |
| iWarpGAN: Disentangling Identity and Style to Generate Synthetic Iris Images | | 0 |
| Just Noticeable Difference for Machine Perception and Generation of Regularized Adversarial Images with Minimal Perturbation | | 0 |
| Language-Guided Image Tokenization for Generation | | 0 |
| GIU-GANs: Global Information Utilization for Generative Adversarial Networks | | 0 |
| Boosting Unconstrained Face Recognition with Targeted Style Adversary | | 0 |
| Image Denoising: The Deep Learning Revolution and Beyond -- A Survey Paper -- | | 0 |
| GIST: Towards Photorealistic Style Transfer via Multiscale Geometric Representations | | 0 |
| Devil is in the Detail: Towards Injecting Fine Details of Image Prompt in Image Generation via Conflict-free Guidance and Stratified Attention | | 0 |
| Decoupled Doubly Contrastive Learning for Cross Domain Facial Action Unit Detection | | 0 |
| Decoupled Classifier-Free Guidance for Counterfactual Diffusion Models | | 0 |
| Image Generation and Learning Strategy for Deep Document Forgery Detection | | 0 |
| Image Generation and Recognition (Emotions) | | 0 |
| Gibbs Sampling with People | | 0 |
| GH-Feat: Learning Versatile Generative Hierarchical Features from GANs | | 0 |
| Decorating Your Own Bedroom: Locally Controlling Image Generation with Generative Adversarial Networks | | 0 |
| Beyond Reconstruction: A Physics Based Neural Deferred Shader for Photo-realistic Rendering | | 0 |
| Anatomy-Grounded Weakly Supervised Prompt Tuning for Chest X-ray Latent Diffusion Models | | 0 |
| GETAvatar: Generative Textured Meshes for Animatable Human Avatars | | 0 |
| Beyond Reality: The Pivotal Role of Generative AI in the Metaverse | | 0 |
| Anatomy and Physiology of Artificial Intelligence in PET Imaging | | 0 |
| Geometry-guided Cross-view Diffusion for One-to-many Cross-view Image Synthesis | | 0 |
| Decompose to manipulate: Manipulable Object Synthesis in 3D Medical Images with Structured Image Decomposition | | 0 |
| Geometry-Aware Satellite-to-Ground Image Synthesis for Urban Areas | | 0 |
| Geometric Regularity in Deterministic Sampling of Diffusion-based Generative Models | | 0 |
| Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic Models | | 0 |
| Image Generation with Supervised Selection Based on Multimodal Features for Semantic Communications | | 0 |
| Decomposed evaluations of geographic disparities in text-to-image models | | 0 |
| Image Generation with Self Pixel-wise Normalization | | 0 |
| Adaptive Non-Uniform Timestep Sampling for Accelerating Diffusion Model Training | | 0 |
| Geometric Median Matching for Robust k-Subset Selection from Noisy Data | | 0 |

Benchmark Results

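| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Improved DDPM | FID | 12.3 | | Unverified |
| 2 | ADM | FID | 11.84 | | Unverified |
| 3 | BigGAN-deep | FID | 8.1 | | Unverified |
| 4 | Polarity-BigGAN | FID | 6.82 | | Unverified |
| 5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified |
| 6 | MaskGIT | FID | 6.18 | | Unverified |
| 7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified |
| 8 | CDM | FID | 4.88 | | Unverified |
| 9 | ADM-G | FID | 4.59 | | Unverified |
| 10 | RIN | FID | 4.51 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PresGAN | FID | 52.2 | | Unverified |
| 2 | RESFLOW | FID | 48.29 | | Unverified |
| 3 | Residual Flow | FID | 46.37 | | Unverified |
| 4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified |
| 5 | ProdPoly no activation functions | FID | 40.45 | | Unverified |
| 6 | ProdPoly no activation functions | FID | 36.77 | | Unverified |
| 7 | ACGAN | FID | 35.47 | | Unverified |
| 8 | DenseFlow-74-10 | FID | 34.9 | | Unverified |
| 9 | NVAE w/ flow | FID | 32.53 | | Unverified |
| 10 | QSNGAN | FID | 31.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GLIDE + CLS | FID | 30.87 | | Unverified |
| 2 | GLIDE + CLIP | FID | 30.46 | | Unverified |
| 3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified |
| 4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified |
| 5 | PGMGAN | FID | 21.73 | | Unverified |
| 6 | CLR-GAN | FID | 20.27 | | Unverified |
| 7 | FM | FID | 14.45 | | Unverified |
| 8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified |
| 9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified |
| 10 | GLIDE +CLS | KID | 7.95 | | Unverified |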