SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 35013550 of 6689 papers

TitleStatusHype
Transformer-based Cross-Modal Recipe Embeddings with Large Batch Training0
Medical Image Generation using Generative Adversarial Networks0
Medical Image Segmentation on MRI Images with Missing Modalities: A Review0
Medical Image Segmentation Using a U-Net type of Architecture0
Medical Image Synthesis for Data Augmentation and Anonymization using Generative Adversarial Networks0
Medical Image Synthesis via Fine-Grained Image-Text Alignment and Anatomy-Pathology Prompting0
Medical Image Synthesis with Context-Aware Generative Adversarial Networks0
Are Conditional Latent Diffusion Models Effective for Image Restoration?0
Mediffusion: Joint Diffusion for Self-Explainable Semi-Supervised Classification and Medical Image Generation0
Transformer-based Generative Adversarial Networks in Computer Vision: A Comprehensive Survey0
MediSyn: Text-Guided Diffusion Models for Broad Medical 2D and 3D Image Synthesis0
What Does DALL-E 2 Know About Radiology?0
Transformer for Times Series: an Application to the S&P5000
Are conditional GANs explicitly conditional?0
MedSegDiffNCA: Diffusion Models With Neural Cellular Automata for Skin Lesion Segmentation0
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos0
Transformers in Medical Image Analysis: A Review0
MEGANE: Morphable Eyeglass and Avatar Network0
MEGAN: Mixture of Experts of Generative Adversarial Networks for Multimodal Image Generation0
Megapixel Size Image Creation using Generative Adversarial Networks0
ArcGAN: Generative Adversarial Networks for 3D Architectural Image Generation0
MelanoGANs: High Resolution Skin Lesion Synthesis with GANs0
Transforming Image Generation from Scene Graphs0
Transform the Set: Memory Attentive Generation of Guided and Unguided Image Collages0
Membership Attacks on Conditional Generative Models Using Image Difficulty0
Membership Inference Attacks Against Text-to-image Generation Models0
Membership Inference Attacks on Diffusion Models via Quantile Regression0
What If We Recaption Billions of Web Images with LLaMA-3?0
Memories of Forgotten Concepts0
Memory-Driven Text-to-Image Generation0
Memory-Efficient 3D High-Resolution Medical Image Synthesis Using CRF-Guided GANs0
Memory-efficient GAN-based Domain Translation of High Resolution 3D Medical Images0
Memory Triggers: Unveiling Memorization in Text-To-Image Generative Models through Word-Level Duplication0
Translate the Facial Regions You Like Using Region-Wise Normalization0
Translating and Segmenting Multimodal Medical Volumes with Cycle- and Shape-Consistency Generative Adversarial Network0
Message Passing Multi-Agent GANs0
MEt3R: Measuring Multi-View Consistency in Generated Images0
MetaCLUE: Towards Comprehensive Visual Metaphors Research0
Translation-Enhanced Multilingual Text-to-Image Generation0
MetaDecorator: Generating Immersive Virtual Tours through Multimodality0
MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation0
MetaHead: An Engine to Create Realistic Digital Head0
TR at SemEval-2020 Task 4: Exploring the Limits of Language-model-based Common Sense Validation0
Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments0
Meta-Learning Divergences of Variational Inference0
MetalGAN: Multi-Domain Label-Less Image Synthesis Using cGANs and Meta-Learning0
Method and Software Tool for Generating Artificial Databases of Biomedical Images Based on Deep Neural Networks0
What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale0
Metrics to Quantify Global Consistency in Synthetic Medical Images0
TR-DQ: Time-Rotation Diffusion Quantization0
Show:102550
← PrevPage 71 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified