SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation draws samples from the data distribution without any conditioning input, i.e. $p(y)$.
  • Conditional image generation (subtask) draws samples conditioned on an input such as a label $x$, i.e. $p(y|x)$.
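
The difference between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. It assumes a toy dataset in which strings stand in for images, and it resamples the dataset directly instead of training a model, so it only shows the sampling interface. The names `DATASET`, `sample_unconditional`, and `sample_conditional` are hypothetical and do not come from any library.

```python
import random

# Toy "dataset" of (label x, image y) pairs; strings stand in for images.
# Hypothetical data, for illustration only.
DATASET = [
    ("cat", "cat_img_0"),
    ("cat", "cat_img_1"),
    ("dog", "dog_img_0"),
    ("dog", "dog_img_1"),
    ("dog", "dog_img_2"),
]


def sample_unconditional(rng):
    """Draw y ~ p(y): pick any image, ignoring its label."""
    _, image = rng.choice(DATASET)
    return image


def sample_conditional(rng, label):
    """Draw y ~ p(y|x): pick only among images whose label equals x."""
    candidates = [image for x, image in DATASET if x == label]
    if not candidates:
        raise ValueError(f"no images with label {label!r}")
    return rng.choice(candidates)


if __name__ == "__main__":
    rng = random.Random(0)  # seeded for repeatable draws
    print("unconditional:", sample_unconditional(rng))
    print("conditional on 'cat':", sample_conditional(rng, "cat"))
```

A trained generative model would replace the lookup with a learned approximation of the distribution, but the two entry points stay the same: one takes no conditioning input, the other takes a label.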

This section lists state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 3901–3950 of 6689 papers

Title | Status | Hype
Illiterate DALL-E Learns to Compose | | 0
Three Dimensional MR Image Synthesis with Progressive Generative Adversarial Networks | | 0
Augmentation Techniques Analysis with Removal of Class Imbalance Using PyTorch for Intel Scene Dataset | | 0
ILLUME: Illuminating Your LLMs to See, Draw, and Self-Enhance | | 0
Illumination Variation Correction Using Image Synthesis For Unsupervised Domain Adaptive Person Re-Identification | | 0
Illustrious: an Open Advanced Illustration Model | | 0
VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers | | 0
Augmentation-Interpolative AutoEncoders for Unsupervised Few-Shot Image Generation | | 0
Image2Text2Image: A Novel Framework for Label-Free Evaluation of Image-to-Text Generation with Text-to-Image Diffusion Models | | 0
Image Aesthetic Reasoning: A New Benchmark for Medical Image Screening with MLLMs | | 0
Audio Latent Space Cartography | | 0
Image Anything: Towards Reasoning-coherent and Training-free Multi-modal Image Generation | | 0
Image Augmentations for GAN Training | | 0
Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function | | 0
Image-based model parameter optimization using Model-Assisted Generative Adversarial Networks | | 0
TIDE : Temporal-Aware Sparse Autoencoders for Interpretable Diffusion Transformers in Image Generation | | 0
TIE: Revolutionizing Text-based Image Editing for Complex-Prompt Following and High-Fidelity Editing | | 0
Image Compression with Product Quantized Masked Image Modeling | | 0
Image Denoising: The Deep Learning Revolution and Beyond -- A Survey Paper -- | | 0
Audio Dequantization for High Fidelity Audio Generation in Flow-based Neural Vocoder | | 0
Image Generation and Editing with Variational Info Generative AdversarialNetworks | | 0
Image Generation and Learning Strategy for Deep Document Forgery Detection | | 0
Image Generation and Recognition (Emotions) | | 0
Image Generation and Translation with Disentangled Representations | | 0
Deformable 3D Shape Diffusion Model | | 0
Tiled Diffusion | | 0
Image Generation from Contextually-Contradictory Prompts | | 0
Image Generation from Image Captioning -- Invertible Approach | | 0
TimbreCLIP: Connecting Timbre to Text and Images | | 0
Time Cell Inspired Temporal Codebook in Spiking Neural Networks for Enhanced Image Generation | | 0
Image Generation Network for Covert Transmission in Online Social Network | | 0
Image Generation using Continuous Filter Atoms | | 0
Image Generation with Multimodal Priors using Denoising Diffusion Probabilistic Models | | 0
Image Generation with Supervised Selection Based on Multimodal Features for Semantic Communications | | 0
Time flies by: Analyzing the Impact of Face Ageing on the Recognition Performance with Synthetic Data | | 0
Image Generation with Self Pixel-wise Normalization | | 0
Time Series Diffusion Method: A Denoising Diffusion Probabilistic Model for Vibration Signal Generation | | 0
Image Generative Semantic Communication with Multi-Modal Similarity Estimation for Resource-Limited Networks | | 0
Time Series Generative Learning with Application to Brain Imaging Analysis | | 0
Image-Guided Microstructure Optimization using Diffusion Models: Validated with Li-Mn-rich Cathode Precursors | | 0
Image-guided Neural Object Rendering | | 0
Image Inpainting Models are Effective Tools for Instruction-guided Image Editing | | 0
A Two Stage GAN for High Resolution Retinal Image Generation and Segmentation | | 0
Imagen Video: High Definition Video Generation with Diffusion Models | | 0
ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation | | 0
Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models | | 0
Accelerating Score-based Generative Models for High-Resolution Image Synthesis | | 0
ZeroAvatar: Zero-shot 3D Avatar Generation from a Single Image | | 0
Accelerating Mobile Edge Generation (MEG) by Constrained Learning | | 0
Image Super-Resolution With Deep Variational Autoencoders | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Improved DDPM | FID | 12.3 | | Unverified
2 | ADM | FID | 11.84 | | Unverified
3 | BigGAN-deep | FID | 8.1 | | Unverified
4 | Polarity-BigGAN | FID | 6.82 | | Unverified
5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified
6 | MaskGIT | FID | 6.18 | | Unverified
7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified
8 | CDM | FID | 4.88 | | Unverified
9 | ADM-G | FID | 4.59 | | Unverified
10 | RIN | FID | 4.51 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PresGAN | FID | 52.2 | | Unverified
2 | RESFLOW | FID | 48.29 | | Unverified
3 | Residual Flow | FID | 46.37 | | Unverified
4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified
5 | ProdPoly no activation functions | FID | 40.45 | | Unverified
6 | ProdPoly no activation functions | FID | 36.77 | | Unverified
7 | ACGAN | FID | 35.47 | | Unverified
8 | DenseFlow-74-10 | FID | 34.9 | | Unverified
9 | NVAE w/ flow | FID | 32.53 | | Unverified
10 | QSNGAN | FID | 31.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GLIDE + CLS | FID | 30.87 | | Unverified
2 | GLIDE + CLIP | FID | 30.46 | | Unverified
3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified
4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified
5 | PGMGAN | FID | 21.73 | | Unverified
6 | CLR-GAN | FID | 20.27 | | Unverified
7 | FM | FID | 14.45 | | Unverified
8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified
9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified
10 | GLIDE +CLS | KID | 7.95 | | Unverified