SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 49014950 of 6689 papers

TitleStatusHype
Consistent Diffusion: Denoising Diffusion Model with Data-Consistent Training for Image Restoration0
Consistent Image Layout Editing with Diffusion Models0
Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models0
Constrained Generative Adversarial Networks for Interactive Image Generation0
Constrained Image Generation Using Binarized Neural Networks with Decision Procedures0
Constructing Dreams using Generative AI0
Stable Diffusion Exposed: Gender Bias from Prompt to Image0
Content-Aware Preserving Image Generation0
Content-Conditioned Generation of Stylized Free hand Sketches0
ContentV: Efficient Training of Video Generation Models with Limited Compute0
Context-Aware Autoregressive Models for Multi-Conditional Image Generation0
Instance-Aware Image Completion0
Stable Diffusion for Data Augmentation in COCO and Weed Datasets0
Context Diffusion: In-Context Aware Image Generation0
Contextual Chart Generation for Cyber Deception0
Stable Diffusion is a Natural Cross-Modal Decoder for Layered AI-generated Image Compression0
Video Perception Models for 3D Scene Synthesis0
Stable Diffusion with Continuous-time Neural Network0
Continuation of Famous Art with AI: A Conditional Adversarial Network Inpainting Approach0
StableGarment: Garment-Centric Generation via Stable Diffusion0
Continuous Speech Synthesis using per-token Latent Diffusion0
StableMaterials: Enhancing Diversity in Material Generation via Semi-Supervised Learning0
Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction0
ContRail: A Framework for Realistic Railway Image Synthesis using ControlNet0
Stable Messenger: Steganography for Message-Concealed Image Generation0
ADIR: Adaptive Diffusion for Image Reconstruction0
Stable Rank Normalization for Improved Generalization in Neural Networks and GANs0
ContraGAN: Contrastive Learning for Conditional Image Generation0
Contrastive Learning-Enhanced Trajectory Matching for Small-Scale Dataset Distillation0
Stable Rivers: A Case Study in the Application of Text-to-Image Generative Models for Earth Sciences0
Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models0
Categorical EHR Imputation with Generative Adversarial Nets0
Video Prediction by Modeling Videos as Continuous Multi-Dimensional Processes0
Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields0
Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models0
Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion0
Controllable Human Image Generation with Personalized Multi-Garments0
Controllable Image Generation via Collage Representations0
Controllable Image Generation With Composed Parallel Token Prediction0
Controllable Image Synthesis of Industrial Data Using Stable Diffusion0
Controllable Image Synthesis via SegVAE0
StackGAN: Facial Image Generation Optimizations0
Cross Modality Medical Image Synthesis for Improving Liver Segmentation0
Staging E-Commerce Products for Online Advertising using Retrieval Assisted Image Generation0
Controllable Person Image Synthesis with Pose-Constrained Latent Diffusion0
Controllable Radiance Fields for Dynamic Face Synthesis0
Controllable retinal image synthesis using conditional StyleGAN and latent space manipulation for improved diagnosis and grading of diabetic retinopathy0
Controllable Text-to-Image Generation with GPT-40
ControlVAE: Controllable Variational Autoencoder0
Controlled and Conditional Text to Image Generation with Diffusion Prior0
Show:102550
← PrevPage 99 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified