SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 39013950 of 6689 papers

TitleStatusHype
Multi-Sensor Diffusion-Driven Optical Image Translation for Large-Scale Applications0
Optimal Budgeted Rejection Sampling for Generative Models0
Optimal Stochastic Trace Estimation in Generative Modeling0
Optimized latent-code selection for explainable conditional text-to-image GANs0
Optimizing Few-Step Diffusion Samplers by Gradient Descent0
Optimizing Knowledge Distillation in Transformers: Enabling Multi-Head Attention without Alignment Barriers0
Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation0
3D-LDM: Neural Implicit 3D Shape Generation with Latent Diffusion Models0
Orchid: Image Latent Diffusion for Joint Appearance and Geometry Generation0
Zoom, Enhance! Measuring Surveillance GAN Up-sampling0
OrganiQ: Mitigating Classical Resource Bottlenecks of Quantum Generative Adversarial Networks on NISQ-Era Machines0
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation0
OrthoGAN:High-Precision Image Generation for Teeth Orthodontic Visualization0
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation0
OT-Net: A Reusable Neural Optimal Transport Solver0
One-shot Ultra-high-Resolution Generative Adversarial Network That Synthesizes 16K Images On A Single GPU0
A New Dataset and Boundary-Attention Semantic Segmentation for Face Parsing0
Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models0
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer0
A new approach for encoding code and assisting code understanding0
Owls are wise and foxes are unfaithful: Uncovering animal stereotypes in vision-language models0
Ownership Protection of Generative Adversarial Networks0
P3S-Diffusion:A Selective Subject-driven Generation Framework via Point Supervision0
3D-GIF: 3D-Controllable Object Generation via Implicit Factorized Representations0
PA-GAN: Improving GAN Training by Progressive Augmentation0
UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning0
PAID: A Framework of Product-Centric Advertising Image Design0
PAI-Diffusion: Constructing and Serving a Family of Open Chinese Diffusion Models for Text-to-image Synthesis on the Cloud0
Whole-Body Image-to-Image Translation for a Virtual Scanner in a Healthcare Digital Twin0
Unified Autoregressive Visual Generation and Understanding with Continuous Tokens0
An Ensemble Approach for Brain Tumor Segmentation and Synthesis0
Painter: Teaching Auto-regressive Language Models to Draw Sketches0
Painting 3D Nature in 2D: View Synthesis of Natural Scenes from a Single Semantic Mask0
Paint it Black: Generating paintings from text descriptions0
Pal-GAN: Palette-conditioned Generative Adversarial Networks0
PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion0
3DGH: 3D Head Generation with Composable Hair and Face0
Panoptic-based Image Synthesis0
Panoptic-aware Image-to-Image Translation0
Panoptic Diffusion Models: co-generation of images and segmentation maps0
Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model0
ParaGAN: A Scalable Distributed Training Framework for Generative Adversarial Networks0
Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields0
Parallel Multiscale Autoregressive Density Estimation0
Parallel Optimal Transport GAN0
Parallel Recurrent Data Augmentation for GAN training with Limited and Diverse Data0
Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection0
Parallel Scheduled Sampling0
Parallel Sequence Modeling via Generalized Spatial Propagation Network0
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity0
Show:102550
← PrevPage 79 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified