SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 41514200 of 6689 papers

TitleStatusHype
2D GANs Meet Unsupervised Single-view 3D Reconstruction0
Progressive Retinex: Mutually Reinforced Illumination-Noise Perception Network for Low Light Image Enhancement0
Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Comprehension and Generation0
Progress Towards Decoding Visual Imagery via fNIRS0
Rethinking Training for De-biasing Text-to-Image Generation: Unlocking the Potential of Stable Diffusion0
Projection image-to-image translation in hybrid X-ray/MR imaging0
Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation0
Unlocking Pre-trained Image Backbones for Semantic Image Synthesis0
Prompt Augmentation for Self-supervised Text-guided Image Manipulation0
Unlocking Spatial Comprehension in Text-to-Image Diffusion Models0
PromptCrafter: Crafting Text-to-Image Prompt through Mixed-Initiative Dialogue with LLM0
ForgetMe: Evaluating Selective Forgetting in Generative Models0
A Taxonomy of Prompt Modifiers for Text-To-Image Generation0
Prompt Expansion for Adaptive Text-to-Image Generation0
ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis0
A LoRA is Worth a Thousand Pictures0
PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models0
Promptify: Text-to-Image Generation through Interactive Prompt Exploration with Large Language Models0
Prompting Forgetting: Unlearning in GANs via Textual Guidance0
Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models0
Prompt Mechanisms in Medical Imaging: A Comprehensive Survey0
PromptMobile: Efficient Promptus for Low Bandwidth Mobile Video Streaming0
Prompt Performance Prediction for Image Generation0
Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data Generation Framework using Foundational Models0
Prompt Recovery for Image Generation Models: A Comparative Study of Discrete Optimizers0
World Knowledge from AI Image Generation for Robot Control0
Unlocking Visual Secrets: Inverting Features with Diffusion Priors for Image Reconstruction0
Prompt-Softbox-Prompt: A free-text Embedding Control for Image Editing0
Unmasking the Canvas: A Dynamic Benchmark for Image Generation Jailbreaking and LLM Content Safety0
All but One: Surgical Concept Erasing with Model Preservation in Text-to-Image Diffusion Models0
Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion Models0
Unmixing Before Fusion: A Generalized Paradigm for Multi-Source-based Hyperspectral Image Synthesis0
ProstateGAN: Mitigating Data Bias via Prostate Diffusion Imaging Synthesis with Generative Adversarial Networks0
Unpaired Brain MR-to-CT Synthesis using a Structure-Constrained CycleGAN0
Protecting Intellectual Property of Generative Adversarial Networks From Ambiguity Attacks0
Protective Perturbations against Unauthorized Data Usage in Diffusion-based Image Generation0
Prototype memory and attention mechanisms for few shot image generation0
Provably Secure Robust Image Steganography via Cross-Modal Error Correction0
Proxy-Tuning: Tailoring Multimodal Autoregressive Models for Subject-Driven Image Generation0
Align Your Flow: Scaling Continuous-Time Flow Map Distillation0
Unpaired Cartoon Image Synthesis via Gated Cycle Mapping0
Psi-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models0
Unpaired Deblurring via Decoupled Diffusion Model0
Alignment-free HDR Deghosting with Semantics Consistent Transformer0
Aligning Text-to-Image Models using Human Feedback0
Pushing Auto-regressive Models for 3D Shape Generation at Capacity and Scalability0
Pyramidal Denoising Diffusion Probabilistic Models0
(Un)paired signal-to-signal translation with 1D conditional GANs0
Aligning Latent Spaces with Flow Priors0
Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation0
Show:102550
← PrevPage 84 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified