SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 25512600 of 6689 papers

TitleStatusHype
Adversarial Code Learning for Image Generation0
Diffusion Models Generate Images Like Painters: an Analytical Theory of Outline First, Details Later0
Can Shape-Infused Joint Embeddings Improve Image-Conditioned 3D Diffusion?0
Can segmentation models be trained with fully synthetically generated data?0
Temporal Evolution of Knee Osteoarthritis: A Diffusion-based Morphing Model for X-ray Medical Image Synthesis0
Handling Data Heterogeneity with Generative Replay in Collaborative Learning for Medical Imaging0
Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation0
Diffusion models for Gaussian distributions: Exact solutions and Wasserstein errors0
Diffusion Models Enable Zero-Shot Pose Estimation for Lower-Limb Prosthetic Users0
Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models0
Diffusion Models as Data Mining Tools0
Can Generative Geospatial Diffusion Models Excel as Discriminative Geospatial Foundation Models?0
Can Generative AI Replace Immunofluorescent Staining Processes? A Comparison Study of Synthetically Generated CellPainting Images from Brightfield0
Diffusion Model Conditioning on Gaussian Mixture Model and Negative Gaussian Mixture Gradient0
Can CLIP Count Stars? An Empirical Study on Quantity Bias in CLIP0
Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines0
Can Artificial Intelligence Reconstruct Ancient Mosaics?0
AnySynth: Harnessing the Power of Image Synthetic Data Generation for Generalized Vision-Language Tasks0
Diffusion Instruction Tuning0
Causal Adversarial Network for Learning Conditional and Interventional Distributions0
Diffusion idea exploration for art generation0
Adversarial Audio Super-Resolution with Unsupervised Feature Losses0
Absorb and Converge: Provable Convergence Guarantee for Absorbing Discrete Diffusion Models0
Hand1000: Generating Realistic Hands from Text with Only 1,000 Images0
DiffusionGPT: LLM-Driven Text-to-Image Generation System0
CAMPARI: Camera-Aware Decomposed Generative Neural Radiance Fields0
Camouflaged Image Synthesis Is All You Need to Boost Camouflaged Detection0
Anysize GAN: A solution to the image-warping problem0
Cali-Sketch: Stroke Calibration and Completion for High-Quality Face Image Generation from Human-Like Sketches0
Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images0
Abnormal Colon Polyp Image Synthesis Using Conditional Adversarial Networks for Improved Detection Performance0
CompAlign: Improving Compositional Text-to-Image Generation with a Complex Benchmark and Fine-Grained Feedback0
DiffUMI: Training-Free Universal Model Inversion via Unconditional Diffusion for Face Recognition0
Calibrated Vehicle Paint Signatures for Simulating Hyperspectral Imagery0
CA-GAN: Weakly Supervised Color Aware GAN for Controllable Makeup Transfer0
AnyScene: Customized Image Synthesis with Composited Foreground0
Dilated Spatial Generative Adversarial Networks for Ergodic Image Generation0
CAGAN: Text-To-Image Generation with Combined Attention GANs0
CAFLOW: Conditional Autoregressive Flows0
Adversarial Attacks on Image Generation With Made-Up Words0
HairNeRF: Geometry-Aware Image Synthesis for Hairstyle Transfer0
Halluci-Net: Scene Completion by Exploiting Object Co-occurrence Relationships0
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances0
DiffusionBlocks: Blockwise Training for Generative Models via Score-Based Diffusion0
CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling0
Diffusion Beats Autoregressive: An Evaluation of Compositional Generation in Text-to-Image Models0
Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler0
AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status0
CacheQuant: Comprehensively Accelerated Diffusion Models0
Cache Me if You Can: Accelerating Diffusion Models through Block Caching0
Show:102550
← PrevPage 52 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified