SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 23012350 of 6689 papers

TitleStatusHype
Multi-method Integration with Confidence-based Weighting for Zero-shot Image Classification0
Defect Image Sample Generation With Diffusion Prior for Steel Surface Defect Recognition0
Improving Subject-Driven Image Synthesis with Subject-Agnostic Guidance0
Long Tail Image Generation Through Feature Space Augmentation and Iterated LearningCode0
Towards Inclusive Face Recognition Through Synthetic Ethnicity Alteration0
SonicDiffusion: Audio-Driven Image Generation and Editing with Pretrained Diffusion Models0
Guided Conditional Diffusion Classifier (ConDiff) for Enhanced Prediction of Infection in Diabetic Foot Ulcers0
Compressive Sensing Imaging Using Caustic Lens Mask Generated by Periodic Perturbation in a Ripple Tank0
RGBX: Image decomposition and synthesis using material- and lighting-aware diffusion models0
Streamlining Image Editing with Layered Diffusion Brushes0
UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration EnhancementCode0
TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion ModelsCode1
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation0
NeRF-Insert: 3D Local Editing with Multimodal Control Signals0
DOCCI: Descriptions of Connected and Contrasting Images0
SwipeGANSpace: Swipe-to-Compare Image Generation via Efficient Latent Space Exploration0
FlexiFilm: Long Video Generation with Flexible ConditionsCode1
SIDBench: A Python Framework for Reliably Assessing Synthetic Image Detection MethodsCode2
Anywhere: A Multi-Agent Framework for User-Guided, Reliable, and Diverse Foreground-Conditioned Image Generation0
G-Refine: A General Quality Refiner for Text-to-Image GenerationCode1
PKU-AIGIQA-4K: A Perceptual Quality Assessment Database for Both Text-to-Image and Image-to-Image AI-Generated ImagesCode0
Learning Mixtures of Gaussians Using Diffusion Models0
A robust and scalable framework for hallucination detection in virtual tissue staining and digital pathology0
Hide and Seek: How Does Watermarking Impact Face Recognition?0
TheaterGen: Character Management with LLM for Consistent Multi-turn Image GenerationCode2
Grounded Compositional and Diverse Text-to-3D with Pretrained Multi-View Diffusion Model0
Improving Training-free Conditional Diffusion Model via Fisher Information0
Causal Diffusion Autoencoders: Toward Counterfactual Generation via Diffusion Probabilistic ModelsCode1
LM-IGTD: a 2D image generator for low-dimensional and mixed-type tabular data to leverage the potential of convolutional neural networks0
Efficient Exploration of Image Classifier Failures with Bayesian Optimization and Text-to-Image Models0
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models0
Trinity Detector:text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection0
Synthesizing Iris Images using Generative Adversarial Networks: Survey and Comparative Analysis0
REBEL: Reinforcement Learning via Regressing Relative RewardsCode2
Denoising: from classical methods to deep CNNsCode1
Conditional Distribution Modelling for Few-Shot Image Synthesis with Diffusion Models0
MuseumMaker: Continual Style Customization without Catastrophic Forgetting0
PuLID: Pure and Lightning ID Customization via Contrastive AlignmentCode7
Sketch2Human: Deep Human Generation with Disentangled Geometry and Appearance Control0
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning0
GLoD: Composing Global Contexts and Local Details in Image Generation0
FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction0
From Parts to Whole: A Unified Reference Framework for Controllable Human Image GenerationCode2
SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models0
Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation0
MultiBooth: Towards Generating All Your Concepts in an Image from TextCode2
Accelerating Image Generation with Sub-path Linear Approximation Model0
Towards Better Text-to-Image Generation Alignment via Attention Modulation0
SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and GenerationCode4
ArtNeRF: A Stylized Neural Field for 3D-Aware Cartoonized Face SynthesisCode1
Show:102550
← PrevPage 47 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified