SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 35513600 of 6689 papers

TitleStatusHype
Zero-shot Text-guided Infinite Image Synthesis with LLM guidance0
I2AM: Interpreting Image-to-Image Latent Diffusion Models via Attribution Maps0
Towards Understanding Unsafe Video GenerationCode0
Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIsCode0
Flatfish Disease Detection Based on Part Segmentation Approach and Disease Image Generation0
How Control Information Influences Multilingual Text Image Generation and Editing?Code0
Subject-driven Text-to-Image Generation via Preference-based Reinforcement LearningCode0
Beta Sampling is All You Need: Efficient Image Generation Strategy for Diffusion Models using Stepwise Spectral Analysis0
DiNO-Diffusion. Scaling Medical Diffusion via Self-Supervised Pre-Training0
Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse ProblemsCode0
Efficient Training with Denoised Neural Weights0
Mask-guided cross-image attention for zero-shot in-silico histopathologic image generation with a diffusion model0
Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis0
OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting0
Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval0
Optical Diffusion Models for Image Generation0
Physics-Inspired Generative Models in Medical Imaging: A Review0
A Survey of Defenses against AI-generated Visual Media: Detection, Disruption, and Authentication0
ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline ContextCode0
Layout-and-Retouch: A Dual-stage Framework for Improving Diversity in Personalized Image Generation0
FairyLandAI: Personalized Fairy Tales utilizing ChatGPT and DALLE-30
Machine Apophenia: The Kaleidoscopic Generation of Architectural Images0
Surgical Text-to-Image Generation0
E2VIDiff: Perceptual Events-to-Video Reconstruction using Diffusion Priors0
Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation DatasetsCode0
Boosting Medical Image Synthesis via Registration-guided Consistency and Disentanglement Learning0
Sketch-Guided Scene Image Generation0
Accelerating Mobile Edge Generation (MEG) by Constrained Learning0
Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion0
Spanish TrOCR: Leveraging Transfer Learning for Language AdaptationCode0
MMIS: Multimodal Dataset for Interior Scene Visual Generation and Recognition0
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing0
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation0
MobilePortrait: Real-Time One-Shot Neural Head Avatars on Mobile Devices0
Layered Diffusion Model for One-Shot High Resolution Text-to-Image Synthesis0
Rethinking Image Skip Connections in StyleGAN20
An Improved Method for Personalizing Diffusion Models0
Multi-scale Conditional Generative Modeling for Microscopic Image Restoration0
Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models0
PROUD: PaRetO-gUided Diffusion Model for Multi-objective GenerationCode0
DiCTI: Diffusion-based Clothing Designer via Text-guided Input0
Medical Image Fusion for High-Level Analysis: A Mutual Enhancement Framework for Unaligned PAT and MRICode0
Model Collapse in the Self-Consuming Chain of Diffusion Finetuning: A Novel Perspective from Quantitative Trait Modeling0
Lateralization LoRA: Interleaved Instruction Tuning with Modality-Specialized Adaptations0
Representation learning with CGAN for casual inference0
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs0
Mobile Edge Generation-Enabled Digital Twin: Architecture Design and Research OpportunitiesCode0
SwiftDiffusion: Efficient Diffusion Model Serving with Add-on Modules0
UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks0
Label-free Neural Semantic Image Synthesis0
Show:102550
← PrevPage 72 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified