SOTAVerified

Image Generation

Image generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

  • Unconditional generation refers to generating samples from the dataset distribution without any conditioning input, i.e. sampling from $p(y)$.
  • Conditional image generation (subtask) refers to generating samples from the dataset distribution conditioned on a label $x$, i.e. sampling from $p(y|x)$.
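
The distinction between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. It assumes a toy setting that is not from the source: each "image" $y$ is a single number, each label $x$ has its own Gaussian, and the names `CLASS_PRIOR`, `CLASS_PARAMS`, `sample_conditional`, and `sample_unconditional` are hypothetical. Real generative models learn these distributions from data instead of having them written down.

```python
import random

# Hypothetical toy setup: p(x) over labels and a 1-D Gaussian p(y|x) per label.
CLASS_PRIOR = {"cat": 0.5, "dog": 0.3, "bird": 0.2}
CLASS_PARAMS = {"cat": (-2.0, 0.5), "dog": (0.0, 0.5), "bird": (3.0, 0.5)}  # (mean, std)


def sample_conditional(label, rng):
    """Draw y ~ p(y|x) for the given label x."""
    mean, std = CLASS_PARAMS[label]
    return rng.gauss(mean, std)


def sample_unconditional(rng):
    """Draw y ~ p(y) = sum_x p(x) p(y|x): pick a label, sample, discard the label."""
    labels = list(CLASS_PRIOR)
    weights = [CLASS_PRIOR[name] for name in labels]
    label = rng.choices(labels, weights=weights, k=1)[0]
    return sample_conditional(label, rng)


rng = random.Random(0)
cond = [sample_conditional("bird", rng) for _ in range(2000)]
uncond = [sample_unconditional(rng) for _ in range(2000)]

cond_mean = sum(cond) / len(cond)        # close to the "bird" mean
uncond_mean = sum(uncond) / len(uncond)  # close to the prior-weighted mean of all classes
print(f"conditional mean: {cond_mean:.2f}, unconditional mean: {uncond_mean:.2f}")
```

In this sketch the conditional samples concentrate around one class, while the unconditional samples cover the whole mixture.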

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 4251–4300 of 6689 papers

Title | Status | Hype
RealisID: Scale-Robust and Fine-Controllable Identity Customization via Local and Global Complementation | | 0
Realistic Endoscopic Image Generation Method Using Virtual-to-real Image-domain Translation | | 0
Edit360: 2D Image Edits to 3D Assets from Any Angle | | 0
Unsupervised Cross-Modal Synthesis of Subject-Specific Scans | | 0
Realistic Image Generation using Region-phrase Attention | | 0
Realistic Image Synthesis with Configurable 3D Scene Layouts | | 0
Realistic River Image Synthesis using Deep Generative Adversarial Networks | | 0
Realistic Ultrasound Image Synthesis for Improved Classification of Liver Disease | | 0
Reality's Canvas, Language's Brush: Crafting 3D Avatars from Monocular Video | | 0
RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning | | 0
Real-World Denoising via Diffusion Model | | 0
Unsupervised Domain Adaptation via Disentangled Representations: Application to Cross-Modality Liver Segmentation | | 0
Unsupervised Domain Transfer with Conditional Invertible Neural Networks | | 0
A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models | | 0
Rebooting ACGAN: Auxiliary Classifier GANs with Stable Training | | 0
Recent Progress of Face Image Synthesis | | 0
Recommendation with Generative Models | | 0
Reconciling Semantic Controllability and Diversity for Remote Sensing Image Synthesis with Hybrid Semantic Embedding | | 0
Reconstructing the Noise Manifold for Image Denoising | | 0
Reconstructing the Noise Variance Manifold for Image Denoising | | 0
Unsupervised evaluation of GAN sample quality: Introducing the TTJac Score | | 0
Towards Device Efficient Conditional Image Generation | | 0
Unsupervised Generative 3D Shape Learning from Natural Images | | 0
ReCo: Region-Controlled Text-to-Image Generation | | 0
Recovering Global Data Distribution Locally in Federated Learning | | 0
AIwriting: Relations Between Image Generation and Digital Writing | | 0
RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification | | 0
Red blood cell image generation for data augmentation using Conditional Generative Adversarial Networks | | 0
REDEditing: Relationship-Driven Precise Backdoor Poisoning on Text-to-Image Diffusion Models | | 0
Unsupervised High-Fidelity Facial Texture Generation and Reconstruction | | 0
Redes Generativas Adversarias (GAN) Fundamentos Teóricos y Aplicaciones | | 0
Re-designing cities with conditional adversarial networks | | 0
Unsupervised Histopathology Image Synthesis | | 0
Model Collapse in the Self-Consuming Chain of Diffusion Finetuning: A Novel Perspective from Quantitative Trait Modeling | | 0
ReDirTrans: Latent-to-Latent Translation for Gaze and Head Redirection | | 0
ReDistill: Residual Encoded Distillation for Peak Memory Reduction | | 0
Red-Teaming the Stable Diffusion Safety Filter | | 0
Unsupervised Image-generation Enhanced Adaptation for Object Detection in Thermal images | | 0
ReelFramer: Human-AI Co-Creation for News-to-Video Translation | | 0
Interactive Image Generation Using Scene Graphs | | 0
RefDrop: Controllable Consistency in Image or Video Generation via Reference Feature Guidance | | 0
Referee Can Play: An Alternative Approach to Conditional Generation via Model Inversion | | 0
Reference-Based 3D-Aware Image Editing with Triplanes | | 0
Reference-based Variational Autoencoders | | 0
RefineNet: Enhancing Text-to-Image Conversion with High-Resolution and Detail Accuracy through Hierarchical Transformers and Progressive Refinement | | 0
Unsupervised Image Registration Towards Enhancing Performance and Explainability in Cardiac And Brain Image Analysis | | 0
Grand Challenge of 106-Point Facial Landmark Localization | | 0
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation | | 0
AI's Blind Spots: Geographic Knowledge and Diversity Deficit in Generated Urban Scenario | | 0
Unsupervised Image Transformation Learning via Generative Adversarial Networks | | 0
Page 86 of 134

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Improved DDPM | FID | 12.3 | | Unverified
2 | ADM | FID | 11.84 | | Unverified
3 | BigGAN-deep | FID | 8.1 | | Unverified
4 | Polarity-BigGAN | FID | 6.82 | | Unverified
5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified
6 | MaskGIT | FID | 6.18 | | Unverified
7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified
8 | CDM | FID | 4.88 | | Unverified
9 | ADM-G | FID | 4.59 | | Unverified
10 | RIN | FID | 4.51 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PresGAN | FID | 52.2 | | Unverified
2 | RESFLOW | FID | 48.29 | | Unverified
3 | Residual Flow | FID | 46.37 | | Unverified
4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified
5 | ProdPoly no activation functions | FID | 40.45 | | Unverified
6 | ProdPoly no activation functions | FID | 36.77 | | Unverified
7 | ACGAN | FID | 35.47 | | Unverified
8 | DenseFlow-74-10 | FID | 34.9 | | Unverified
9 | NVAE w/ flow | FID | 32.53 | | Unverified
10 | QSNGAN | FID | 31.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GLIDE + CLS | FID | 30.87 | | Unverified
2 | GLIDE + CLIP | FID | 30.46 | | Unverified
3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified
4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified
5 | PGMGAN | FID | 21.73 | | Unverified
6 | CLR-GAN | FID | 20.27 | | Unverified
7 | FM | FID | 14.45 | | Unverified
8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified
9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified
10 | GLIDE + CLS | KID | 7.95 | | Unverified
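
Most entries above report FID (Fréchet Inception Distance), where lower values indicate generated images whose feature statistics are closer to those of real images. With feature means $\mu_r, \mu_g$ and covariances $\Sigma_r, \Sigma_g$ for real and generated images, $\mathrm{FID} = \lVert \mu_r - \mu_g \rVert^2 + \mathrm{Tr}\left(\Sigma_r + \Sigma_g - 2(\Sigma_r \Sigma_g)^{1/2}\right)$. The sketch below is only a one-dimensional illustration of that formula, where it reduces to $(\mu_r - \mu_g)^2 + (\sigma_r - \sigma_g)^2$. The function name `frechet_distance_1d` is hypothetical, and the code does not extract Inception features, so its outputs are not comparable to the leaderboard numbers.

```python
import statistics


def frechet_distance_1d(real, generated):
    """Fréchet distance between 1-D Gaussians fitted to two samples.

    Simplified stand-in for FID: real FID fits multivariate Gaussians to
    Inception-network features, whereas this uses raw scalar values.
    """
    mu_r, mu_g = statistics.mean(real), statistics.mean(generated)
    sd_r, sd_g = statistics.pstdev(real), statistics.pstdev(generated)
    return (mu_r - mu_g) ** 2 + (sd_r - sd_g) ** 2


# Identical samples give zero distance; shifting the mean increases it.
same = frechet_distance_1d([0.0, 2.0], [0.0, 2.0])
shifted = frechet_distance_1d([0.0, 2.0], [1.0, 3.0])
print(same, shifted)
```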