SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 46014650 of 6689 papers

TitleStatusHype
Trajectory-aware Principal Manifold Framework for Data Augmentation and Image Generation0
Mask-guided Data Augmentation for Multiparametric MRI Generation with a Rare Hepatocellular Carcinoma0
What can Discriminator do? Towards Box-free Ownership Verification of Generative Adversarial NetworkCode0
Beyond Reality: The Pivotal Role of Generative AI in the Metaverse0
Testing the Depth of ChatGPT's Comprehension via Cross-Modal Tasks Based on ASCII-Art: GPT3.5's Abilities in Regard to Recognizing and Generating ASCII-Art Are Not Totally Lacking0
Staging E-Commerce Products for Online Advertising using Retrieval Assisted Image Generation0
Shrink-Perturb Improves Architecture Mixing during Population Based Training for Neural Architecture SearchCode0
TEDi: Temporally-Entangled Diffusion for Long-Term Motion Synthesis0
EqGAN: Feature Equalization Fusion for Few-shot Image Generation0
Deepfake Image Generation for Improved Brain Tumor Segmentation0
VideoControlNet: A Motion-Guided Video-to-Video Translation Framework by Using Diffusion Model with ControlNet0
Learning Disentangled Discrete RepresentationsCode0
Composite Diffusion | whole >= Σparts0
Fake It Without Making It: Conditioned Face Generation for Accurate 3D Face Reconstruction0
XDLM: Cross-lingual Diffusion Language Model for Machine Translation0
Attribute Regularized Soft Introspective VAE: Towards Cardiac Attribute Regularization Through MRI Domains0
PartDiff: Image Super-resolution with Partial Diffusion Models0
Attention Consistency Refined Masked Frequency Forgery Representation for Generalizing Face Forgery DetectionCode0
Progressive distillation diffusion for raw music generation0
Diffusion Sampling with Momentum for Mitigating Divergence ArtifactsCode0
Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis0
Adversarial Latent Autoencoder with Self-Attention for Structural Image Synthesis0
Text2Layer: Layered Image Generation using Latent Diffusion Model0
A Siamese-based Verification System for Open-set Architecture Attribution of Synthetic ImagesCode0
PromptCrafter: Crafting Text-to-Image Prompt through Mixed-Initiative Dialogue with LLM0
Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation0
Towards Authentic Face Restoration with Iterative Diffusion Models and Beyond0
Conditional 360-degree Image Synthesis for Immersive Indoor Scene Decoration0
Jean-Luc Picard at Touché 2023: Comparing Image Generation, Stance Detection and Feature Matching for Image Retrieval for Arguments0
Survey on Controlable Image Synthesis with Deep Learning0
Creating Image Datasets in Agricultural Environments using DALL.E: Generative AI-Powered Large Language Model0
Unbiased Image Synthesis via Manifold Guidance in Diffusion Models0
Soft Curriculum for Learning Conditional GANs with Noisy-Labeled and Uncurated Unlabeled Data0
Generative adversarial networks for data-scarce spectral applications0
Domain-Agnostic Tuning-Encoder for Fast Personalization of Text-To-Image Models0
Exposing the Fake: Effective Diffusion-Generated Images Detection0
DiffuseGAE: Controllable and High-fidelity Image Manipulation from Disentangled Representation0
Diffusion idea exploration for art generation0
Automatic Generation of Semantic Parts for Face Image SynthesisCode0
DRMC: A Generalist Model with Dynamic Routing for Multi-Center PET Image SynthesisCode0
SAR-NeRF: Neural Radiance Fields for Synthetic Aperture Radar Multi-View Representation0
K-Space-Aware Cross-Modality Score for Synthesized Neuroimage Quality Assessment0
Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback0
Cross-Modality Fourier Feature for Medical Image SynthesisCode0
Score-based Conditional Generation with Fewer Labeled Data by Self-calibrating Classifier Guidance0
DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer0
Measuring the Success of Diffusion Models at Imitating Human Artists0
On the Cultural Gap in Text-to-Image GenerationCode0
Prompting Diffusion Representations for Cross-Domain Semantic SegmentationCode0
SoK: Privacy-Preserving Data Synthesis0
Show:102550
← PrevPage 93 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified