SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation draws samples from the data distribution without any conditioning input, i.e. from $p(y)$.
  • Conditional image generation (subtask) draws samples conditioned on a label $x$, i.e. from $p(y|x)$.
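
The difference between the two settings can be illustrated with a toy sketch. It samples from the empirical distribution of a small labelled dataset, so it only returns existing items; a real generative model would learn the distribution and synthesize new images. The dataset contents and the function names `sample_unconditional` and `sample_conditional` are hypothetical, chosen only for illustration.

```python
import random
from collections import defaultdict

# Hypothetical toy dataset of (label x, sample y) pairs.
# Strings stand in for images.
DATASET = [
    ("cat", "cat_img_0"),
    ("cat", "cat_img_1"),
    ("dog", "dog_img_0"),
    ("dog", "dog_img_1"),
    ("dog", "dog_img_2"),
]


def sample_unconditional(dataset, rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, y = rng.choice(dataset)
    return y


def sample_conditional(dataset, label, rng):
    """Draw y ~ p(y | x = label): only samples with that label are eligible."""
    by_label = defaultdict(list)
    for x, y in dataset:
        by_label[x].append(y)
    if label not in by_label:
        raise KeyError(f"no samples with label {label!r}")
    return rng.choice(by_label[label])


if __name__ == "__main__":
    rng = random.Random(0)
    print(sample_unconditional(DATASET, rng))       # any item in the dataset
    print(sample_conditional(DATASET, "cat", rng))  # always a "cat" item
```

In this sketch the unconditional sampler can return any item, while the conditional sampler is restricted to items whose label matches the request.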

This section lists state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 6201-6250 of 6689 papers

| Title | Status | Hype |
| --- | --- | --- |
| Progressive and Selective Fusion Network for High Dynamic Range Imaging | Code | 0 |
| End-to-end Sketch-Guided Path Planning through Imitation Learning for Autonomous Mobile Robots | Code | 0 |
| GD-VDM: Generated Depth for better Diffusion-based Video Generation | Code | 0 |
| Navigating Cultural Chasms: Exploring and Unlocking the Cultural POV of Text-To-Image Models | Code | 0 |
| Navigating the Synthetic Realm: Harnessing Diffusion-based Models for Laparoscopic Text-to-Image Generation | Code | 0 |
| StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation | Code | 0 |
| Batch-Instructed Gradient for Prompt Evolution: Systematic Prompt Optimization for Enhanced Text-to-Image Synthesis | Code | 0 |
| Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator | Code | 0 |
| DiffPAD: Denoising Diffusion-based Adversarial Patch Decontamination | Code | 0 |
| Towards SFW sampling for diffusion models via external conditioning | Code | 0 |
| Towards Small Object Editing: A Benchmark Dataset and A Training-Free Approach | Code | 0 |
| GeneGAN: Learning Object Transfiguration and Attribute Subspace from Unpaired Data | Code | 0 |
| Adv-KD: Adversarial Knowledge Distillation for Faster Diffusion Sampling | Code | 0 |
| Sparse Sinkhorn Attention | Code | 0 |
| Towards the Automatic Anime Characters Creation with Generative Adversarial Networks | Code | 0 |
| Latent Diffusion, Implicit Amplification: Efficient Continuous-Scale Super-Resolution for Remote Sensing Images | Code | 0 |
| NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models | Code | 0 |
| CapsGAN: Using Dynamic Routing for Generative Adversarial Networks | Code | 0 |
| UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation | Code | 0 |
| Adversarial Training of Variational Auto-encoders for High Fidelity Image Generation | Code | 0 |
| Generate, Segment and Refine: Towards Generic Manipulation Segmentation | Code | 0 |
| Swin DiT: Diffusion Transformer using Pseudo Shifted Windows | Code | 0 |
| Latent Feature and Attention Dual Erasure Attack against Multi-View Diffusion Models for 3D Assets Protection | Code | 0 |
| Latent Flow Transformer | Code | 0 |
| Generating Annotated High-Fidelity Images Containing Multiple Coherent Objects | Code | 0 |
| Scaling up GANs for Text-to-Image Synthesis | Code | 0 |
| LatentKeypointGAN: Controlling Images via Latent Keypoints | Code | 0 |
| DAMM-Diffusion: Learning Divergence-Aware Multi-Modal Diffusion Model for Nanoparticles Distribution Prediction | Code | 0 |
| Zero-Shot Adaptation for Approximate Posterior Sampling of Diffusion Models in Inverse Problems | Code | 0 |
| When Relation Networks meet GANs: Relation GANs with Triplet Loss | Code | 0 |
| Generating Gameplay-Relevant Art Assets with Transfer Learning | Code | 0 |
| Disentangling Mean Embeddings for Better Diagnostics of Image Generators | Code | 0 |
| A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints | Code | 0 |
| Generating Illustrated Instructions | Code | 0 |
| ProjectedEx: Enhancing Generation in Explainable AI for Prostate Cancer | Code | 0 |
| Generating Images from Captions with Attention | Code | 0 |
| Spatially Constrained GAN for Face and Fashion Synthesis | Code | 0 |
| Generating Images of the M87* Black Hole Using GANs | Code | 0 |
| Denoising and Regularization via Exploiting the Structural Bias of Convolutional Generators | Code | 0 |
| Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models | Code | 0 |
| Generating Images with Perceptual Similarity Metrics based on Deep Networks | Code | 0 |
| 3-D PET Image Generation with tumour masks using TGAN | Code | 0 |
| Generating Multiple Objects at Spatially Distinct Locations | Code | 0 |
| Latent Space Factorisation and Manipulation via Matrix Subspace Projection | Code | 0 |
| Textual Aesthetics in Large Language Models | Code | 0 |
| Generating Realistic Forehead-Creases for User Verification via Conditioned Piecewise Polynomial Curves | Code | 0 |
| Latent Space is Feature Space: Regularization Term for GANs Training on Limited Dataset | Code | 0 |
| Promptable Counterfactual Diffusion Model for Unified Brain Tumor Segmentation and Generation with MRIs | Code | 0 |
| Generating Steganographic Images via Adversarial Training | Code | 0 |
| Generating Synthetic Data for Text Recognition | Code | 0 |

Benchmark Results

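| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Improved DDPM | FID | 12.3 | | Unverified |
| 2 | ADM | FID | 11.84 | | Unverified |
| 3 | BigGAN-deep | FID | 8.1 | | Unverified |
| 4 | Polarity-BigGAN | FID | 6.82 | | Unverified |
| 5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified |
| 6 | MaskGIT | FID | 6.18 | | Unverified |
| 7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified |
| 8 | CDM | FID | 4.88 | | Unverified |
| 9 | ADM-G | FID | 4.59 | | Unverified |
| 10 | RIN | FID | 4.51 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PresGAN | FID | 52.2 | | Unverified |
| 2 | RESFLOW | FID | 48.29 | | Unverified |
| 3 | Residual Flow | FID | 46.37 | | Unverified |
| 4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified |
| 5 | ProdPoly no activation functions | FID | 40.45 | | Unverified |
| 6 | ProdPoly no activation functions | FID | 36.77 | | Unverified |
| 7 | ACGAN | FID | 35.47 | | Unverified |
| 8 | DenseFlow-74-10 | FID | 34.9 | | Unverified |
| 9 | NVAE w/ flow | FID | 32.53 | | Unverified |
| 10 | QSNGAN | FID | 31.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GLIDE + CLS | FID | 30.87 | | Unverified |
| 2 | GLIDE + CLIP | FID | 30.46 | | Unverified |
| 3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified |
| 4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified |
| 5 | PGMGAN | FID | 21.73 | | Unverified |
| 6 | CLR-GAN | FID | 20.27 | | Unverified |
| 7 | FM | FID | 14.45 | | Unverified |
| 8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified |
| 9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified |
| 10 | GLIDE + CLS | KID | 7.95 | | Unverified |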
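
Most entries above report FID (Fréchet Inception Distance), where lower values are better. FID is the Fréchet distance between two Gaussians fitted to feature vectors of real and generated images: $\lVert \mu_a - \mu_b \rVert^2 + \mathrm{Tr}(\Sigma_a + \Sigma_b - 2(\Sigma_a \Sigma_b)^{1/2})$. The sketch below is a deliberately simplified version and makes two assumptions: the features are already extracted (standard FID takes them from an Inception network, which is not done here), and the covariances are treated as diagonal, so the trace term reduces to a sum of squared differences of per-dimension standard deviations. The function name `frechet_distance_diag` is hypothetical, and the result will not match FID scores computed with full covariances.

```python
import math
from statistics import fmean, pvariance


def frechet_distance_diag(feats_a, feats_b):
    """Fréchet distance between two diagonal-covariance Gaussians.

    feats_a, feats_b: non-empty lists of equal-length feature vectors.
    Simplifying assumption: off-diagonal covariance terms are ignored,
    so each dimension contributes (mu_a - mu_b)^2 + (sd_a - sd_b)^2.
    """
    dims = len(feats_a[0])
    total = 0.0
    for d in range(dims):
        col_a = [f[d] for f in feats_a]
        col_b = [f[d] for f in feats_b]
        mu_a, mu_b = fmean(col_a), fmean(col_b)
        # Population variance is used here for simplicity.
        sd_a = math.sqrt(pvariance(col_a, mu_a))
        sd_b = math.sqrt(pvariance(col_b, mu_b))
        total += (mu_a - mu_b) ** 2 + (sd_a - sd_b) ** 2
    return total


if __name__ == "__main__":
    real = [[0.0, 1.0], [2.0, 3.0]]
    fake = [[1.0, 1.0], [3.0, 3.0]]  # first dimension shifted by 1
    print(frechet_distance_diag(real, real))
    print(frechet_distance_diag(real, fake))
```

Identical feature sets give a distance of zero, and shifting one feature dimension by a constant adds the square of that shift.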