SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 30513100 of 6689 papers

TitleStatusHype
Enhancing Image Generation Fidelity via Progressive PromptsCode0
Enhancing GANs with MMD Neural Architecture Search, PMish Activation Function, and Adaptive Rank DecompositionCode0
Improving Compositional Generation with Diffusion Models Using Lift ScoresCode0
Improving Explicit Spatial Relationships in Text-to-Image Generation through an Automatically Derived DatasetCode0
Improving MMD-GAN Training with Repulsive Loss FunctionCode0
Enhancing GAN Performance through Neural Architecture Search and Tensor DecompositionCode0
Improved Variational Inference with Inverse Autoregressive FlowCode0
ZstGAN: An Adversarial Approach for Unsupervised Zero-Shot Image-to-Image TranslationCode0
Enhancing Diffusion Models Efficiency by Disentangling Total-Variance and Signal-to-Noise RatioCode0
Indonesian Text-to-Image Synthesis with Sentence-BERT and FastGANCode0
Enhancing Diffusion-Based Image Synthesis with Robust Classifier GuidanceCode0
Image Synthesis with a Single (Robust) ClassifierCode0
Improved ArtGAN for Conditional Synthesis of Natural Image and ArtworkCode0
Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based DiscriminationCode0
Improved Conditional Flow Models for Molecule to Image SynthesisCode0
Enhancing Conditional Image Generation with Explainable Latent Space ManipulationCode0
Implicit Generative CopulasCode0
Cross-Modality Neuroimage Synthesis: A SurveyCode0
Design a Delicious Lunchbox in StyleCode0
Hyperspectral Blind Unmixing using a Double Deep Image PriorCode0
Implicit Inversion turns CLIP into a DecoderCode0
Im-Promptu: In-Context Composition from Image PromptsCode0
Implicit competitive regularization in GANsCode0
Implementing Adaptations for Vision AutoRegressive ModelCode0
Implementing and Experimenting with Diffusion Models for Text-to-Image GenerationCode0
Implicit Dynamical Flow Fusion (IDFF) for Generative ModelingCode0
Composition and Deformance: Measuring Imageability with a Text-to-Image ModelCode0
IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image ModelsCode0
Implicit Generation and Modeling with Energy Based ModelsCode0
Image Translation for Medical Image Generation -- Ischemic Stroke LesionsCode0
Composite Functional Gradient Learning of Generative Adversarial ModelsCode0
Energy-Based Prior Latent Space Diffusion model for Reconstruction of Lumbar Vertebrae from Thick Slice MRICode0
Image Synthesis in Multi-Contrast MRI with Conditional Generative Adversarial NetworksCode0
Energy-Calibrated VAE with Test Time Free LunchCode0
End-to-end Sketch-Guided Path Planning through Imitation Learning for Autonomous Mobile RobotsCode0
A Gauss-Newton Approach for Min-Max Optimization in Generative Adversarial NetworksCode0
Images Speak Volumes: User-Centric Assessment of Image Generation for Accessible CommunicationCode0
Image Style Transfer Using Convolutional Neural NetworksCode0
A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraintsCode0
SketchFlex: Facilitating Spatial-Semantic Coherence in Text-to-Image Generation with Region-Based SketchesCode0
DRAN: Detailed Region-Adaptive Normalization for Conditional Image SynthesisCode0
Imaginative Walks: Generative Random Walk Deviation Loss for Improved Unseen Learning RepresentationCode0
Improved Modeling of 3D Shapes with Multi-view Depth MapsCode0
ComposeAnyone: Controllable Layout-to-Human Generation with Decoupled Multimodal ConditionsCode0
Image Inpainting via Tractable Steering of Diffusion ModelsCode0
Image Processing Using Multi-Code GAN PriorCode0
Complexity Control Facilitates Reasoning-Based Compositional Generalization in TransformersCode0
Image Generation Via Minimizing Fréchet Distance in Discriminator Feature SpaceCode0
Complex Image Generation SwinTransformer Network for Audio DenoisingCode0
Image Generation from Sketch Constraint Using Contextual GANCode0
Show:102550
← PrevPage 62 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified