SOTAVerified

Image Generation

Image generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

  • Unconditional generation refers to sampling new images from the data distribution without any conditioning input, i.e. $p(y)$.
  • Conditional image generation (subtask) refers to sampling new images given a conditioning input such as a label $x$, i.e. $p(y|x)$.
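
The difference between $p(y)$ and $p(y|x)$ can be illustrated with a deliberately tiny sketch. Everything here is an assumption made for illustration: each "image" is reduced to a single brightness value, the dataset `DATA` and its labels are hypothetical, and a fitted Gaussian stands in for a real generative model.

```python
import random
import statistics

# Hypothetical toy dataset of (label x, "image" y) pairs. Each image is
# reduced to one brightness value purely for illustration.
DATA = [
    ("cat", 0.20), ("cat", 0.25), ("cat", 0.30),
    ("dog", 0.70), ("dog", 0.75), ("dog", 0.80),
]

def fit_gaussian(values):
    """Return (mean, population standard deviation) of a list of floats."""
    return statistics.mean(values), statistics.pstdev(values)

def sample_unconditional(data, rng):
    """Draw y ~ p(y): labels are ignored when fitting the model."""
    mu, sigma = fit_gaussian([y for _, y in data])
    return rng.gauss(mu, sigma)

def sample_conditional(data, label, rng):
    """Draw y ~ p(y|x): fit only on the images whose label equals x."""
    mu, sigma = fit_gaussian([y for x, y in data if x == label])
    return rng.gauss(mu, sigma)

rng = random.Random(0)  # fixed seed so the sketch is reproducible
unconditional = [sample_unconditional(DATA, rng) for _ in range(1000)]
cats = [sample_conditional(DATA, "cat", rng) for _ in range(1000)]
dogs = [sample_conditional(DATA, "dog", rng) for _ in range(1000)]
```

In this toy setting the unconditional samples spread across both classes, while the conditional samples concentrate around the brightness of the requested label.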

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.


Papers

Showing 3751-3800 of 6689 papers

| Title | Status | Hype |
|---|---|---|
| Hand1000: Generating Realistic Hands from Text with Only 1,000 Images | | 0 |
| HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances | | 0 |
| Handling Data Heterogeneity with Generative Replay in Collaborative Learning for Medical Imaging | | 0 |
| Hand-Object Interaction Image Generation | | 0 |
| HanDrawer: Leveraging Spatial Information to Render Realistic Hands Using a Conditional Diffusion Model in Single Stage | | 0 |
| Happy People -- Image Synthesis as Black-Box Optimization Problem in the Discrete Latent Space of Deep Generative Models | | 0 |
| Harmonizing Maximum Likelihood with GANs for Multimodal Conditional Generation | | 0 |
| Harnessing Caption Detailness for Data-Efficient Text-to-Image Generation | | 0 |
| The Infinite Index: Information Retrieval on Generative Text-To-Image Models | | 0 |
| Harnessing Global-Local Collaborative Adversarial Perturbation for Anti-Customization | | 0 |
| Creating Image Datasets in Agricultural Environments using DALL.E: Generative AI-Powered Large Language Model | | 0 |
| Voice command generation using Progressive Wavegans | | 0 |
| AutoLoss: Learning Discrete Schedule for Alternate Optimization | | 0 |
| Harvesting Visual Objects from Internet Images via Deep Learning Based Objectness Assessment | | 0 |
| Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability | | 0 |
| Head and Neck Tumor Segmentation from [18F]F-FDG PET/CT Images Based on 3D Diffusion Model | | 0 |
| HeadRouter: A Training-free Image Editing Framework for MM-DiTs by Adaptively Routing Attention Heads | | 0 |
| Voltage-Controlled Magnetoelectric Devices for Neuromorphic Diffusion Process | | 0 |
| Heat Death of Generative Models in Closed-Loop Learning | | 0 |
| HepatoGEN: Generating Hepatobiliary Phase MRI with Perceptual and Adversarial Models | | 0 |
| Heredity-aware Child Face Image Generation with Latent Space Disentanglement | | 0 |
| HessianFR: An Efficient Hessian-based Follow-the-Ridge Algorithm for Minimax Optimization | | 0 |
| Heuristics for Image Generation from Scene Graphs | | 0 |
| Autoencoding Video Latents for Adversarial Video Generation | | 0 |
| Hide and Seek: How Does Watermarking Impact Face Recognition? | | 0 |
| HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models | | 0 |
| Autoencoding Labeled Interpolator, Inferring Parameters From Image, And Image From Parameters | | 0 |
| The Male CEO and the Female Assistant: Evaluation and Mitigation of Gender Biases in Text-To-Image Generation of Dual Subjects | | 0 |
| The Myth of Culturally Agnostic AI Models | | 0 |
| Hierarchical Diffusion Autoencoders and Disentangled Image Manipulation | | 0 |
| The Neural Painter: Multi-Turn Image Generation | | 0 |
| Theoretical Insights for Diffusion Guidance: A Case Study for Gaussian Mixture Models | | 0 |
| Hierarchical Modes Exploring in Generative Adversarial Networks | | 0 |
| Theoretical Insights into the Use of Structural Similarity Index In Generative Models and Inferential Autoencoders | | 0 |
| Hierarchical Vision-Language Alignment for Text-to-Image Generation via Diffusion Models | | 0 |
| HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models | | 0 |
| High-Fidelity Diffusion-based Image Editing | | 0 |
| High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | | 0 |
| High-Fidelity Guided Image Synthesis with Latent Diffusion Models | | 0 |
| High-Fidelity Image Compression with Score-based Generative Models | | 0 |
| High-Fidelity Image Generation With Fewer Labels | | 0 |
| High-Fidelity Image Synthesis from Pulmonary Nodule Lesion Maps using Semantic Diffusion Model | | 0 |
| The Poisson Midpoint Method for Langevin Dynamics: Provably Efficient Discretization for Diffusion Models | | 0 |
| The power of pictures: using ML assisted image generation to engage the crowd in complex socioscientific problems | | 0 |
| TherapyView: Visualizing Therapy Sessions with Temporal Topic Modeling and AI-Generated Arts | | 0 |
| Highly accelerated MR parametric mapping by undersampling the k-space and reducing the contrast number simultaneously with deep learning | | 0 |
| AUTM Flow: Atomic Unrestricted Time Machine for Monotonic Normalizing Flows | | 0 |
| Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion | | 0 |
| High Noise Scheduling is a Must | | 0 |
| High Precision Score-based Diffusion Models | | 0 |

Benchmark Results

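| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Improved DDPM | FID | 12.3 | | Unverified |
| 2 | ADM | FID | 11.84 | | Unverified |
| 3 | BigGAN-deep | FID | 8.1 | | Unverified |
| 4 | Polarity-BigGAN | FID | 6.82 | | Unverified |
| 5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified |
| 6 | MaskGIT | FID | 6.18 | | Unverified |
| 7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified |
| 8 | CDM | FID | 4.88 | | Unverified |
| 9 | ADM-G | FID | 4.59 | | Unverified |
| 10 | RIN | FID | 4.51 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PresGAN | FID | 52.2 | | Unverified |
| 2 | RESFLOW | FID | 48.29 | | Unverified |
| 3 | Residual Flow | FID | 46.37 | | Unverified |
| 4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified |
| 5 | ProdPoly no activation functions | FID | 40.45 | | Unverified |
| 6 | ProdPoly no activation functions | FID | 36.77 | | Unverified |
| 7 | ACGAN | FID | 35.47 | | Unverified |
| 8 | DenseFlow-74-10 | FID | 34.9 | | Unverified |
| 9 | NVAE w/ flow | FID | 32.53 | | Unverified |
| 10 | QSNGAN | FID | 31.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GLIDE + CLS | FID | 30.87 | | Unverified |
| 2 | GLIDE + CLIP | FID | 30.46 | | Unverified |
| 3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified |
| 4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified |
| 5 | PGMGAN | FID | 21.73 | | Unverified |
| 6 | CLR-GAN | FID | 20.27 | | Unverified |
| 7 | FM | FID | 14.45 | | Unverified |
| 8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified |
| 9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified |
| 10 | GLIDE +CLS | KID | 7.95 | | Unverified |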
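
Most entries above report FID (Fréchet Inception Distance), which compares Gaussians fitted to feature embeddings of real and generated images; lower values indicate closer distributions. The sketch below is a simplified illustration only, not the evaluation code behind any leaderboard entry. It assumes the feature vectors are already extracted (standard FID uses Inception network activations), restricts the covariances to be diagonal so that no matrix square root is needed, and uses population variance. The helper names `feature_stats` and `frechet_distance_diag` are hypothetical.

```python
import math

def feature_stats(features):
    """Per-dimension mean and population variance of a list of feature vectors."""
    n = len(features)
    dims = range(len(features[0]))
    mu = [sum(f[d] for f in features) / n for d in dims]
    var = [sum((f[d] - mu[d]) ** 2 for f in features) / n for d in dims]
    return mu, var

def frechet_distance_diag(mu1, var1, mu2, var2):
    """Frechet distance between N(mu1, diag(var1)) and N(mu2, diag(var2)).

    Full FID uses ||mu1 - mu2||^2 + Tr(S1 + S2 - 2 (S1 S2)^(1/2)); with
    diagonal covariances the trace term reduces to a per-dimension sum.
    """
    mean_term = sum((a - b) ** 2 for a, b in zip(mu1, mu2))
    cov_term = sum(v1 + v2 - 2.0 * math.sqrt(v1 * v2)
                   for v1, v2 in zip(var1, var2))
    return mean_term + cov_term

# Hypothetical 2-D "features" standing in for real network activations.
real = [[0.0, 1.0], [2.0, 3.0]]
fake = [[1.0, 1.0], [3.0, 3.0]]

distance = frechet_distance_diag(*feature_stats(real), *feature_stats(fake))
same = frechet_distance_diag(*feature_stats(real), *feature_stats(real))
```

Here the two feature sets have identical variances and differ only by a shift of 1 in the first dimension, so the distance comes entirely from the mean term; comparing a set with itself gives zero.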