SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

  • Unconditional generation refers to sampling images without any conditioning input, i.e. from $p(y)$, where $y$ is an image.
  • Conditional image generation (subtask) refers to sampling images given a conditioning input such as a label $x$, i.e. from $p(y|x)$.
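
The difference between $p(y)$ and $p(y|x)$ can be illustrated with a minimal sketch. It samples from the empirical distribution of a tiny labeled dataset rather than from a trained generative model, which would learn an approximation to these distributions. The dataset contents and the function names `sample_unconditional` and `sample_conditional` are hypothetical and chosen only for illustration.

```python
import random

# Hypothetical toy dataset of (label x, sample y) pairs.
# The strings stand in for images; nothing here comes from a real benchmark.
DATASET = [
    ("cat", "cat_img_0"),
    ("cat", "cat_img_1"),
    ("dog", "dog_img_0"),
    ("dog", "dog_img_1"),
    ("dog", "dog_img_2"),
]


def sample_unconditional(dataset, rng):
    """Draw y ~ p(y): pick any sample, ignoring its label."""
    _, y = rng.choice(dataset)
    return y


def sample_conditional(dataset, label, rng):
    """Draw y ~ p(y | x = label): pick only among samples with that label."""
    candidates = [y for x, y in dataset if x == label]
    if not candidates:
        raise ValueError(f"no samples with label {label!r}")
    return rng.choice(candidates)


if __name__ == "__main__":
    rng = random.Random(0)  # seeded for repeatable draws
    print("unconditional:", sample_unconditional(DATASET, rng))
    print("conditional on 'cat':", sample_conditional(DATASET, "cat", rng))
```

In this sketch the unconditional sampler can return either a cat or a dog sample, while the conditional sampler only ever returns samples that carry the requested label.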

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.

(Image credit: StyleGAN)

Papers

Showing 5801-5850 of 6689 papers

Title | Status | Hype
Synthesizing New Retinal Symptom Images by Multiple Generative Models | Code | 0
Im-Promptu: In-Context Composition from Image Prompts | Code | 0
Exploring Generative Adversarial Networks for Text-to-Image Generation with Evolution Strategies | Code | 0
Improved ArtGAN for Conditional Synthesis of Natural Image and Artwork | Code | 0
Improved Conditional Flow Models for Molecule to Image Synthesis | Code | 0
Cross-Domain Adversarial Auto-Encoder | Code | 0
Token Pruning for Caching Better: 9 Times Acceleration on Stable Diffusion for Free | Code | 0
Attribute2Image: Conditional Image Generation from Visual Attributes | Code | 0
Exploring Precision and Recall to assess the quality and diversity of LLMs | Code | 0
Deep Generative Models for Distribution-Preserving Lossy Compression | Code | 0
Visual Story Generation Based on Emotion and Keywords | Code | 0
Exploring Sparsity for Parameter Efficient Fine Tuning Using Wavelets | Code | 0
Unsupervised Text Embedding Space Generation Using Generative Adversarial Networks for Text Synthesis | Code | 0
Exploring Text-Guided Single Image Editing for Remote Sensing Images | Code | 0
Vector quantization loss analysis in VQGANs: a single-GPU ablation study for image-to-image synthesis | Code | 0
Tabular GANs for uneven distribution | Code | 0
Improved Modeling of 3D Shapes with Multi-view Depth Maps | Code | 0
ANCHOR: LLM-driven News Subject Conditioning for Text-to-Image Synthesis | Code | 0
Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis | Code | 0
MIGS: Meta Image Generation from Scene Graphs | Code | 0
Image Synthesis with a Single (Robust) Classifier | Code | 0
Shrink-Perturb Improves Architecture Mixing during Population Based Training for Neural Architecture Search | Code | 0
Breast Cancer Immunohistochemical Image Generation: a Benchmark Dataset and Challenge Review | Code | 0
MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO | Code | 0
Perturbation-Assisted Sample Synthesis: A Novel Approach for Uncertainty Quantification | Code | 0
Deep Generative Models Unveil Patterns in Medical Images Through Vision-Language Conditioning | Code | 0
Active Generation for Image Classification | Code | 0
Triangle Generative Adversarial Networks | Code | 0
Learning Modality-Aware Representations: Adaptive Group-wise Interaction Network for Multimodal MRI Synthesis | Code | 0
TorchGAN: A Flexible Framework for GAN Training and Evaluation | Code | 0
Improved Variational Inference with Inverse Autoregressive Flow | Code | 0
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework | Code | 0
Improving Fine-Grained Control via Aggregation of Multiple Diffusion Models | Code | 0
Phase-Based Frame Interpolation for Video | Code | 0
A Prompt Log Analysis of Text-to-Image Generation Systems | Code | 0
Diverse Image Generation with Diffusion Models and Cross Class Label Learning for Polyp Classification | Code | 0
Exposing Image Splicing Traces in Scientific Publications via Uncertainty-guided Refinement | Code | 0
Cascaded Diffusion Models for 2D and 3D Microscopy Image Synthesis to Enhance Cell Segmentation | Code | 0
Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination | Code | 0
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation | Code | 0
MintNet: Building Invertible Neural Networks with Masked Convolutions | Code | 0
MirrorGAN: Learning Text-to-image Generation by Redescription | Code | 0
Improving Compositional Generation with Diffusion Models Using Lift Scores | Code | 0
Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network | Code | 0
MISFIT-V: Misaligned Image Synthesis and Fusion using Information from Thermal and Visual | Code | 0
FaceForensics++: Learning to Detect Manipulated Facial Images | Code | 0
Diffusion-HMC: Parameter Inference with Diffusion-model-driven Hamiltonian Monte Carlo | Code | 0
Analyzing the Feature Extractor Networks for Face Image Synthesis | Code | 0
Improving Diffusion-Based Generative Models via Approximated Optimal Transport | Code | 0
ResVG: Enhancing Relation and Semantic Understanding in Multiple Instances for Visual Grounding | Code | 0
Page 117 of 134

Benchmark Results
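
Most entries in this section report FID (Fréchet Inception Distance), where lower values indicate generated images whose feature statistics are closer to those of real images. For reference, the standard definition is sketched here. The symbols are not part of the original page: $\mu_r, \Sigma_r$ and $\mu_g, \Sigma_g$ are assumed to denote the mean and covariance of Inception features for real and generated images, respectively.

$$
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2 + \operatorname{Tr}\left( \Sigma_r + \Sigma_g - 2 \left( \Sigma_r \Sigma_g \right)^{1/2} \right)
$$

Reported values typically depend on the number of samples and the feature extractor used, so scores obtained under different setups may not be directly comparable.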

# | Model | Metric | Claimed | Verified | Status
1 | Improved DDPM | FID | 12.3 |  | Unverified
2 | ADM | FID | 11.84 |  | Unverified
3 | BigGAN-deep | FID | 8.1 |  | Unverified
4 | Polarity-BigGAN | FID | 6.82 |  | Unverified
5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 |  | Unverified
6 | MaskGIT | FID | 6.18 |  | Unverified
7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 |  | Unverified
8 | CDM | FID | 4.88 |  | Unverified
9 | ADM-G | FID | 4.59 |  | Unverified
10 | RIN | FID | 4.51 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PresGAN | FID | 52.2 |  | Unverified
2 | RESFLOW | FID | 48.29 |  | Unverified
3 | Residual Flow | FID | 46.37 |  | Unverified
4 | GLF+perceptual loss (ours) | FID | 44.6 |  | Unverified
5 | ProdPoly no activation functions | FID | 40.45 |  | Unverified
6 | ProdPoly no activation functions | FID | 36.77 |  | Unverified
7 | ACGAN | FID | 35.47 |  | Unverified
8 | DenseFlow-74-10 | FID | 34.9 |  | Unverified
9 | NVAE w/ flow | FID | 32.53 |  | Unverified
10 | QSNGAN | FID | 31.97 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GLIDE + CLS | FID | 30.87 |  | Unverified
2 | GLIDE + CLIP | FID | 30.46 |  | Unverified
3 | GLIDE + CLS-FREE | FID | 29.22 |  | Unverified
4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 |  | Unverified
5 | PGMGAN | FID | 21.73 |  | Unverified
6 | CLR-GAN | FID | 20.27 |  | Unverified
7 | FM | FID | 14.45 |  | Unverified
8 | CT (Direct Generation, NFE=1) | FID | 13 |  | Unverified
9 | CT (Direct Generation, NFE=2) | FID | 11.1 |  | Unverified
10 | GLIDE +CLS | KID | 7.95 |  | Unverified