SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 58015850 of 6689 papers

TitleStatusHype
Coarse-to-Fine Gaze Redirection with Numerical and Pictorial GuidanceCode0
Text-Guided Neural Image InpaintingCode1
Semantic Image Manipulation Using Scene GraphsCode1
Multimodal Image Synthesis with Conditional Implicit Maximum Likelihood EstimationCode1
Evolving Normalization-Activation LayersCode1
Rethinking Spatially-Adaptive Normalization0
GANSpace: Discovering Interpretable GAN ControlsCode2
Feature Quantization Improves GAN TrainingCode1
Discriminator Contrastive Divergence: Semi-Amortized Generative Modeling by Exploring Energy of the DiscriminatorCode0
Structural-analogy from a Single Image PairCode1
Theoretical Insights into the Use of Structural Similarity Index In Generative Models and Inferential Autoencoders0
Open Domain Dialogue Generation with Latent Images0
A New Dataset and Boundary-Attention Semantic Segmentation for Face Parsing0
Bodies at Rest: 3D Human Pose and Shape Estimation from a Pressure Image using Synthetic DataCode1
Object-Centric Image Generation with Factored Depths, Locations, and Appearances0
Synthesis and Edition of Ultrasound Images via Sketch Guided Progressive Growing GANs0
Edge Guided GANs with Contrastive Learning for Semantic Image SynthesisCode1
Learning Generative Models of Tissue Organization with Supervised GANsCode0
Pathological Retinal Region Segmentation From OCT Images Using Geometric Relation Based Augmentation0
Lesion Conditional Image Generation for Improved Segmentation of Intracranial Hemorrhage from CT Images0
Semantically Multi-modal Image SynthesisCode1
Learning Implicit Surface Light FieldsCode2
Controllable Person Image Synthesis with Attribute-Decomposed GANCode1
Cycle Text-To-Image GAN with BERTCode1
StrokeCoder: Path-Based Image Generation from Single Examples using Transformers0
Image Generation Via Minimizing Fréchet Distance in Discriminator Feature SpaceCode0
BachGAN: High-Resolution Image Synthesis from Salient Object LayoutCode1
Improved Techniques for Training Single-Image GANsCode1
Learning Layout and Style Reconfigurable GANs for Controllable Image SynthesisCode1
UnrealText: Synthesizing Realistic Scene Text Images from the Unreal WorldCode1
Toward Accurate and Realistic Virtual Try-on Through Shape Matching and Multiple Warps0
GAN Compression: Efficient Architectures for Interactive Conditional GANsCode2
A Content Transformation Block For Image Style TransferCode1
Synthesize then Compare: Detecting Failures and Anomalies for Semantic SegmentationCode1
SwapText: Image Based Texts Transfer in Scenes0
Blur, Noise, and Compression Robust Generative Adversarial Networks0
Object-Centric Image Generation from Layouts0
BigGAN-based Bayesian reconstruction of natural images from human brain activity0
Semantic Pyramid for Image GenerationCode1
Efficient Content-Based Sparse Attention with Routing TransformersCode1
Your GAN is Secretly an Energy-based Model and You Should use Discriminator Driven Latent SamplingCode0
Deep Domain-Adversarial Image Generation for Domain Generalisation0
Generalized Energy Based ModelsCode1
Π-nets: Deep Polynomial Neural NetworksCode1
MatchingGAN: Matching-based Few-shot Image GenerationCode1
Anysize GAN: A solution to the image-warping problem0
Hierarchical Modes Exploring in Generative Adversarial Networks0
GANwriting: Content-Conditioned Generation of Styled Handwritten Word ImagesCode1
SketchyCOCO: Image Generation from Freehand Scene SketchesCode1
Unlimited Resolution Image Generation with R2D2-GANs0
Show:102550
← PrevPage 117 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified