SOTAVerified

Image Generation

Image Generation (synthesis) is the task of generating new images from an existing dataset.

  • Unconditional generation refers to generating samples unconditionally from the dataset, i.e. $p(y)$
  • Conditional image generation (subtask) refers to generating samples conditionally from the dataset, based on a label, i.e. $p(y|x)$.

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation, and other types of image generations, refer to the subtasks.

( Image credit: StyleGAN )

Papers

Showing 33013350 of 6689 papers

TitleStatusHype
Learning of Inter-Label Geometric Relationships Using Self-Supervised Learning: Application To Gleason Grade Segmentation0
Towards Visual Text Design Transfer Across Languages0
Learning on Less: Constraining Pre-trained Model Learning for Generalizable Diffusion-Generated Image Detection0
Learning Priors for Adversarial Autoencoders0
Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning0
Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation0
WAYLA - Generating Images from Eye Movements0
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation0
Learning Spatial Pyramid Attentive Pooling in Image Synthesis and Image-to-Image Translation0
Learning Spatial Relationships between Samples of Patent Image Shapes0
Variational Conditional GAN for Fine-grained Controllable Image Generation0
Learning the Visualness of Text Using Large Vision-Language Models0
Learning to Detect Fake Face Images in the Wild0
TraceMark-LDM: Authenticatable Watermarking for Latent Diffusion Models via Binary-Guided Rearrangement0
Learning to Evaluate the Artness of AI-generated Images0
Learning to Find Missing Video Frames with Synthetic Data Augmentation: A General Framework and Application in Generating Thermal Images Using RGB Cameras0
Learning to Generate Images of Outdoor Scenes from Attributes and Semantic Layouts0
Learning to Generate Images with Perceptual Similarity Metrics0
Tracing the Roots: Leveraging Temporal Dynamics in Diffusion Trajectories for Origin Attribution0
Tracking in Aerial Hyperspectral Videos using Deep Kernelized Correlation Filters0
Learning To Memorize Feature Hallucination for One-Shot Image Generation0
Tracking multiple moving objects in images using Markov Chain Monte Carlo0
Tractable loss function and color image generation of multinary restricted Boltzmann machine0
Absorb and Converge: Provable Convergence Guarantee for Absorbing Discrete Diffusion Models0
Learning to Segment Brain Anatomy from 2D Ultrasound with Less Data0
Traditional Classification Neural Networks are Good Generators: They are Competitive with DDPMs and GANs0
Learning Universal Policies via Text-Guided Video Generation0
Learning Unnormalized Statistical Models via Compositional Optimization0
Learning Versatile 3D Shape Generation with Improved AR Models0
Learning Versatile 3D Shape Generation with Improved Auto-regressive Models0
Learning What and Where to Draw0
LEDiff: Latent Exposure Diffusion for HDR Generation0
LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance0
Lesion Conditional Image Generation for Improved Segmentation of Intracranial Hemorrhage from CT Images0
Less is More: Unsupervised Mask-guided Annotated CT Image Synthesis with Minimum Manual Segmentations0
Weakly Supervised Annotations for Multi-modal Greeting Cards Dataset0
A Self-attention Guided Multi-scale Gradient GAN for Diversified X-ray Image Synthesis0
Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding0
Let's Verify and Reinforce Image Generation Step by Step0
Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation0
Density-aware Haze Image Synthesis by Self-Supervised Content-Style Disentanglement0
Leveraging Generative AI Models to Explore Human Identity0
Weakly Supervised Keypoint Discovery0
Leveraging Previous Steps: A Training-free Fast Solver for Flow Diffusion0
Weakly-Supervised Photo-realistic Texture Generation for 3D Face Reconstruction0
Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models0
Leveraging Text-to-Image Generation for Handling Spurious Correlation0
Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency0
Leveraging Vision-Language Foundation Models to Reveal Hidden Image-Attribute Relationships in Medical Imaging0
Leveraging Visual Question Answering to Improve Text-to-Image Synthesis0
Show:102550
← PrevPage 67 of 134Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Improved DDPMFID12.3Unverified
2ADMFID11.84Unverified
3BigGAN-deepFID8.1Unverified
4Polarity-BigGANFID6.82Unverified
5VQGAN+Transformer (k=mixed, p=1.0, a=0.005)FID6.59Unverified
6MaskGITFID6.18Unverified
7VQGAN+Transformer (k=600, p=1.0, a=0.05)FID5.2Unverified
8CDMFID4.88Unverified
9ADM-GFID4.59Unverified
10RINFID4.51Unverified
#ModelMetricClaimedVerifiedStatus
1PresGANFID52.2Unverified
2RESFLOWFID48.29Unverified
3Residual FlowFID46.37Unverified
4GLF+perceptual loss (ours)FID44.6Unverified
5ProdPoly no activation functionsFID40.45Unverified
6ProdPoly no activation functionsFID36.77Unverified
7ACGANFID35.47Unverified
8DenseFlow-74-10FID34.9Unverified
9NVAE w/ flowFID32.53Unverified
10QSNGANFID31.97Unverified
#ModelMetricClaimedVerifiedStatus
1GLIDE + CLSFID30.87Unverified
2GLIDE + CLIPFID30.46Unverified
3GLIDE + CLS-FREEFID29.22Unverified
4GLIDE + CLIP + CLS + CLS-FREEFID29.18Unverified
5PGMGANFID21.73Unverified
6CLR-GANFID20.27Unverified
7FMFID14.45Unverified
8CT (Direct Generation, NFE=1)FID13Unverified
9CT (Direct Generation, NFE=2)FID11.1Unverified
10GLIDE +CLSKID7.95Unverified