SOTAVerified

Image Generation

Image generation (synthesis) is the task of generating new images that resemble those in an existing dataset.

  • Unconditional generation draws samples from the data distribution without any conditioning input, i.e. from $p(y)$.
  • Conditional image generation (subtask) draws samples conditioned on an input such as a label $x$, i.e. from $p(y|x)$.
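
The distinction between $p(y)$ and $p(y|x)$ can be illustrated with a toy sketch. Nothing here is a real generative model: the `DATASET` list, the string "images", and both function names are hypothetical stand-ins, and sampling is done by simply picking from the empirical data.

```python
import random

# Hypothetical toy "dataset" of (label x, image y) pairs.
# Strings stand in for images; a real model would learn a distribution instead.
DATASET = [
    ("cat", "cat_img_0"),
    ("cat", "cat_img_1"),
    ("dog", "dog_img_0"),
    ("dog", "dog_img_1"),
    ("dog", "dog_img_2"),
]


def sample_unconditional(rng):
    """Draw y ~ p(y): the label is ignored entirely."""
    _, y = rng.choice(DATASET)
    return y


def sample_conditional(rng, x):
    """Draw y ~ p(y | x): only examples whose label equals x are eligible."""
    candidates = [y for label, y in DATASET if label == x]
    if not candidates:
        raise ValueError(f"no examples with label {x!r}")
    return rng.choice(candidates)


rng = random.Random(0)
print(sample_unconditional(rng))       # any image, cat or dog
print(sample_conditional(rng, "cat"))  # always one of the cat images
```

In the unconditional case the caller has no control over what comes out; in the conditional case the label narrows the distribution being sampled.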

In this section, you can find state-of-the-art leaderboards for unconditional generation. For conditional generation and other types of image generation, refer to the subtasks.


Papers

Showing 3501–3550 of 6689 papers

| Title | Status | Hype |
|---|---|---|
| IPAdapter-Instruct: Resolving Ambiguity in Image-based Conditioning using Instruct Prompts | | 0 |
| Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey | | 0 |
| Pre-trained Encoder Inference: Revealing Upstream Encoders In Downstream Machine Learning Services | Code | 0 |
| Step Saver: Predicting Minimum Denoising Steps for Diffusion Model Image Generation | | 0 |
| LDFaceNet: Latent Diffusion-based Network for High-Fidelity Deepfake Generation | | 0 |
| PanoFree: Tuning-Free Holistic Multi-view Image Generation with Cross-view Self-Guidance | | 0 |
| A Novel Evaluation Framework for Image2Text Generation | | 0 |
| SkyDiffusion: Ground-to-Aerial Image Synthesis with Diffusion Models and BEV Paradigm | | 0 |
| EIUP: A Training-Free Approach to Erase Non-Compliant Concepts Conditioned on Implicit Unsafe Prompts | | 0 |
| Temporal Evolution of Knee Osteoarthritis: A Diffusion-based Morphing Model for X-ray Medical Image Synthesis | | 0 |
| A new approach for encoding code and assisting code understanding | | 0 |
| A Simple Background Augmentation Method for Object Detection with Diffusion Model | | 0 |
| Synthetic dual image generation for reduction of labeling efforts in semantic segmentation of micrographs with a customized metric function | | 0 |
| Deformable 3D Shape Diffusion Model | | 0 |
| Fine-gained Zero-shot Video Sampling | | 0 |
| Vulnerabilities in AI-generated Image Detection: The Challenge of Adversarial Attacks | | 0 |
| Enhancing Quantitative Image Synthesis through Pretraining and Resolution Scaling for Bone Mineral Density Estimation from a Plain X-ray Image | Code | 0 |
| Retinex-Diffusion: On Controlling Illumination Conditions in Diffusion Models via Retinex Theory | | 0 |
| MaskInversion: Localized Embeddings via Optimization of Explainability Maps | | 0 |
| Reproducibility Study of "ITI-GEN: Inclusive Text-to-Image Generation" | Code | 0 |
| VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary | Code | 0 |
| Artificial Immunofluorescence in a Flash: Rapid Synthetic Imaging from Brightfield Through Residual Diffusion | | 0 |
| Guided Latent Slot Diffusion for Object-Centric Learning | | 0 |
| ViPer: Visual Personalization of Generative Models via Individual Preference Learning | | 0 |
| Utilizing Generative Adversarial Networks for Image Data Augmentation and Classification of Semiconductor Wafer Dicing Induced Defects | | 0 |
| Synthetic Trajectory Generation Through Convolutional Neural Networks | Code | 0 |
| Artistic Intelligence: A Diffusion-Based Framework for High-Fidelity Landscape Painting Synthesis | | 0 |
| Adaptive Gradient Regularization: A Faster and Generalizable Optimization Technique for Deep Neural Networks | | 0 |
| On Differentially Private 3D Medical Image Synthesis with Controllable Latent Diffusion Models | Code | 0 |
| MedEdit: Counterfactual Diffusion-based Image Editing on Brain MRI | | 0 |
| LSReGen: Large-Scale Regional Generator via Backward Guidance Framework | | 0 |
| Distilling Vision-Language Foundation Models: A Data-Free Approach via Prompt Diversification | | 0 |
| Variational Potential Flow: A Novel Probabilistic Framework for Energy-Based Generative Modelling | | 0 |
| ∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions | Code | 0 |
| GreenStableYolo: Optimizing Inference Time and Image Quality of Text-to-Image Generation | Code | 0 |
| Diffusion Models as Data Mining Tools | | 0 |
| CoCoG-2: Controllable generation of visual stimuli for understanding human concept representation | Code | 0 |
| Latent Pollution Model: The Hidden Carbon Footprint in 3D Image Synthesis | | 0 |
| Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model | | 0 |
| Are handcrafted filters helpful for attributing AI-generated images? | | 0 |
| Controllable and Efficient Multi-Class Pathology Nuclei Data Augmentation using Text-Conditioned Diffusion Models | | 0 |
| Time Series Generative Learning with Application to Brain Imaging Analysis | | 0 |
| Image Inpainting Models are Effective Tools for Instruction-guided Image Editing | | 0 |
| SUSTechGAN: Image Generation for Object Detection in Adverse Conditions of Autonomous Driving | Code | 0 |
| URCDM: Ultra-Resolution Image Synthesis in Histopathology | Code | 0 |
| Safe-SD: Safe and Traceable Stable Diffusion with Text Prompt Trigger for Invisible Generative Watermarking | | 0 |
| Voltage-Controlled Magnetoelectric Devices for Neuromorphic Diffusion Process | | 0 |
| From Principles to Practices: Lessons Learned from Applying Partnership on AI's (PAI) Synthetic Media Framework to 11 Use Cases | | 0 |
| GeoGuide: Geometric guidance of diffusion models | Code | 0 |
| The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation | | 0 |

Benchmark Results

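| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Improved DDPM | FID | 12.3 | | Unverified |
| 2 | ADM | FID | 11.84 | | Unverified |
| 3 | BigGAN-deep | FID | 8.1 | | Unverified |
| 4 | Polarity-BigGAN | FID | 6.82 | | Unverified |
| 5 | VQGAN+Transformer (k=mixed, p=1.0, a=0.005) | FID | 6.59 | | Unverified |
| 6 | MaskGIT | FID | 6.18 | | Unverified |
| 7 | VQGAN+Transformer (k=600, p=1.0, a=0.05) | FID | 5.2 | | Unverified |
| 8 | CDM | FID | 4.88 | | Unverified |
| 9 | ADM-G | FID | 4.59 | | Unverified |
| 10 | RIN | FID | 4.51 | | Unverified |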
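
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PresGAN | FID | 52.2 | | Unverified |
| 2 | RESFLOW | FID | 48.29 | | Unverified |
| 3 | Residual Flow | FID | 46.37 | | Unverified |
| 4 | GLF+perceptual loss (ours) | FID | 44.6 | | Unverified |
| 5 | ProdPoly no activation functions | FID | 40.45 | | Unverified |
| 6 | ProdPoly no activation functions | FID | 36.77 | | Unverified |
| 7 | ACGAN | FID | 35.47 | | Unverified |
| 8 | DenseFlow-74-10 | FID | 34.9 | | Unverified |
| 9 | NVAE w/ flow | FID | 32.53 | | Unverified |
| 10 | QSNGAN | FID | 31.97 | | Unverified |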
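
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GLIDE + CLS | FID | 30.87 | | Unverified |
| 2 | GLIDE + CLIP | FID | 30.46 | | Unverified |
| 3 | GLIDE + CLS-FREE | FID | 29.22 | | Unverified |
| 4 | GLIDE + CLIP + CLS + CLS-FREE | FID | 29.18 | | Unverified |
| 5 | PGMGAN | FID | 21.73 | | Unverified |
| 6 | CLR-GAN | FID | 20.27 | | Unverified |
| 7 | FM | FID | 14.45 | | Unverified |
| 8 | CT (Direct Generation, NFE=1) | FID | 13 | | Unverified |
| 9 | CT (Direct Generation, NFE=2) | FID | 11.1 | | Unverified |
| 10 | GLIDE +CLS | KID | 7.95 | | Unverified |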
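
Most entries above report FID (Fréchet Inception Distance), where lower is better. As a rough sketch of what that number measures, the code below computes the Fréchet distance between two Gaussians fitted to feature vectors. It assumes the features are already extracted (standard FID uses Inception-v3 activations, which are omitted here), and the function names `frechet_distance` and `fid_from_features` are illustrative, not taken from any benchmark's evaluation code.

```python
import numpy as np


def frechet_distance(mu1, sigma1, mu2, sigma2):
    """Frechet distance between N(mu1, sigma1) and N(mu2, sigma2):
    ||mu1 - mu2||^2 + Tr(sigma1 + sigma2 - 2 (sigma1 sigma2)^(1/2))
    """
    mu1 = np.asarray(mu1, dtype=float)
    mu2 = np.asarray(mu2, dtype=float)
    sigma1 = np.atleast_2d(np.asarray(sigma1, dtype=float))
    sigma2 = np.atleast_2d(np.asarray(sigma2, dtype=float))

    diff = mu1 - mu2
    # Tr((sigma1 sigma2)^(1/2)) equals the sum of the square roots of the
    # eigenvalues of sigma1 @ sigma2, which are real and non-negative for
    # covariance matrices. Tiny negative or imaginary parts are numerical noise.
    eigvals = np.linalg.eigvals(sigma1 @ sigma2)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    return float(diff @ diff + np.trace(sigma1) + np.trace(sigma2) - 2.0 * tr_sqrt)


def fid_from_features(real_feats, gen_feats):
    """Fit a Gaussian to each (n_samples, n_dims) feature array and compare them."""
    mu_r, mu_g = real_feats.mean(axis=0), gen_feats.mean(axis=0)
    sigma_r = np.cov(real_feats, rowvar=False)
    sigma_g = np.cov(gen_feats, rowvar=False)
    return frechet_distance(mu_r, sigma_r, mu_g, sigma_g)


# 1-D check: (0 - 3)^2 + (1 - 2)^2, i.e. approximately 10.0
print(frechet_distance([0.0], [[1.0]], [3.0], [[4.0]]))
```

Scores computed this way are only comparable when the feature extractor, sample count, and reference set match, which is one reason claimed and verified values can differ.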