SOTAVerified

Personalized Image Generation

Utilizes single or multiple images that contain the same subject or style, along with text prompt, to generate images that contain that subject as well as match the textual description. Includes finetuning-based methods (e.g. DreamBooth, Textual Inversion) as well as encoder-based methods (e.g. E4T, ELITE, and IP-Adapter, etc.).

Papers

Showing 5158 of 58 papers

TitleStatusHype
PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization0
HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models0
CatVersion: Concatenating Embeddings for Diffusion-Based Text-to-Image Personalization0
FaceChain: A Playground for Human-centric Artificial Intelligence Generated ContentCode0
BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and EditingCode0
Identity Encoder for Personalized Diffusion0
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning0
Personalized Image Generation for Color Vision Deficiency PopulationCode0
Show:102550
← PrevPage 3 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DreamBooth LoRA SDXL v1.0Overall (CP * PF)0.52Unverified
2IP-Adapter ViT-G SDXL v1.0Overall (CP * PF)0.38Unverified
3Emu2 SDXL v1.0Overall (CP * PF)0.36Unverified
4DreamBooth SD v1.5Overall (CP * PF)0.36Unverified
5IP-Adapter-Plus ViT-H SDXL v1.0Overall (CP * PF)0.34Unverified
6BLIP-Diffusion SD v1.5Overall (CP * PF)0.27Unverified
7Textual Inversion SD v1.5Overall (CP * PF)0.24Unverified