SOTAVerified

Personalized Image Generation

Uses one or more images containing the same subject or style, along with a text prompt, to generate images that contain that subject and match the textual description. Includes finetuning-based methods (e.g. DreamBooth, Textual Inversion) as well as encoder-based methods (e.g. E4T, ELITE, IP-Adapter).

Papers

Showing 11-20 of 58 papers

Title | Status | Hype
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance | Code | 2
Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation | Code | 2
Subject-Diffusion: Open Domain Personalized Text-to-Image Generation without Test-time Fine-tuning | Code | 2
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention | Code | 2
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition | Code | 2
EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance | Code | 2
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder | Code | 2
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models | Code | 1
PersonaMagic: Stage-Regulated High-Fidelity Face Customization with Tandem Equilibrium | Code | 1
Personalized Text-to-Image Generation with Auto-Regressive Models | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | DreamBooth LoRA SDXL v1.0 | Overall (CP * PF) | 0.52 | - | Unverified
2 | IP-Adapter ViT-G SDXL v1.0 | Overall (CP * PF) | 0.38 | - | Unverified
3 | Emu2 SDXL v1.0 | Overall (CP * PF) | 0.36 | - | Unverified
4 | DreamBooth SD v1.5 | Overall (CP * PF) | 0.36 | - | Unverified
5 | IP-Adapter-Plus ViT-H SDXL v1.0 | Overall (CP * PF) | 0.34 | - | Unverified
6 | BLIP-Diffusion SD v1.5 | Overall (CP * PF) | 0.27 | - | Unverified
7 | Textual Inversion SD v1.5 | Overall (CP * PF) | 0.24 | - | Unverified
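
The metric label "Overall (CP * PF)" reads as a product of two sub-scores. This page does not define CP or PF; the sketch below assumes they stand for concept preservation and prompt following, each normalized to [0, 1]. The function name, model names, and sub-score values are hypothetical and are not taken from the benchmark. It only illustrates why a product rewards models that balance both sub-scores.

```python
def overall_score(cp: float, pf: float) -> float:
    """Combine two sub-scores in [0, 1] by multiplication.

    cp and pf are ASSUMED to mean concept preservation and prompt
    following; the source page does not define them.
    """
    if not (0.0 <= cp <= 1.0 and 0.0 <= pf <= 1.0):
        raise ValueError("sub-scores are expected to lie in [0, 1]")
    return cp * pf


# Hypothetical sub-scores, for illustration only (not benchmark data).
examples = {
    "model_a": (0.6, 0.8),  # balanced sub-scores
    "model_b": (0.9, 0.3),  # strong on one sub-score, weak on the other
}

# Rank the hypothetical models by their combined score, highest first.
ranked = sorted(
    examples,
    key=lambda name: overall_score(*examples[name]),
    reverse=True,
)

for name in ranked:
    cp, pf = examples[name]
    print(f"{name}: CP={cp:.2f} PF={pf:.2f} overall={overall_score(cp, pf):.2f}")
```

Under a product, a model that does very well on one sub-score but poorly on the other ranks below a model that is moderately good on both, which is one plausible reason to report a combined score rather than either sub-score alone.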