SOTAVerified

Named Entity Driven Zero-Shot Image Manipulation

2024-01-01CVPR 2024Code Available0· sign in to hype

Zhida Feng, Li Chen, Jing Tian, Jiaxiang Liu, Shikun Feng

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We introduced StyleEntity a zero-shot image manipulation model that utilizes named entities as proxies during its training phase. This strategy enables our model to manipulate images using unseen textual descriptions during inference all within a single training phase. Additionally we proposed an inference technique termed Prompt Ensemble Latent Averaging (PELA). PELA averages the manipulation directions derived from various named entities during inference effectively eliminating the noise directions thus achieving stable manipulation. In our experiments StyleEntity exhibited superior performance in a zero-shot setting compared to other methods. The code model weights and datasets is available at https://github.com/feng-zhida/StyleEntity.

Tasks

Reproductions