Unsupervised Semantic Correspondence Using Stable Diffusion

2023-05-24NeurIPS 2023Code Available1· sign in to hype

Eric Hedlin, Gopal Sharma, Shweta Mahajan, Hossam Isack, Abhishek Kar, Andrea Tagliasacchi, Kwang Moo Yi

Code Available — Be the first to reproduce this paper.

Code

github.com/ubc-vision/LDM_correspondences
OfficialIn paperpytorch★ 59

Abstract

Text-to-image diffusion models are now capable of generating images that are often indistinguishable from real images. To generate such images, these models must understand the semantics of the objects they are asked to generate. In this work we show that, without any training, one can leverage this semantic knowledge within diffusion models to find semantic correspondences - locations in multiple images that have the same semantic meaning. Specifically, given an image, we optimize the prompt embeddings of these models for maximum attention on the regions of interest. These optimized embeddings capture semantic information about the location, which can then be transferred to another image. By doing so we obtain results on par with the strongly supervised state of the art on the PF-Willow dataset and significantly outperform (20.9% relative for the SPair-71k dataset) any existing weakly or unsupervised method on PF-Willow, CUB-200 and SPair-71k datasets.

Tasks

Semantic correspondence

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CUB-200-2011	LDM Correspondences	Mean PCK@0.05	61.6	—	Unverified
PF-WILLOW	LDMCorrespondences	PCK	84.3	—	Unverified
SPair-71k	LDMCorrespondences	PCK	45.4	—	Unverified

Unsupervised Semantic Correspondence Using Stable Diffusion

Code

Abstract

Tasks

Benchmark Results

Reproductions