Attributable Visual Similarity Learning

2022-03-28 · CVPR 2022 · Code Available

Borui Zhang, Wenzhao Zheng, Jie Zhou, Jiwen Lu

Abstract

This paper proposes an attributable visual similarity learning (AVSL) framework for a more accurate and explainable similarity measure between images. Most existing similarity learning methods offer little explainability: they map each sample to a single point in the embedding space and compare points with a distance metric (e.g., Mahalanobis distance, Euclidean distance). Motivated by human semantic similarity cognition, we propose a generalized similarity learning paradigm that represents the similarity between two images as a graph and then infers the overall similarity accordingly. Furthermore, we establish a bottom-up similarity construction and top-down similarity inference framework that infers similarity based on semantic hierarchy consistency: we first identify unreliable higher-level similarity nodes and then correct them using the most coherent adjacent lower-level similarity nodes, which simultaneously preserves traces for similarity attribution. Extensive experiments on the CUB-200-2011, Cars196, and Stanford Online Products datasets demonstrate significant improvements over existing deep similarity learning methods and verify the interpretability of our framework. Code is available at https://github.com/zbr17/AVSL.
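
The bottom-up/top-down procedure in the abstract can be made concrete with a small sketch: build per-level similarity nodes, then walk down from the top, replacing each unreliable high-level node with the adjacent lower-level node it is most coherent with, while recording a trace for attribution. Everything below (the function name, the separate confidence inputs, the mean aggregation, the trace format) is an illustrative assumption, not the authors' actual implementation; see the linked repository for that.

```python
import torch

def top_down_inference(node_sims, cross_links, confidences, tau=0.5):
    """Illustrative top-down similarity correction in the spirit of AVSL.

    node_sims:   per-level similarity nodes, ordered low -> high;
                 each entry is a tensor of shape (nodes_at_level,).
    cross_links: coherence matrices between adjacent levels, each of
                 shape (nodes_high, nodes_low).
    confidences: per-level reliability scores, same shapes as node_sims.
    tau:         nodes with confidence below tau are deemed unreliable
                 and corrected from the level below.
    """
    corrected = node_sims[0]
    trace = []  # records which nodes were corrected, for attribution
    for lvl in range(1, len(node_sims)):
        high = node_sims[lvl].clone()
        unreliable = confidences[lvl] < tau
        # Most coherent lower-level neighbor of each high-level node.
        best_child = cross_links[lvl - 1].argmax(dim=1)
        high[unreliable] = corrected[best_child[unreliable]]
        trace.append((lvl, unreliable.nonzero().flatten().tolist()))
        corrected = high
    # Aggregate the corrected top-level nodes into one overall score.
    return corrected.mean(), trace
```

Called with, say, three levels of eight nodes each, this returns a scalar similarity plus a per-level list of corrected node indices, which is the kind of attribution trace the abstract alludes to.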

Tasks

Metric Learning · Image Retrieval

Benchmark Results

Dataset | Model | Metric | Claimed | Verified | Status
CARS196 | ResNet-50 + AVSL | R@1 | 91.5 | - | Unverified
CUB-200-2011 | ResNet-50 + AVSL | R@1 | 71.9 | - | Unverified
Stanford Online Products | ResNet-50 + AVSL | R@1 | 79.6 | - | Unverified
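
R@1 in the table is Recall@1, the standard retrieval metric on these benchmarks: the fraction of query images whose single nearest neighbor (excluding the query itself) shares the query's class label. A self-contained sketch of how such a number is typically computed from embeddings, with function and variable names of our own choosing:

```python
import torch

def recall_at_1(embeddings, labels):
    """Fraction of samples whose nearest neighbor shares their label."""
    # Pairwise Euclidean distances between all embeddings.
    dists = torch.cdist(embeddings, embeddings)
    # Mask out self-matches so a sample cannot retrieve itself.
    dists.fill_diagonal_(float("inf"))
    nearest = dists.argmin(dim=1)
    return (labels[nearest] == labels).float().mean().item()

# Example: 100 random 512-d embeddings over 10 classes.
score = recall_at_1(torch.randn(100, 512), torch.randint(0, 10, (100,)))
```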

Reproductions

None yet. Be the first to reproduce this paper.