SOTAVerified

COTR: Correspondence Transformer for Matching Across Images

2021-03-25ICCV 2021Code Available1· sign in to hype

Wei Jiang, Eduard Trulls, Jan Hosang, Andrea Tagliasacchi, Kwang Moo Yi

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We propose a novel framework for finding correspondences in images based on a deep neural network that, given two images and a query point in one of them, finds its correspondence in the other. By doing so, one has the option to query only the points of interest and retrieve sparse correspondences, or to query all points in an image and obtain dense mappings. Importantly, in order to capture both local and global priors, and to let our model relate between image regions using the most relevant among said priors, we realize our network using a transformer. At inference time, we apply our correspondence network by recursively zooming in around the estimates, yielding a multiscale pipeline able to provide highly-accurate correspondences. Our method significantly outperforms the state of the art on both sparse and dense correspondence problems on multiple datasets and tasks, ranging from wide-baseline stereo to optical flow, without any retraining for a specific dataset. We commit to releasing data, code, and all the tools necessary to train from scratch and ensure reproducibility.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
ETH3DCOTRAEPE (rate=3)1.66Unverified
ETH3DCOTR +Interp.AEPE (rate=5)1.71Unverified
HPatchesCOTRViewpoint I AEPE7.75Unverified
HPatchesCOTR +Interp.Viewpoint I AEPE7.98Unverified
KITTI 2012COTRAverage End-Point Error1.28Unverified
KITTI 2012COTR +Interp.Average End-Point Error2.62Unverified
KITTI 2015COTRAverage End-Point Error2.26Unverified
KITTI 2015COTR +Interp.Average End-Point Error6.12Unverified

Reproductions