Cross-View Completion Models are Zero-shot Correspondence Estimators

2024-12-12CVPR 2025Unverified0· sign in to hype

Honggyu An, Jinhyeon Kim, Seonghoon Park, Jaewoo Jung, Jisang Han, Sunghwan Hong, Seungryong Kim

Unverified — Be the first to reproduce this paper.

Abstract

In this work, we explore new perspectives on cross-view completion learning by drawing an analogy to self-supervised correspondence learning. Through our analysis, we demonstrate that the cross-attention map within cross-view completion models captures correspondence more effectively than other correlations derived from encoder or decoder features. We verify the effectiveness of the cross-attention map by evaluating on both zero-shot matching and learning-based geometric matching and multi-frame depth estimation. Project page is available at https://cvlab-kaist.github.io/ZeroCo/.

Tasks

Decoder Depth Estimation Geometric Matching

Cross-View Completion Models are Zero-shot Correspondence Estimators

Abstract

Tasks

Reproductions