BoQ: A Place is Worth a Bag of Learnable Queries

2024-05-12 · CVPR 2024 · Code Available

Amar Ali-bey, Brahim Chaib-Draa, Philippe Giguère

Abstract

In visual place recognition, accurately identifying and matching images of locations under varying environmental conditions and viewpoints remains a significant challenge. In this paper, we introduce a new technique, called Bag-of-Queries (BoQ), which learns a set of global queries designed to capture universal place-specific attributes. Unlike existing methods that employ self-attention and generate the queries directly from the input features, BoQ employs distinct learnable global queries, which probe the input features via cross-attention, ensuring consistent information aggregation. In addition, our technique provides an interpretable attention mechanism and integrates with both CNN and Vision Transformer backbones. The performance of BoQ is demonstrated through extensive experiments on 14 large-scale benchmarks. It consistently outperforms current state-of-the-art techniques including NetVLAD, MixVPR and EigenPlaces. Moreover, as a global retrieval technique (one-stage), BoQ surpasses two-stage retrieval methods, such as Patch-NetVLAD, TransVPR and R2Former, all while being orders of magnitude faster and more efficient. The code and model weights are publicly available at https://github.com/amaralibey/Bag-of-Queries.
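The core idea in the abstract, a fixed set of learnable queries that probe image features through cross-attention, can be illustrated with a minimal sketch. This is not the authors' implementation: it uses a single attention head, omits the query/key/value projections and any other layers the released model uses, and draws random vectors as stand-ins for learned query parameters. All names (`learned_queries`, `cross_attention`, `describe`) and the sizes are hypothetical.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, features):
    """Single-head scaled dot-product cross-attention.

    queries:  Q x d list (input-independent, shared by all images)
    features: N x d list (local features of one image)
    Returns (outputs Q x d, attention weights Q x N).
    """
    d = len(queries[0])
    scale = 1.0 / math.sqrt(d)
    outputs, weights = [], []
    for q in queries:
        scores = [scale * sum(qi * fi for qi, fi in zip(q, f)) for f in features]
        w = softmax(scores)
        # Weighted sum of the image features for this query.
        out = [sum(w[n] * features[n][j] for n in range(len(features)))
               for j in range(d)]
        outputs.append(out)
        weights.append(w)
    return outputs, weights


rng = random.Random(0)
num_queries, dim, num_patches = 4, 8, 16  # illustrative sizes only

# Stand-in for learned parameters: in training these would be optimized,
# here they are just fixed random vectors (an assumption of this sketch).
learned_queries = [[rng.gauss(0, 1) for _ in range(dim)]
                   for _ in range(num_queries)]


def describe(features):
    """Concatenate per-query outputs and L2-normalize into one descriptor."""
    outputs, weights = cross_attention(learned_queries, features)
    flat = [v for out in outputs for v in out]
    norm = math.sqrt(sum(v * v for v in flat)) or 1.0
    return [v / norm for v in flat], weights


# Hypothetical backbone output for one image: 16 patch features of size 8.
image_features = [[rng.gauss(0, 1) for _ in range(dim)]
                  for _ in range(num_patches)]
descriptor, attn = describe(image_features)
```

Because the queries do not depend on the input, each row of `attn` shows where one particular query looks in a given image, which is one way to read the interpretability claim in the abstract.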

Benchmark Results

| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| AmsterTime | BoQ (ResNet-50) | Recall@1 | 52.2 | | Unverified |
| AmsterTime | BoQ | Recall@1 | 63 | | Unverified |
| Eynsham | BoQ | Recall@1 | 92.2 | | Unverified |
| Eynsham | BoQ (ResNet-50) | Recall@1 | 91.3 | | Unverified |
| Mapillary test | BoQ | Recall@1 | 79 | | Unverified |
| Mapillary val | BoQ (ResNet-50) | Recall@1 | 91.2 | | Unverified |
| Mapillary val | BoQ | Recall@1 | 93.8 | | Unverified |
| Nordland | BoQ (ResNet-50) | Recall@1 | 83.1 | | Unverified |
| Nordland | BoQ | Recall@1 | 90.6 | | Unverified |
| Nordland* (2760 queries) | BoQ | Recall@1 | 81.3 | | Unverified |
| Pittsburgh-250k-test | BoQ (ResNet-50) | Recall@1 | 95 | | Unverified |
| Pittsburgh-250k-test | BoQ | Recall@1 | 96.6 | | Unverified |
| Pittsburgh-30k-test | BoQ | Recall@1 | 93.7 | | Unverified |
| Pittsburgh-30k-test | BoQ (ResNet-50) | Recall@1 | 92.4 | | Unverified |
| San Francisco Landmark Dataset | BoQ | Recall@1 | 93.6 | | Unverified |
| SPED | BoQ (ResNet-50) | Recall@1 | 86.5 | | Unverified |
| SPED | BoQ | Recall@1 | 92.5 | | Unverified |
| St Lucia | BoQ | Recall@5 | 100 | | Unverified |
| St Lucia | BoQ (DINOv2) | Recall@1 | 100 | | Unverified |
| SVOX Night | BoQ (ResNet-50) | Recall@1 | 87.1 | | Unverified |
| SVOX-Overcast | BoQ (ResNet-50) | Recall@1 | 97.8 | | Unverified |
| SVOX-Rain | BoQ (ResNet-50) | Recall@1 | 96.2 | | Unverified |
| SVOX-Snow | BoQ (ResNet-50) | Recall@1 | 98.7 | | Unverified |
| SVOX Sun | BoQ (ResNet-50) | Recall@1 | 95.9 | | Unverified |
| Tokyo247 | BoQ | Recall@1 | 98.1 | | Unverified |
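The listing does not define the Recall@K metric it reports. Under the definition commonly used in place recognition (a query counts as a hit if at least one of its top-K retrieved database images is a correct match), it could be computed roughly as follows. The function name and the toy data are hypothetical, and what counts as a correct match (for example a distance threshold) is dataset-specific and assumed to be given.

```python
def recall_at_k(ranked_predictions, ground_truth, k):
    """Percentage of queries with at least one correct match in the top k.

    ranked_predictions: per query, database indices sorted best-first
    ground_truth:       per query, the set of correct database indices
    """
    hits = 0
    for preds, positives in zip(ranked_predictions, ground_truth):
        if any(p in positives for p in preds[:k]):
            hits += 1
    return 100.0 * hits / len(ranked_predictions)


# Toy example (made-up data): 4 queries, 3 retrieved candidates each.
ranked = [[3, 7, 1], [2, 5, 9], [4, 0, 8], [6, 1, 2]]
positives = [{3}, {9}, {0, 8}, {5}]

r1 = recall_at_k(ranked, positives, 1)  # 25.0: only the first query hits
r2 = recall_at_k(ranked, positives, 2)  # 50.0: the third query hits at rank 2
r3 = recall_at_k(ranked, positives, 3)  # 75.0: the last query never hits
```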

Reproductions