Rethinking Visual Geo-localization for Large-Scale Applications
Gabriele Berton, Carlo Masone, Barbara Caputo
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/gmberton/cosplaceOfficialIn paperpytorch★ 384
- github.com/stschubert/vpr_tutorialpytorch★ 198
Abstract
Visual Geo-localization (VG) is the task of estimating the position where a given photo was taken by comparing it with a large database of images of known locations. To investigate how existing techniques would perform on a real-world city-wide VG application, we build San Francisco eXtra Large, a new dataset covering a whole city and providing a wide range of challenging cases, with a size 30x bigger than the previous largest dataset for visual geo-localization. We find that current methods fail to scale to such large datasets, therefore we design a new highly scalable training technique, called CosPlace, which casts the training as a classification problem avoiding the expensive mining needed by the commonly used contrastive learning. We achieve state-of-the-art performance on a wide range of datasets and find that CosPlace is robust to heavy domain changes. Moreover, we show that, compared to the previous state-of-the-art, CosPlace requires roughly 80% less GPU memory at train time, and it achieves better results with 8x smaller descriptors, paving the way for city-wide real-world visual geo-localization. Dataset, code and trained models are available for research purposes at https://github.com/gmberton/CosPlace.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 17 Places | CosPlace | Recall@1 | 61.08 | — | Unverified |
| Baidu Mall | CosPlace | Recall@1 | 41.62 | — | Unverified |
| Gardens Point | CosPlace | Recall@1 | 74 | — | Unverified |
| Hawkins | CosPlace | Recall@1 | 31.36 | — | Unverified |
| Laurel Caverns | CosPlace | Recall@1 | 24.11 | — | Unverified |
| Mapillary val | CosPlace | Recall@5 | 89.9 | — | Unverified |
| Mapillary val | CosPlace (ResNet-101 2048-D) | Recall@1 | 86.7 | — | Unverified |
| Mid-Atlantic Ridge | CosPlace | Recall@1 | 20.79 | — | Unverified |
| MSLS | CosPlace | Recall@1 | 79.6 | — | Unverified |
| Nardo-Air | CosPlace | Recall@1 | 0 | — | Unverified |
| Nardo-Air R | CosPlace | Recall@1 | 91.55 | — | Unverified |
| Oxford RobotCar Dataset | CosPlace | Recall@1 | 91.1 | — | Unverified |
| Pittsburgh-250k-test | CosPlace | Recall@1 | 91.5 | — | Unverified |
| Pittsburgh-30k-test | CosPlace (ResNet-101 2048-D) | Recall@1 | 90.4 | — | Unverified |
| Pittsburgh-30k-test | CosPlace | Recall@1 | 90.45 | — | Unverified |
| SF-XL test v1 | CosPlace | Recall@1 | 64.7 | — | Unverified |
| SF-XL test v2 | CosPlace | Recall@1 | 83.4 | — | Unverified |
| St Lucia | CosPlace | Recall@1 | 99.59 | — | Unverified |
| Tokyo247 | CosPlace | Recall@1 | 82.2 | — | Unverified |
| Tokyo247 | CosPlace (ResNet-101 2048-D) | Recall@5 | 95.9 | — | Unverified |
| VP Air | CosPlace | Recall@1 | 8.12 | — | Unverified |