SMDT: Cross-View Geo-Localization with Image Alignment and Transformer

2022-04-06 · IEEE International Conference on Multimedia and Expo 2022 · Code Available

Xiaoyang Tian, Jie Shao, Deqiang Ouyang, Anjie Zhu, Feiyu Chen.

Code

github.com/TianXiaoYang-txy/SMDT_PyTorch
pytorch★ 4
github.com/2023-MindSpore-1/ms-code-93
mindspore★ 1
github.com/2023-MindSpore-1/ms-code-3
mindspore★ 1
github.com/TianXiaoYang-txy/SMDT_MindSpore
mindspore★ 1

Abstract

The goal of cross-view geo-localization is to determine the location of a given ground image by matching it with aerial images. However, existing methods ignore the variability of scenes, additional information, and the spatial correspondence of covisibility and non-covisibility areas in ground-aerial image pairs. In this context, we propose a cross-view matching method called SMDT with image alignment and Transformer. First, we utilize a semantic segmentation technique to segment different areas. Then, we convert the vertical view of aerial images to a front view by mixing polar mapping and perspective mapping. Next, we simultaneously train dual conditional generative adversarial nets, taking the semantic segmentation images and converted images as input, to synthesize the aerial image with ground-view style. These steps are collectively referred to as image alignment. Last, we use a Transformer to explicitly utilize the properties of self-attention. Experiments show that our SMDT method is superior to the existing ground-to-aerial cross-view methods.
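The view-conversion step in the abstract can be illustrated with a plain polar mapping, a resampling commonly used in cross-view geo-localization. The sketch below is not the paper's mixed polar/perspective mapping, whose details the abstract does not give. It assumes a square aerial image with north at the top, nearest-neighbour sampling, and an output whose top row corresponds to the image border and whose bottom row corresponds to the image center. The function names `polar_source_coords` and `polar_transform` are hypothetical.

```python
import math


def polar_source_coords(row, col, out_h, out_w, size):
    """Map an output pixel (row, col) to fractional (y, x) in the aerial image.

    Assumed convention (not taken from the paper): columns sweep the azimuth
    clockwise starting from north, and rows run from the image border (top row)
    to the image center (bottom row).
    """
    center = (size - 1) / 2.0
    radius = center * (out_h - 1 - row) / max(out_h - 1, 1)
    theta = 2.0 * math.pi * col / out_w
    src_y = center - radius * math.cos(theta)
    src_x = center + radius * math.sin(theta)
    return src_y, src_x


def polar_transform(aerial, out_h, out_w):
    """Resample a square aerial image (nested lists) into a panorama-like layout."""
    size = len(aerial)
    if any(len(r) != size for r in aerial):
        raise ValueError("expects a square aerial image")
    out = []
    for row in range(out_h):
        out_row = []
        for col in range(out_w):
            y, x = polar_source_coords(row, col, out_h, out_w, size)
            # Nearest-neighbour lookup, clamped to the image bounds.
            yi = min(max(round(y), 0), size - 1)
            xi = min(max(round(x), 0), size - 1)
            out_row.append(aerial[yi][xi])
        out.append(out_row)
    return out


if __name__ == "__main__":
    # Toy 5x5 "image" whose pixel value encodes its position (row * 5 + col).
    toy = [[r * 5 + c for c in range(5)] for r in range(5)]
    for line in polar_transform(toy, out_h=3, out_w=4):
        print(line)
```

In practice the same index arithmetic would be vectorized and combined with bilinear interpolation. Nearest-neighbour sampling is used here only to keep the sketch short and dependency-free.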

Tasks

Geo-localization, Segmentation, Semantic Segmentation
