Channel-Temporal Attention for First-Person Video Domain Adaptation

2021-08-17Unverified0· sign in to hype

Xianyuan Liu, Shuo Zhou, Tao Lei, Haiping Lu

Unverified — Be the first to reproduce this paper.

Abstract

Unsupervised Domain Adaptation (UDA) can transfer knowledge from labeled source data to unlabeled target data of the same categories. However, UDA for first-person action recognition is an under-explored problem, with lack of datasets and limited consideration of first-person video characteristics. This paper focuses on addressing this problem. Firstly, we propose two small-scale first-person video domain adaptation datasets: ADL_small and GTEA-KITCHEN. Secondly, we introduce channel-temporal attention blocks to capture the channel-wise and temporal-wise relationships and model their inter-dependencies important to first-person vision. Finally, we propose a Channel-Temporal Attention Network (CTAN) to integrate these blocks into existing architectures. CTAN outperforms baselines on the two proposed datasets and one existing dataset EPIC_cvpr20.

Tasks

Action Recognition Domain Adaptation Unsupervised Domain Adaptation

Channel-Temporal Attention for First-Person Video Domain Adaptation

Abstract

Tasks

Reproductions