Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022

2022-07-22

Maria Escobar, Laura Daza, Cristina González, Jordi Pont-Tuset, Pablo Arbeláez

Abstract

We implemented Video Swin Transformer as a base architecture for the tasks of Point-of-No-Return temporal localization and Object State Change Classification. Our method achieved competitive performance on both challenges.

Tasks

Object, Object State Change Classification, Temporal Localization, Video Understanding
