Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022
2022-07-22Unverified0· sign in to hype
Maria Escobar, Laura Daza, Cristina González, Jordi Pont-Tuset, Pablo Arbeláez
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
We implemented Video Swin Transformer as a base architecture for the tasks of Point-of-No-Return temporal localization and Object State Change Classification. Our method achieved competitive performance on both challenges.