| Egocentric Video-Language Pretraining | Jun 3, 2022 | Action RecognitionContrastive Learning | CodeCode Available | 2 |
| Egocentric Video-Language Pretraining @ Ego4D Challenge 2022 | Jul 4, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens | Nov 19, 2022 | Action RecognitionObject State Change Classification | CodeCode Available | 1 |
| Learning State-Aware Visual Representations from Audible Interactions | Sep 27, 2022 | Action AnticipationAction Recognition | CodeCode Available | 1 |
| Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022 | Jul 22, 2022 | ObjectObject State Change Classification | —Unverified | 0 |
| Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022 | Nov 16, 2022 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Anticipating Object State Changes in Long Procedural Videos | May 21, 2024 | ObjectObject State Change Classification | —Unverified | 0 |
| Object State Change Classification in Egocentric Videos using the Divided Space-Time Attention Mechanism | Jul 24, 2022 | ObjectObject State Change Classification | CodeCode Available | 0 |
| Masked Autoencoders for Egocentric Video Understanding @ Ego4D Challenge 2022 | Nov 18, 2022 | Object State Change ClassificationTemporal Localization | CodeCode Available | 0 |