| EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation | Jun 26, 2024 | Action Anticipation, Action Recognition | Code Available | 2 |
| Multimodal Large Models Are Effective Action Anticipators | Jan 1, 2025 | Action Anticipation, Long Term Action Anticipation | Code Available | 1 |
| Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation | Jul 16, 2024 | Action Anticipation, Autonomous Driving | Code Available | 1 |
| AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? | Jul 31, 2023 | Action Anticipation, counterfactual | Code Available | 1 |
| Palm: Predicting Actions through Language Models @ Ego4D Long-Term Action Anticipation Challenge 2023 | Jun 28, 2023 | Action Anticipation, Image Captioning | Code Available | 1 |
| HierVL: Learning Hierarchical Video-Language Embeddings | Jan 5, 2023 | Action Classification, Action Recognition | Code Available | 1 |
| Rethinking Learning Approaches for Long-Term Action Anticipation | Oct 20, 2022 | Action Anticipation, Future prediction | Code Available | 1 |
| Learning State-Aware Visual Representations from Audible Interactions | Sep 27, 2022 | Action Anticipation, Action Recognition | Code Available | 1 |
| Intention-Conditioned Long-Term Human Egocentric Action Forecasting | Jul 25, 2022 | Action Anticipation, Long Term Action Anticipation | Code Available | 1 |
| Video + CLIP Baseline for Ego4D Long-term Action Anticipation | Jul 1, 2022 | Action Anticipation, Long Term Action Anticipation | Code Available | 1 |