V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning Jun 11, 2025 Action Anticipation Large Language Model
Code Code Available 75 EgoExoLearn: A Dataset for Bridging Asynchronous Ego- and Exo-centric View of Procedural Activities in Real World Mar 24, 2024 Action Anticipation Action Quality Assessment
Code Code Available 25 EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation Jun 26, 2024 Action Anticipation Action Recognition
Code Code Available 25 Learning State-Aware Visual Representations from Audible Interactions Sep 27, 2022 Action Anticipation Action Recognition
Code Code Available 15 Future Transformer for Long-term Action Anticipation May 27, 2022 Action Anticipation Long Term Action Anticipation
Code Code Available 15 MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Action Anticipation Jan 1, 2025 Action Anticipation Mamba
Code Code Available 15 MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition Jan 20, 2022 Action Anticipation Action Classification
Code Code Available 15 Real-time Online Video Detection with Temporal Smoothing Transformers Sep 19, 2022 Action Anticipation Action Detection
Code Code Available 15 Rethinking Learning Approaches for Long-Term Action Anticipation Oct 20, 2022 Action Anticipation Future prediction
Code Code Available 15 Pedestrian 3D Bounding Box Prediction Jun 28, 2022 Action Anticipation Autonomous Driving
Code Code Available 15 Action Anticipation with Goal Consistency Jun 26, 2023 Action Anticipation
Code Code Available 15 Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation Jul 16, 2024 Action Anticipation Autonomous Driving
Code Code Available 15 AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? Jul 31, 2023 Action Anticipation counterfactual
Code Code Available 15 Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation Oct 23, 2022 Action Anticipation
Code Code Available 15 What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention May 22, 2019 Action Anticipation Action Recognition
Code Code Available 15 Rolling-Unrolling LSTMs for Action Anticipation from First-Person Video May 4, 2020 Action Anticipation Action Recognition
Code Code Available 15 Technical Report: Temporal Aggregate Representations Jun 6, 2021 Action Anticipation Action Recognition
Code Code Available 15 Intention-Conditioned Long-Term Human Egocentric Action Forecasting Jul 25, 2022 Action Anticipation Long Term Action Anticipation
Code Code Available 15 Multimodal Large Models Are Effective Action Anticipators Jan 1, 2025 Action Anticipation Long Term Action Anticipation
Code Code Available 15 Pedestrian Action Anticipation using Contextual Feature Fusion in Stacked RNNs May 13, 2020 Action Anticipation Autonomous Vehicles
Code Code Available 15 Action Scene Graphs for Long-Form Understanding of Egocentric Videos Dec 6, 2023 Action Anticipation Form
Code Code Available 15 Palm: Predicting Actions through Language Models @ Ego4D Long-Term Action Anticipation Challenge 2023 Jun 28, 2023 Action Anticipation Image Captioning
Code Code Available 15 A Dynamic Spatial-temporal Attention Network for Early Anticipation of Traffic Accidents Jun 18, 2021 Accident Anticipation Action Anticipation
Code Code Available 15 Semantically Guided Representation Learning For Action Anticipation Jul 2, 2024 Action Anticipation Representation Learning
Code Code Available 15 Anticipative Video Transformer Jun 3, 2021 Action Anticipation
Code Code Available 15 Temporal Aggregate Representations for Long-Range Video Understanding Jun 1, 2020 Action Anticipation Action Recognition
Code Code Available 15 Rescaling Egocentric Vision Jun 23, 2020 Action Anticipation Action Detection
Code Code Available 15 Higher Order Recurrent Space-Time Transformer for Video Action Prediction Apr 17, 2021 Action Anticipation Action Recognition
Code Code Available 15 Video + CLIP Baseline for Ego4D Long-term Action Anticipation Jul 1, 2022 Action Anticipation Long Term Action Anticipation
Code Code Available 15 Video Representation Learning with Visual Tempo Consistency Jun 28, 2020 Action Anticipation Action Detection
Code Code Available 15 Enhancing Next Active Object-based Egocentric Action Anticipation with Guided Attention May 22, 2023 Action Anticipation Object
Code Code Available 05 Encouraging LSTMs to Anticipate Actions Very Early Mar 21, 2017 Action Anticipation Autonomous Navigation
Code Code Available 05 Text-Derived Knowledge Helps Vision: A Simple Cross-modal Distillation for Video-based Action Anticipation Oct 12, 2022 Action Anticipation Transfer Learning
Code Code Available 05 TransAction: ICL-SJTU Submission to EPIC-Kitchens Action Anticipation Challenge 2021 Jul 28, 2021 Action Anticipation
Code Code Available 05 Technical Report for Ego4D Long Term Action Anticipation Challenge 2023 Jul 4, 2023 Action Anticipation Decoder
Code Code Available 05 Interaction Region Visual Transformer for Egocentric Action Anticipation Nov 25, 2022 Action Anticipation Human-Object Interaction Detection
Code Code Available 05 Scaling Egocentric Vision: The EPIC-KITCHENS Dataset Apr 8, 2018 Action Anticipation
Code Code Available 05 Unified Recurrence Modeling for Video Action Anticipation Jun 2, 2022 Action Anticipation Decision Making
Code Code Available 05 RED: Reinforced Encoder-Decoder Networks for Action Anticipation Jul 16, 2017 Action Anticipation Decoder
Code Code Available 05 Predicting the Next Action by Modeling the Abstract Goal Sep 12, 2022 Action Anticipation
Code Code Available 05 QuIIL at T3 challenge: Towards Automation in Life-Saving Intervention Procedures from First-Person View Jul 18, 2024 Action Anticipation Action Recognition
Code Code Available 05 Object-centric Video Representation for Long-term Action Anticipation Oct 31, 2023 Action Anticipation Human-Object Interaction Detection
Code Code Available 05 Hierarchical and Multimodal Data for Daily Activity Understanding Apr 24, 2025 Action Anticipation counterfactual
Code Code Available 05 HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN Dec 10, 2019 Action Anticipation Action Classification
Code Code Available 05 Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities Mar 28, 2022 3D Action Recognition Action Anticipation
Code Code Available 05 Mamba Fusion: Learning Actions Through Questioning Sep 17, 2024 Action Anticipation Action Recognition
Code Code Available 05 Forecasting Human-Object Interaction: Joint Prediction of Motor Attention and Actions in First Person Video Nov 25, 2019 Action Anticipation Human-Object Interaction Detection
Code Code Available 05 Action Anticipation from SoccerNet Football Video Broadcasts Apr 16, 2025 Action Anticipation Action Spotting
Code Code Available 05 Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos Feb 7, 2023 Action Anticipation Action Recognition
Code Code Available 05 From Recognition to Prediction: Leveraging Sequence Reasoning for Action Anticipation Aug 5, 2024 Action Anticipation Action Recognition
Code Code Available 05