| Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports | Jan 3, 2024 | Action Understanding, counterfactual | Code Available | 1 | 5 |
| Few-Shot Fine-Grained Action Recognition via Bidirectional Attention and Contrastive Meta-Learning | Aug 15, 2021 | Action Recognition, Action Understanding | Code Available | 0 | 5 |
| Action Recognition with Trajectory-Pooled Deep-Convolutional Descriptors | May 19, 2015 | Action Recognition, Action Understanding | Code Available | 0 | 5 |
| Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos | Apr 28, 2022 | Action Understanding, Video Captioning | Code Available | 0 | 5 |
| ACT-Thor: A Controlled Benchmark for Embodied Action Understanding in Simulated Environments | Oct 1, 2022 | Action Understanding | Code Available | 0 | 5 |
| Win-Fail Action Recognition | Feb 15, 2021 | Action Recognition, Action Understanding | Code Available | 0 | 5 |
| Online Spatiotemporal Action Detection and Prediction via Causal Representations | Aug 31, 2020 | Action Detection, Action Recognition | Code Available | 0 | 5 |
| LLaVA-Pose: Enhancing Human Pose and Action Understanding via Keypoint-Integrated Instruction Tuning | Jun 26, 2025 | Action Understanding, Instruction Following | Code Available | 0 | 5 |
| Video Action Understanding | Oct 13, 2020 | Action Understanding, Deep Learning | Code Available | 0 | 5 |
| Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and Beyond | Jun 5, 2024 | Action Recognition, Action Understanding | Code Available | 0 | 5 |
| mRI: Multi-modal 3D Human Pose Estimation Dataset using mmWave, RGB-D, and Inertial Sensors | Oct 15, 2022 | 3D Human Pose Estimation, Action Detection | Unverified | 0 | 0 |
| Multitask Learning in Minimally Invasive Surgical Vision: A Review | Jan 16, 2024 | Action Understanding | Unverified | 0 | 0 |
| PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition | Apr 17, 2025 | Action Recognition, Action Understanding | Unverified | 0 | 0 |
| PKU-MMD: A Large Scale Benchmark for Continuous Multi-Modal Human Action Understanding | Mar 22, 2017 | Action Detection, Action Recognition | Unverified | 0 | 0 |
| Probing Fine-Grained Action Understanding and Cross-View Generalization of Foundation Models | Jul 22, 2024 | Action Understanding, Activity Recognition | Unverified | 0 | 0 |
| Region-aware Image-based Human Action Retrieval with Transformers | Jul 13, 2024 | Action Recognition, Action Understanding | Unverified | 0 | 0 |
| RoboAct-CLIP: Video-Driven Pre-training of Atomic Action Understanding for Robotics | Apr 2, 2025 | Action Understanding, Representation Learning | Unverified | 0 | 0 |
| Scene Understanding for Autonomous Manipulation with Deep Learning | Mar 23, 2019 | Action Understanding, Affordance Detection | Unverified | 0 | 0 |
| ScreenLLM: Stateful Screen Schema for Efficient Action Understanding and Prediction | Mar 26, 2025 | Action Understanding | Unverified | 0 | 0 |
| Self-supervised Discovery of Human Actons from Long Kinematic Videos | Sep 29, 2021 | Action Understanding, Sentence | Unverified | 0 | 0 |
| Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning | Apr 8, 2024 | Action Understanding, Decoder | Unverified | 0 | 0 |
| STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding | Jan 1, 2025 | Action Understanding, Spatio-Temporal Video Grounding | Unverified | 0 | 0 |
| Theory of Minds: Understanding Behavior in Groups Through Inverse Planning | Jan 18, 2019 | Action Understanding, Bayesian Inference | Unverified | 0 | 0 |
| The Role of Video Generation in Enhancing Data-Limited Action Understanding | May 26, 2025 | Action Recognition, Action Understanding | Unverified | 0 | 0 |
| About Time: Advances, Challenges, and Outlooks of Action Understanding | Nov 22, 2024 | Action Understanding, Survey | Unverified | 0 | 0 |