| Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos | Apr 28, 2022 | Action Understanding, Video Captioning | Code Available | 0 | 5 |
| LLaVA-Pose: Enhancing Human Pose and Action Understanding via Keypoint-Integrated Instruction Tuning | Jun 26, 2025 | Action Understanding, Instruction Following | Code Available | 0 | 5 |
| Online Spatiotemporal Action Detection and Prediction via Causal Representations | Aug 31, 2020 | Action Detection, Action Recognition | Code Available | 0 | 5 |
| Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and Beyond | Jun 5, 2024 | Action Recognition, Action Understanding | Code Available | 0 | 5 |
| Video Action Understanding | Oct 13, 2020 | Action Understanding, Deep Learning | Code Available | 0 | 5 |
| CathAction: A Benchmark for Endovascular Intervention Understanding | Aug 23, 2024 | Action Understanding | Unverified | 0 | 0 |
| FineGym: A Hierarchical Video Dataset for Fine-grained Action Understanding | Apr 14, 2020 | Action Recognition, Action Understanding | Unverified | 0 | 0 |
| Can Humans Fly? Action Understanding With Multiple Classes of Actors | Jun 1, 2015 | Action Recognition, Action Understanding | Unverified | 0 | 0 |
| Exploring Uncertainty in Conditional Multi-Modal Retrieval Systems | Jan 23, 2019 | Action Understanding, Person Re-Identification | Unverified | 0 | 0 |
| Explainable robotic systems: Understanding goal-driven actions in a reinforcement learning scenario | Jun 24, 2020 | Action Understanding, Decision Making | Unverified | 0 | 0 |