| Region-aware Image-based Human Action Retrieval with Transformers | Jul 13, 2024 | Action RecognitionAction Understanding | —Unverified | 0 |
| RoboAct-CLIP: Video-Driven Pre-training of Atomic Action Understanding for Robotics | Apr 2, 2025 | Action UnderstandingRepresentation Learning | —Unverified | 0 |
| Scene Understanding for Autonomous Manipulation with Deep Learning | Mar 23, 2019 | Action UnderstandingAffordance Detection | —Unverified | 0 |
| ScreenLLM: Stateful Screen Schema for Efficient Action Understanding and Prediction | Mar 26, 2025 | Action Understanding | —Unverified | 0 |
| Self-supervised Discovery of Human Actons from Long Kinematic Videos | Sep 29, 2021 | Action UnderstandingSentence | —Unverified | 0 |
| Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning | Apr 8, 2024 | Action UnderstandingDecoder | —Unverified | 0 |
| STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding | Jan 1, 2025 | Action UnderstandingSpatio-Temporal Video Grounding | —Unverified | 0 |
| The SkatingVerse Workshop & Challenge: Methods and Results | May 27, 2024 | Action Understanding | —Unverified | 0 |
| Action Understanding with Multiple Classes of Actors | Apr 27, 2017 | Action RecognitionAction Segmentation | —Unverified | 0 |
| Actor and Action Modular Network for Text-based Video Segmentation | Nov 2, 2020 | Action SegmentationAction Understanding | —Unverified | 0 |