| Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos | Mar 26, 2022 | Action SegmentationAction Understanding | CodeCode Available | 1 | 5 |
| Open-Vocabulary Video Relation Extraction | Dec 25, 2023 | Action ClassificationAction Understanding | CodeCode Available | 1 | 5 |
| PIANO: A Parametric Hand Bone Model from Magnetic Resonance Imaging | Jun 21, 2021 | Action Understanding | CodeCode Available | 1 | 5 |
| Memory-and-Anticipation Transformer for Online Action Understanding | Aug 15, 2023 | Action DetectionAction Understanding | CodeCode Available | 1 | 5 |
| Action Quality Assessment with Temporal Parsing Transformer | Jul 19, 2022 | Action Quality AssessmentAction Understanding | CodeCode Available | 1 | 5 |
| EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding | Jun 13, 2024 | Action ClassificationAction Localization | CodeCode Available | 1 | 5 |
| F^3Set: Towards Analyzing Fast, Frequent, and Fine-grained Events from Videos | Apr 11, 2025 | Action UnderstandingEvent Detection | CodeCode Available | 1 | 5 |
| FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment | May 11, 2024 | Action Quality AssessmentAction Understanding | CodeCode Available | 1 | 5 |
| Detailed 2D-3D Joint Representation for Human-Object Interaction | Apr 17, 2020 | Action UnderstandingHuman-Object Interaction Detection | CodeCode Available | 1 | 5 |
| Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports | Jan 3, 2024 | Action Understandingcounterfactual | CodeCode Available | 1 | 5 |