| LLaVAction: evaluating and training multi-modal large language models for action recognition | Mar 24, 2025 | Action Recognition, Action Understanding | Code Available | 2 | 5 |
| OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding | Jun 11, 2024 | Action Understanding, Diversity | Code Available | 2 | 5 |
| Home Action Genome: Cooperative Compositional Action Understanding | May 11, 2021 | Action Recognition, Action Understanding | Code Available | 1 | 5 |
| EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding | Jun 13, 2024 | Action Classification, Action Localization | Code Available | 1 | 5 |
| FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment | May 11, 2024 | Action Quality Assessment, Action Understanding | Code Available | 1 | 5 |
| FineSports: A Multi-person Hierarchical Sports Video Dataset for Fine-grained Action Understanding | Jan 1, 2024 | Action Analysis, Action Understanding | Code Available | 1 | 5 |
| Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos | Mar 26, 2022 | Action Segmentation, Action Understanding | Code Available | 1 | 5 |
| Action Quality Assessment with Temporal Parsing Transformer | Jul 19, 2022 | Action Quality Assessment, Action Understanding | Code Available | 1 | 5 |
| Domain Knowledge-Informed Self-Supervised Representations for Workout Form Assessment | Feb 28, 2022 | 3D Action Recognition, Action Analysis | Code Available | 1 | 5 |
| Detailed 2D-3D Joint Representation for Human-Object Interaction | Apr 17, 2020 | Action Understanding, Human-Object Interaction Detection | Code Available | 1 | 5 |