| EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding | Jun 13, 2024 | Action ClassificationAction Localization | CodeCode Available | 1 |
| OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding | Jun 11, 2024 | Action UnderstandingDiversity | CodeCode Available | 2 |
| Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and Beyond | Jun 5, 2024 | Action RecognitionAction Understanding | CodeCode Available | 0 |
| The SkatingVerse Workshop & Challenge: Methods and Results | May 27, 2024 | Action Understanding | —Unverified | 0 |
| FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality Assessment | May 11, 2024 | Action Quality AssessmentAction Understanding | CodeCode Available | 1 |
| Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning | Apr 8, 2024 | Action UnderstandingDecoder | —Unverified | 0 |
| Enhancing Video Transformers for Action Understanding with VLM-aided Training | Mar 24, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| Impact of Large Language Model Assistance on Patients Reading Clinical Notes: A Mixed-Methods Study | Jan 17, 2024 | Action UnderstandingLanguage Modeling | —Unverified | 0 |
| Multitask Learning in Minimally Invasive Surgical Vision: A Review | Jan 16, 2024 | Action Understanding | —Unverified | 0 |
| Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports | Jan 3, 2024 | Action Understandingcounterfactual | CodeCode Available | 1 |