| Scaling Open-Vocabulary Action Detection | Apr 4, 2025 | Action DetectionMultiple Action Detection | CodeCode Available | 0 |
| Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues | Feb 1, 2025 | Action ClassificationAction Localization | —Unverified | 0 |
| Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives | Sep 21, 2023 | Action LocalizationAction Recognition | —Unverified | 0 |
| End-to-End Spatio-Temporal Action Localisation with Video Transformers | Apr 24, 2023 | Action DetectionAction Recognition | —Unverified | 0 |
| VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking | Mar 29, 2023 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| Unmasked Teacher: Towards Training-Efficient Video Foundation Models | Mar 28, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Unified Keypoint-based Action Recognition Framework via Structured Keypoint Pooling | Mar 27, 2023 | Action LocalizationAction Recognition | —Unverified | 0 |
| InternVideo: General Video Foundation Models via Generative and Discriminative Learning | Dec 6, 2022 | Action ClassificationAction Recognition | CodeCode Available | 4 |
| E^2TAD: An Energy-Efficient Tracking-based Action Detector | Apr 9, 2022 | Action DetectionAction Localization | CodeCode Available | 1 |
| MM-SEAL: A Large-scale Video Dataset of Multi-person Multi-grained Spatio-temporally Action Localization | Apr 6, 2022 | Action LocalizationAction Recognition | —Unverified | 0 |