| OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos | Feb 10, 2022 | Action LocalizationTemporal Action Localization | —Unverified | 0 | 0 |
| PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization | Mar 9, 2021 | Action LocalizationBoundary Detection | —Unverified | 0 | 0 |
| Pointly-Supervised Action Localization | May 29, 2018 | Action LocalizationMultiple Instance Learning | —Unverified | 0 | 0 |
| Poselet Key-Framing: A Model for Human Activity Recognition | Jun 1, 2013 | Activity RecognitionHuman Activity Recognition | —Unverified | 0 | 0 |
| Practitioner-Centric Approach for Early Incident Detection Using Crowdsourced Data for Emergency Services | Dec 3, 2021 | Event DetectionManagement | —Unverified | 0 | 0 |
| Pseudo Strong Labels from Frame-Level Predictions for Weakly Supervised Sound Event Detection | Jan 7, 2025 | Event DetectionSound Event Detection | —Unverified | 0 | 0 |
| ReActNet: Temporal Localization of Repetitive Activities in Real-World Videos | Oct 14, 2019 | Temporal Localization | —Unverified | 0 | 0 |
| Activity Recognition on a Large Scale in Short Videos - Moments in Time Dataset | Sep 1, 2018 | Action RecognitionActivity Recognition | —Unverified | 0 | 0 |
| Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos | Sep 18, 2020 | cross-modal alignmentreinforcement-learning | —Unverified | 0 | 0 |
| Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes | Jun 16, 2022 | Temporal Localization | —Unverified | 0 | 0 |