| Weakly Supervised Multiple Instance Learning for Whale Call Detection and Temporal Localization in Long-Duration Passive Acoustic Monitoring | Feb 28, 2025 | Multiple Instance LearningTemporal Localization | CodeCode Available | 0 | 5 |
| When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in Vlogs | Feb 16, 2022 | Action LocalizationTemporal Action Localization | CodeCode Available | 0 | 5 |
| Technical Report of the Video Event Reconstruction and Analysis (VERA) System -- Shooter Localization, Models, Interface, and Beyond | May 26, 2019 | Gunshot DetectionShooter Localization | CodeCode Available | 0 | 5 |
| Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022 | Nov 16, 2022 | Human-Object Interaction DetectionObject | —Unverified | 0 | 0 |
| Objects2action: Classifying and localizing actions without any video example | Oct 23, 2015 | AttributeObject | —Unverified | 0 | 0 |
| OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog | Feb 20, 2024 | ObjectObject Tracking | —Unverified | 0 | 0 |
| AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization | Nov 27, 2019 | Action ClassificationAction Recognition | —Unverified | 0 | 0 |
| Efficient Action Localization with Approximately Normalized Fisher Vectors | Jun 1, 2014 | Action LocalizationAction Recognition | —Unverified | 0 | 0 |
| Efficient Action Detection in Untrimmed Videos via Multi-Task Learning | Dec 22, 2016 | Action DetectionAction Localization | —Unverified | 0 | 0 |
| Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection | Oct 18, 2022 | Event DetectionSound Event Detection | —Unverified | 0 | 0 |