| Detection of Sleep Apnea-Hypopnea Events Using Millimeter-wave Radar and Pulse Oximeter | Sep 28, 2024 | Temporal Localization | —Unverified | 0 | 0 |
| Action Shuffling for Weakly Supervised Temporal Localization | May 10, 2021 | Action LocalizationTemporal Localization | —Unverified | 0 | 0 |
| Sequential End-to-End Intent and Slot Label Classification and Localization | Jun 8, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries | Dec 17, 2024 | Human Detectionimage-classification | —Unverified | 0 | 0 |
| Single-Stage Visual Query Localization in Egocentric Videos | Jun 15, 2023 | object-detectionObject Detection | —Unverified | 0 | 0 |
| Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 2022 | Jul 22, 2022 | ObjectObject State Change Classification | —Unverified | 0 | 0 |
| SocialGesture: Delving into Multi-person Gesture Understanding | Apr 3, 2025 | Gesture RecognitionQuestion Answering | —Unverified | 0 | 0 |
| Action recognition in real-world videos | Apr 22, 2020 | Action RecognitionTemporal Action Localization | —Unverified | 0 | 0 |
| Spatio-Temporal Attention Models for Grounded Video Captioning | Oct 17, 2016 | image-classificationImage Classification | —Unverified | 0 | 0 |
| Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding | Mar 28, 2023 | Action LocalizationAction Recognition | —Unverified | 0 | 0 |