| Single-Stage Visual Query Localization in Egocentric Videos | Jun 15, 2023 | object-detectionObject Detection | —Unverified | 0 |
| Temporal Localization of Non-Static Digital Videos Using the Electrical Network Frequency | Apr 20, 2020 | ENF (Electric Network Frequency) Extraction from VideoTemporal Localization | —Unverified | 0 |
| A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes | Feb 3, 2022 | Data AugmentationEvent Detection | —Unverified | 0 |
| Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization | Dec 1, 2013 | Action LocalizationClassification | —Unverified | 0 |
| Action recognition in real-world videos | Apr 22, 2020 | Action RecognitionTemporal Action Localization | —Unverified | 0 |
| Action Shuffling for Weakly Supervised Temporal Localization | May 10, 2021 | Action LocalizationTemporal Localization | —Unverified | 0 |
| Activity Recognition on a Large Scale in Short Videos - Moments in Time Dataset | Sep 1, 2018 | Action RecognitionActivity Recognition | —Unverified | 0 |
| AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization | Nov 27, 2019 | Action ClassificationAction Recognition | —Unverified | 0 |
| A Data Driven End-to-end Approach for In-the-wild Monitoring of Eating Behavior Using Smartwatches | Oct 12, 2020 | Temporal Localization | —Unverified | 0 |
| A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus | Nov 18, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports | Apr 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations | May 10, 2023 | Template MatchingTemporal Localization | —Unverified | 0 |
| Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding | Mar 24, 2024 | Dense Video CaptioningTemporal Localization | —Unverified | 0 |
| Contrastive Language-Action Pre-training for Temporal Localization | Apr 26, 2022 | Action LocalizationContrastive Learning | —Unverified | 0 |
| Crash Time Matters: HybridMamba for Fine-Grained Temporal Localization in Traffic Surveillance Footage | Apr 4, 2025 | Temporal Localization | —Unverified | 0 |
| Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization | Aug 24, 2023 | Action LocalizationContrastive Learning | —Unverified | 0 |
| Deep-Learning-Assisted Analysis of Cataract Surgery Videos | Dec 10, 2023 | Decision MakingDeep Learning | —Unverified | 0 |
| Density-Guided Label Smoothing for Temporal Localization of Driving Actions | Mar 11, 2024 | Action LocalizationAction Recognition | —Unverified | 0 |
| Described Spatial-Temporal Video Detection | Jul 8, 2024 | Multi-class ClassificationTemporal Localization | —Unverified | 0 |
| Detection of Sleep Apnea-Hypopnea Events Using Millimeter-wave Radar and Pulse Oximeter | Sep 28, 2024 | Temporal Localization | —Unverified | 0 |
| Efficient Action Detection in Untrimmed Videos via Multi-Task Learning | Dec 22, 2016 | Action DetectionAction Localization | —Unverified | 0 |
| Efficient Action Localization with Approximately Normalized Fisher Vectors | Jun 1, 2014 | Action LocalizationAction Recognition | —Unverified | 0 |
| Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 2022 | Nov 16, 2022 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Exploring Temporal Preservation Networks for Precise Temporal Action Localization | Aug 10, 2017 | Action LocalizationOpen-Ended Question Answering | —Unverified | 0 |
| Few-Shot Transformation of Common Actions into Time and Space | Apr 6, 2021 | Action LocalizationDecoder | —Unverified | 0 |
| Fine-Tuning Large Audio-Language Models with LoRA for Precise Temporal Localization of Prolonged Exposure Therapy Elements | Jun 11, 2025 | Temporal Localization | —Unverified | 0 |
| Fusion of Millimeter-wave Radar and Pulse Oximeter Data for Low-burden Diagnosis of Obstructive Sleep Apnea-Hypopnea Syndrome | Jan 25, 2025 | DiagnosticSleep Staging | —Unverified | 0 |
| Identity-aware Graph Memory Network for Action Detection | Aug 26, 2021 | Action DetectionGraph Neural Network | —Unverified | 0 |
| Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors | Aug 27, 2024 | Event DetectionSound Event Detection | —Unverified | 0 |
| Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection | Sep 26, 2022 | Audio TaggingEvent Detection | —Unverified | 0 |
| Inceptive Event Time-Surfaces for Object Classification Using Neuromorphic Cameras | Feb 26, 2020 | ClassificationDimensionality Reduction | —Unverified | 0 |
| Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences | Jan 29, 2020 | Action RecognitionAction Segmentation | —Unverified | 0 |
| Learning to track for spatio-temporal action localization | Jun 5, 2015 | Action LocalizationSpatio-Temporal Action Localization | —Unverified | 0 |
| Measure Twice, Cut Once: Grasping Video Structures and Event Semantics with LLMs for Video Temporal Localization | Mar 12, 2025 | Temporal LocalizationVideo Understanding | —Unverified | 0 |
| MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval | Jun 25, 2024 | cross-modal alignmentMoment Retrieval | —Unverified | 0 |
| Modality Shifting Attention Network for Multi-modal Video Question Answering | Jul 4, 2020 | Question AnsweringTemporal Localization | —Unverified | 0 |
| Modeling Spatio-Temporal Human Track Structure for Action Localization | Jun 28, 2018 | Action LocalizationHuman Detection | —Unverified | 0 |
| Objects2action: Classifying and localizing actions without any video example | Oct 23, 2015 | AttributeObject | —Unverified | 0 |
| OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog | Feb 20, 2024 | ObjectObject Tracking | —Unverified | 0 |
| Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection | Oct 18, 2022 | Event DetectionSound Event Detection | —Unverified | 0 |
| OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos | Feb 10, 2022 | Action LocalizationTemporal Action Localization | —Unverified | 0 |
| PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization | Mar 9, 2021 | Action LocalizationBoundary Detection | —Unverified | 0 |
| Pointly-Supervised Action Localization | May 29, 2018 | Action LocalizationMultiple Instance Learning | —Unverified | 0 |
| Poselet Key-Framing: A Model for Human Activity Recognition | Jun 1, 2013 | Activity RecognitionHuman Activity Recognition | —Unverified | 0 |
| Practitioner-Centric Approach for Early Incident Detection Using Crowdsourced Data for Emergency Services | Dec 3, 2021 | Event DetectionManagement | —Unverified | 0 |
| SocialGesture: Delving into Multi-person Gesture Understanding | Apr 3, 2025 | Gesture RecognitionQuestion Answering | —Unverified | 0 |
| Spatio-Temporal Attention Models for Grounded Video Captioning | Oct 17, 2016 | image-classificationImage Classification | —Unverified | 0 |
| Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding | Mar 28, 2023 | Action LocalizationAction Recognition | —Unverified | 0 |
| Spectro-Temporal RF Identification using Deep Learning | Jul 11, 2021 | Deep Learningobject-detection | —Unverified | 0 |
| Spot On: Action Localization from Pointly-Supervised Proposals | Apr 26, 2016 | Action LocalizationMultiple Instance Learning | —Unverified | 0 |