SOTAVerified

Temporal Localization

Papers

Showing 51100 of 153 papers

TitleStatusHype
Single-Stage Visual Query Localization in Egocentric Videos0
Temporal Localization of Non-Static Digital Videos Using the Electrical Network Frequency0
A benchmark of state-of-the-art sound event detection systems evaluated on synthetic soundscapes0
Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization0
Action recognition in real-world videos0
Action Shuffling for Weakly Supervised Temporal Localization0
Activity Recognition on a Large Scale in Short Videos - Moments in Time Dataset0
AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization0
A Data Driven End-to-end Approach for In-the-wild Monitoring of Eating Behavior Using Smartwatches0
A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus0
A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports0
Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations0
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding0
Contrastive Language-Action Pre-training for Temporal Localization0
Crash Time Matters: HybridMamba for Fine-Grained Temporal Localization in Traffic Surveillance Footage0
Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization0
Deep-Learning-Assisted Analysis of Cataract Surgery Videos0
Density-Guided Label Smoothing for Temporal Localization of Driving Actions0
Described Spatial-Temporal Video Detection0
Detection of Sleep Apnea-Hypopnea Events Using Millimeter-wave Radar and Pulse Oximeter0
Efficient Action Detection in Untrimmed Videos via Multi-Task Learning0
Efficient Action Localization with Approximately Normalized Fisher Vectors0
Exploring State Change Capture of Heterogeneous Backbones @ Ego4D Hands and Objects Challenge 20220
Exploring Temporal Preservation Networks for Precise Temporal Action Localization0
Few-Shot Transformation of Common Actions into Time and Space0
Fine-Tuning Large Audio-Language Models with LoRA for Precise Temporal Localization of Prolonged Exposure Therapy Elements0
Fusion of Millimeter-wave Radar and Pulse Oximeter Data for Low-burden Diagnosis of Obstructive Sleep Apnea-Hypopnea Syndrome0
Identity-aware Graph Memory Network for Action Detection0
Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors0
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection0
Inceptive Event Time-Surfaces for Object Classification Using Neuromorphic Cameras0
Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences0
Learning to track for spatio-temporal action localization0
Measure Twice, Cut Once: Grasping Video Structures and Event Semantics with LLMs for Video Temporal Localization0
MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval0
Modality Shifting Attention Network for Multi-modal Video Question Answering0
Modeling Spatio-Temporal Human Track Structure for Action Localization0
Objects2action: Classifying and localizing actions without any video example0
OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog0
Optimizing Temporal Resolution Of Convolutional Recurrent Neural Networks For Sound Event Detection0
OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos0
PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization0
Pointly-Supervised Action Localization0
Poselet Key-Framing: A Model for Human Activity Recognition0
Practitioner-Centric Approach for Early Incident Detection Using Crowdsourced Data for Emergency Services0
SocialGesture: Delving into Multi-person Gesture Understanding0
Spatio-Temporal Attention Models for Grounded Video Captioning0
Structured Video-Language Modeling with Temporal Grouping and Spatial Grounding0
Spectro-Temporal RF Identification using Deep Learning0
Spot On: Action Localization from Pointly-Supervised Proposals0
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.