SOTAVerified

Temporal Localization

Papers

Showing 101150 of 153 papers

TitleStatusHype
Spectro-Temporal RF Identification using Deep Learning0
Spot On: Action Localization from Pointly-Supervised Proposals0
Detection of Sleep Apnea-Hypopnea Events Using Millimeter-wave Radar and Pulse Oximeter0
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector0
Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 20220
Subject Independent Emotion Recognition using EEG Signals Employing Attention Driven Neural Networks0
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives0
Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 20220
Described Spatial-Temporal Video Detection0
Team PKU-WICT-MIPL PIC Makeup Temporal Video Grounding Challenge 2022 Technical Report0
Action recognition in real-world videos0
Temporal Context Network for Activity Localization in Videos0
Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding0
Density-Guided Label Smoothing for Temporal Localization of Driving Actions0
Temporal Localization of Non-Static Digital Videos Using the Electrical Network Frequency0
Deep-Learning-Assisted Analysis of Cataract Surgery Videos0
Text-based Localization of Moments in a Video Corpus0
Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization0
Crash Time Matters: HybridMamba for Fine-Grained Temporal Localization in Traffic Surveillance Footage0
Contrastive Language-Action Pre-training for Temporal Localization0
Weakly-Supervised Temporal Localization via Occurrence Count Learning0
TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation0
To catch a chorus, verse, intro, or anything else: Analyzing a song with structural functions0
To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression0
Towards Fine-Grained Video Question Answering0
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding0
Transductive Universal Transport for Zero-Shot Action Recognition0
Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition0
Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization0
Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations0
A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports0
Universal Prototype Transport for Zero-Shot Action Recognition and Localization0
What do I Annotate Next? An Empirical Study of Active Learning for Action Localization0
Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences0
A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus0
Learning to track for spatio-temporal action localization0
Inceptive Event Time-Surfaces for Object Classification Using Neuromorphic Cameras0
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection0
Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors0
Identity-aware Graph Memory Network for Action Detection0
Unsupervised detection and classification of heartbeats using the dissimilarity matrix in PCG signals0
Measure Twice, Cut Once: Grasping Video Structures and Event Semantics with LLMs for Video Temporal Localization0
Fusion of Millimeter-wave Radar and Pulse Oximeter Data for Low-burden Diagnosis of Obstructive Sleep Apnea-Hypopnea Syndrome0
Fine-Tuning Large Audio-Language Models with LoRA for Precise Temporal Localization of Prolonged Exposure Therapy Elements0
MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval0
Modality Shifting Attention Network for Multi-modal Video Question Answering0
Modeling Spatio-Temporal Human Track Structure for Action Localization0
A Data Driven End-to-end Approach for In-the-wild Monitoring of Eating Behavior Using Smartwatches0
Few-Shot Transformation of Common Actions into Time and Space0
VADER: Video Alignment Differencing and Retrieval0
Show:102550
← PrevPage 3 of 4Next →

No leaderboard results yet.