SOTAVerified

Temporal Localization

Papers

Showing 126-150 of 153 papers

Title | Status | Hype
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding | | 0
Transductive Universal Transport for Zero-Shot Action Recognition | | 0
Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition | | 0
Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization | | 0
Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations | | 0
A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports | | 0
Universal Prototype Transport for Zero-Shot Action Recognition and Localization | | 0
What do I Annotate Next? An Empirical Study of Active Learning for Action Localization | | 0
Joint Visual-Temporal Embedding for Unsupervised Learning of Actions in Untrimmed Sequences | | 0
A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus | | 0
Learning to track for spatio-temporal action localization | | 0
Inceptive Event Time-Surfaces for Object Classification Using Neuromorphic Cameras | | 0
Impact of temporal resolution on convolutional recurrent networks for audio tagging and sound event detection | | 0
Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors | | 0
Identity-aware Graph Memory Network for Action Detection | | 0
Unsupervised detection and classification of heartbeats using the dissimilarity matrix in PCG signals | | 0
Measure Twice, Cut Once: Grasping Video Structures and Event Semantics with LLMs for Video Temporal Localization | | 0
Fusion of Millimeter-wave Radar and Pulse Oximeter Data for Low-burden Diagnosis of Obstructive Sleep Apnea-Hypopnea Syndrome | | 0
Fine-Tuning Large Audio-Language Models with LoRA for Precise Temporal Localization of Prolonged Exposure Therapy Elements | | 0
MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval | | 0
Modality Shifting Attention Network for Multi-modal Video Question Answering | | 0
Modeling Spatio-Temporal Human Track Structure for Action Localization | | 0
A Data Driven End-to-end Approach for In-the-wild Monitoring of Eating Behavior Using Smartwatches | | 0
Few-Shot Transformation of Common Actions into Time and Space | | 0
VADER: Video Alignment Differencing and Retrieval | | 0
Page 6 of 7

No leaderboard results yet.