SOTAVerified

Temporal Localization

Papers

Showing 2650 of 153 papers

TitleStatusHype
TimeRefine: Temporal Grounding with Time Refining Video LLMCode0
TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization AbilityCode2
Number it: Temporal Grounding Videos like Flipping MangaCode2
Unsupervised detection and classification of heartbeats using the dissimilarity matrix in PCG signals0
Detection of Sleep Apnea-Hypopnea Events Using Millimeter-wave Radar and Pulse Oximeter0
Training-free Video Temporal Grounding using Large-scale Pre-trained ModelsCode1
Impact of Noisy Labels on Sound Event Detection: Deletion Errors Are More Detrimental Than Insertion Errors0
Described Spatial-Temporal Video Detection0
Meerkat: Audio-Visual Large Language Model for Grounding in Space and TimeCode1
MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval0
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow UnderstandingCode2
LITA: Language Instructed Temporal-Localization AssistantCode2
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding0
Skeleton-Based Human Action Recognition with Noisy LabelsCode0
Density-Guided Label Smoothing for Temporal Localization of Driving Actions0
Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition0
OLViT: Multi-Modal State Tracking via Attention-Based Embeddings for Video-Grounded Dialog0
Semi-supervised Active Learning for Video Action DetectionCode0
Deep-Learning-Assisted Analysis of Cataract Surgery Videos0
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video UnderstandingCode2
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives0
Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization0
UnLoc: A Unified Framework for Video Localization TasksCode0
VideoGLUE: Video General Understanding Evaluation of Foundation ModelsCode0
Dense Video Object Captioning from Disjoint SupervisionCode0
Show:102550
← PrevPage 2 of 7Next →

No leaderboard results yet.