SOTAVerified

Temporal Localization

Papers

Showing 126-150 of 153 papers

Title | Status | Hype
Hierarchical Deep Residual Reasoning for Temporal Moment Localization | Code | 0
Adapting to the Unknown: Training-Free Audio-Visual Event Perception with Dynamic Thresholds | Code | 0
Skeleton-Based Human Action Recognition with Noisy Labels | Code | 0
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action Recognition | Code | 0
SoftLoc: Robust Temporal Localization under Label Misalignment | Code | 0
Transforming faces into video stories -- VideoFace2.0 | Code | 0
Hierarchical and Multimodal Data for Daily Activity Understanding | Code | 0
HACS: Human Action Clips and Segments Dataset for Recognition and Temporal Localization | Code | 0
Weakly Supervised Action Localization by Sparse Temporal Pooling Network | Code | 0
Semi-supervised Active Learning for Video Action Detection | Code | 0
UnLoc: A Unified Framework for Video Localization Tasks | Code | 0
Do Current Video LLMs Have Strong OCR Abilities? A Preliminary Study | Code | 0
Accelerating COVID-19 Differential Diagnosis with Explainable Ultrasound Image Analysis | Code | 0
Dense Video Object Captioning from Disjoint Supervision | Code | 0
TadML: A fast temporal action detection with Mechanics-MLP | Code | 0
RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization | Code | 0
Weakly Supervised Multiple Instance Learning for Whale Call Detection and Temporal Localization in Long-Duration Passive Acoustic Monitoring | Code | 0
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs | Code | 0
A Bottom-up method Towards the Automatic and Objective Monitoring of Smoking Behavior In-the-wild using Wrist-mounted Inertial Sensors | Code | 0
Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images | Code | 0
Online Human Action Detection using Joint Classification-Regression Recurrent Neural Networks | Code | 0
Technical Report of the Video Event Reconstruction and Analysis (VERA) System -- Shooter Localization, Models, Interface, and Beyond | Code | 0
NAAQA: A Neural Architecture for Acoustic Question Answering | Code | 0
VideoGLUE: Video General Understanding Evaluation of Foundation Models | Code | 0
Multi-attention Networks for Temporal Localization of Video-level Labels | Code | 0

No leaderboard results yet.