SOTAVerified

Temporal Localization

Papers

Showing 101150 of 153 papers

TitleStatusHype
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector0
Structured Video Tokens @ Ego4D PNR Temporal Localization Challenge 20220
Subject Independent Emotion Recognition using EEG Signals Employing Attention Driven Neural Networks0
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives0
Team PKU-WICT-MIPL PIC Makeup Temporal Video Grounding Challenge 2022 Technical Report0
Temporal Context Network for Activity Localization in Videos0
Text-based Localization of Moments in a Video Corpus0
TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation0
To catch a chorus, verse, intro, or anything else: Analyzing a song with structural functions0
To Find Where You Talk: Temporal Sentence Localization in Video with Attention Based Location Regression0
Towards Fine-Grained Video Question Answering0
Transductive Universal Transport for Zero-Shot Action Recognition0
Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition0
Universal Prototype Transport for Zero-Shot Action Recognition and Localization0
Unsupervised detection and classification of heartbeats using the dissimilarity matrix in PCG signals0
VADER: Video Alignment Differencing and Retrieval0
Video Anomaly Detection for Smart Surveillance0
Video Swin Transformers for Egocentric Video Understanding @ Ego4D Challenges 20220
Watch and Learn: Leveraging Expert Knowledge and Language for Surgical Video Understanding0
Weakly-Supervised Temporal Localization via Occurrence Count Learning0
What do I Annotate Next? An Empirical Study of Active Learning for Action Localization0
ATARS: An Aerial Traffic Atomic Activity Recognition and Temporal Segmentation DatasetCode0
Asynchronous Temporal Fields for Action RecognitionCode0
Am I Done? Predicting Action Progress in VideosCode0
When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in VlogsCode0
Hierarchical Deep Residual Reasoning for Temporal Moment LocalizationCode0
Adapting to the Unknown: Training-Free Audio-Visual Event Perception with Dynamic ThresholdsCode0
Skeleton-Based Human Action Recognition with Noisy LabelsCode0
Why Can't I Dance in the Mall? Learning to Mitigate Scene Bias in Action RecognitionCode0
SoftLoc: Robust Temporal Localization under Label MisalignmentCode0
Transforming faces into video stories -- VideoFace2.0Code0
Hierarchical and Multimodal Data for Daily Activity UnderstandingCode0
HACS: Human Action Clips and Segments Dataset for Recognition and Temporal LocalizationCode0
Weakly Supervised Action Localization by Sparse Temporal Pooling NetworkCode0
Semi-supervised Active Learning for Video Action DetectionCode0
UnLoc: A Unified Framework for Video Localization TasksCode0
Do Current Video LLMs Have Strong OCR Abilities? A Preliminary StudyCode0
Accelerating COVID-19 Differential Diagnosis with Explainable Ultrasound Image AnalysisCode0
Dense Video Object Captioning from Disjoint SupervisionCode0
TadML: A fast temporal action detection with Mechanics-MLPCode0
RefineLoc: Iterative Refinement for Weakly-Supervised Action LocalizationCode0
Weakly Supervised Multiple Instance Learning for Whale Call Detection and Temporal Localization in Long-Duration Passive Acoustic MonitoringCode0
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNsCode0
A Bottom-up method Towards the Automatic and Objective Monitoring of Smoking Behavior In-the-wild using Wrist-mounted Inertial SensorsCode0
Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web ImagesCode0
Online Human Action Detection using Joint Classification-Regression Recurrent Neural NetworksCode0
Technical Report of the Video Event Reconstruction and Analysis (VERA) System -- Shooter Localization, Models, Interface, and BeyondCode0
NAAQA: A Neural Architecture for Acoustic Question AnsweringCode0
VideoGLUE: Video General Understanding Evaluation of Foundation ModelsCode0
Multi-attention Networks for Temporal Localization of Video-level LabelsCode0
Show:102550
← PrevPage 3 of 4Next →

No leaderboard results yet.