SOTAVerified

Temporal Action Localization

Temporal Action Localization aims to detect activities in the video stream and output beginning and end timestamps. It is closely related to Temporal Action Proposal Generation.

Papers

Showing 110 of 1477 papers

TitleStatusHype
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action RecognitionCode0
Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition0
Zero-Shot Temporal Interaction Localization for Egocentric VideosCode1
A Review on Coarse to Fine-Grained Animal Action Recognition0
CLIP-AE: CLIP-assisted Cross-view Audio-Visual Enhancement for Unsupervised Temporal Action Localization0
DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity RecognitionCode0
ProTAL: A Drag-and-Link Video Programming Framework for Temporal Action Localization0
Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?Code0
Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges0
Bridge the Gap: From Weak to Full Supervision for Temporal Action Localization with PseudoFormer0
Show:102550
← PrevPage 1 of 148Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ActionFormer (SlowFast+Omnivore+EgoVLP)Average mAP21.4Unverified