SOTAVerified

Temporal Action Localization

Temporal Action Localization aims to detect activities in the video stream and output beginning and end timestamps. It is closely related to Temporal Action Proposal Generation.

Papers

Showing 110 of 1477 papers

TitleStatusHype
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action RecognitionCode0
Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition0
Zero-Shot Temporal Interaction Localization for Egocentric VideosCode1
A Review on Coarse to Fine-Grained Animal Action Recognition0
CLIP-AE: CLIP-assisted Cross-view Audio-Visual Enhancement for Unsupervised Temporal Action Localization0
DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity RecognitionCode0
ProTAL: A Drag-and-Link Video Programming Framework for Temporal Action Localization0
Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized?Code0
Action Spotting and Precise Event Detection in Sports: Datasets, Methods, and Challenges0
Bridge the Gap: From Weak to Full Supervision for Temporal Action Localization with PseudoFormer0
Show:102550
← PrevPage 1 of 148Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1RDFA-S6 (InternVideo2-6B)mAP29.6Unverified
2ActionMamba(InternVideo2-6B)mAP29.04Unverified
3InternVideo2-6BmAP27.7Unverified
4DyFADet (VideoMAE v2-g)mAP23.8Unverified
5VideoMAE V2-gmAP18.24Unverified
6InternVideomAP17.57Unverified
7BMN (i3d feaure)mAP9.25Unverified
8G-TAD (i3d feature)mAP9.06Unverified
9DBG (i3d feature)mAP6.75Unverified