SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 201250 of 369 papers

TitleStatusHype
Cricket stroke extraction: Towards creation of a large-scale cricket actions dataset0
Multi-modal Prompting for Low-Shot Temporal Action Localization0
Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization0
A Better Baseline for AVA0
Co-Occurrence Matters: Learning Action Relation for Temporal Action Localization0
Contrastive Language-Action Pre-training for Temporal Localization0
MVP: Robust Multi-View Practice for Driving Action Localization0
Representation Learning on Visual-Symbolic Graphs for Video Understanding0
Complementary Boundary Generator with Scale-Invariant Relation Modeling for Temporal Action Localization: Submission to ActivityNet Challenge 20200
Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling0
One-Shot Action Localization by Learning Sequence Matching Network0
CLIP-AE: CLIP-assisted Cross-view Audio-Visual Enhancement for Unsupervised Temporal Action Localization0
Open-Vocabulary Temporal Action Localization using Multimodal Guidance0
Class Semantics-based Attention for Action Detection0
OWL (Observe, Watch, Listen): Audiovisual Temporal Context for Localizing Actions in Egocentric Videos0
PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization0
Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization0
PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization0
PLOT-TAL -- Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization0
Point3D: tracking actions as moving points with 3D CNNs0
Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses0
Pointly-Supervised Action Localization0
POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization0
Precise Temporal Action Localization by Evolving Temporal Proposals0
Predicting the Where and What of Actors and Actions Through Online Action Localization0
Prior-enhanced Temporal Action Localization using Subject-aware Spatial Attention0
CFAD: Coarse-to-Fine Action Detector for Spatiotemporal Action Localization0
Progress Regression RNN for Online Spatial-Temporal Action Localization in Unconstrained Videos0
Cascade Evidential Learning for Open-World Weakly-Supervised Temporal Action Localization0
Proposal-based Temporal Action Localization with Point-level Supervision0
ProTAL: A Drag-and-Link Video Programming Framework for Temporal Action Localization0
Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization0
Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization0
Can MLLMs Guide Weakly-Supervised Temporal Action Localization Tasks?0
Real-time Spatio-temporal Action Localization via Learning Motion Representation0
CAG-QIL: Context-Aware Actionness Grouping via Q Imitation Learning for Online Temporal Action Localization0
Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition0
Online Temporal Action Localization with Memory-Augmented Transformer0
Relation Modeling in Spatio-Temporal Action Localization0
Weakly-supervised Action Localization with Background Modeling0
Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization0
Rethinking Pseudo-Label Guided Learning for Weakly Supervised Temporal Action Localization from the Perspective of Noise Correction0
Rethinking the Faster R-CNN Architecture for Temporal Action Localization0
Rethinking Top Probability from Multi-view for Distracted Driver Behaviour Localization0
Bridge the Gap: From Weak to Full Supervision for Temporal Action Localization with PseudoFormer0
Boundary Uncertainty in a Single-Stage Temporal Action Localization Network0
Revisiting Kernel Temporal Segmentation as an Adaptive Tokenizer for Long-form Video Understanding0
YH Technologies at ActivityNet Challenge 20180
SALAD: Self-Assessment Learning for Action Detection0
Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos0
Show:102550
← PrevPage 5 of 8Next →

No leaderboard results yet.