SOTAVerified

Action Localization

Action Localization is finding the spatial and temporal co ordinates for an action in a video. An action localization model will identify which frame an action start and ends in video and return the x,y coordinates of an action. Further the co ordinates will change when the object performing action undergoes a displacement.

Papers

Showing 51100 of 369 papers

TitleStatusHype
Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action LocalizationCode1
TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long VideosCode1
Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action LocalizationCode1
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization TasksCode1
Few-Shot Temporal Action Localization with Query Adaptive TransformerCode1
Everything at Once - Multi-Modal Fusion Transformer for Video RetrievalCode1
Boosting Weakly-Supervised Temporal Action Localization with Text InformationCode1
VideoMix: Rethinking Data Augmentation for Video ClassificationCode1
Weakly Supervised Temporal Action Localization Using Deep Metric LearningCode1
Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge PropagationCode1
BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal GenerationCode1
FineAction: A Fine-Grained Video Dataset for Temporal Action LocalizationCode1
Enhancing Temporal Action Localization: Advanced S6 Modeling with Recurrent MechanismCode1
ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action LocalizationCode1
Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action LocalizationCode1
Multi-Granularity Hand Action DetectionCode1
CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1)Code1
Actionness Inconsistency-guided Contrastive Learning for Weakly-supervised Temporal Action LocalizationCode1
A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action LocalizationCode1
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI NavigationCode1
Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action LocalizationCode1
Chaotic World: A Large and Challenging Benchmark for Human Behavior Understanding in Chaotic EventsCode1
Learning Action Completeness from Points for Weakly-supervised Temporal Action LocalizationCode1
Learning Salient Boundary Feature for Anchor-free Temporal Action LocalizationCode1
CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive LearningCode1
Localizing the Common Action Among a Few VideosCode1
E^2TAD: An Energy-Efficient Tracking-based Action DetectorCode1
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled VideosCode1
Bottom-Up Temporal Action Localization with Mutual RegularizationCode1
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action UnderstandingCode1
ASTRA: An Action Spotting TRAnsformer for Soccer VideosCode1
Convex Combination Consistency between Neighbors for Weakly-supervised Action LocalizationCode1
Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action LocalizationCode1
Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action LocalizationCode1
Realigning Confidence with Temporal Saliency Information for Point-Level Weakly-Supervised Temporal Action LocalizationCode1
Cross-modal Consensus Network for Weakly Supervised Temporal Action LocalizationCode1
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video ClipsCode1
Cross-modal Consensus Network forWeakly Supervised Temporal Action LocalizationCode1
Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based ApproachCode1
Set-Supervised Action Learning in Procedural Task Videos via Pairwise Order ConsistencyCode1
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual ActionsCode1
ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming VideosCode1
OpenTAL: Towards Open Set Temporal Action LocalizationCode1
BABEL: Bodies, Action and Behavior with English LabelsCode1
Background-Click Supervision for Temporal Action LocalizationCode1
1st place solution for AVA-Kinetics Crossover in AcitivityNet Challenge 2020Code1
Weakly-supervised Temporal Action Localization by Uncertainty ModelingCode1
Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition0
Adaptive Perception Transformer for Temporal Action Localization0
Bridge the Gap: From Weak to Full Supervision for Temporal Action Localization with PseudoFormer0
Show:102550
← PrevPage 2 of 8Next →

No leaderboard results yet.