SOTAVerified

Action Classification

Papers

Showing 1-50 of 457 papers

| Title | Status | Hype |
| --- | --- | --- |
| SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis |  | 0 |
| From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos | Code | 0 |
| Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition | Code | 0 |
| SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding | Code | 0 |
| Mouse Lockbox Dataset: Behavior Recognition for Mice Solving Lockboxes |  | 0 |
| Domain Adaptation of VLM for Soccer Video Understanding |  | 0 |
| CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition |  | 0 |
| OwlSight: A Robust Illumination Adaptation Framework for Dark Video Human Action Recognition |  | 0 |
| Make Your Training Flexible: Towards Deployment-Efficient Video Models | Code | 1 |
| Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues |  | 0 |
| DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification | Code | 0 |
| BoxMAC -- A Boxing Dataset for Multi-label Action Classification |  | 0 |
| FACTS: Fine-Grained Action Classification for Tactical Sports |  | 0 |
| Scaling 4D Representations |  | 0 |
| Stitch, Contrast, and Segment: Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos |  | 0 |
| Temporal Action Localization with Cross Layer Task Decoupling and Refinement | Code | 1 |
| Mining Limited Data Sufficiently: A BERT-inspired Approach for CSI Time Series Application in Wireless Communication and Sensing |  | 0 |
| KNN-MMD: Cross Domain Wireless Sensing via Local Distribution Alignment | Code | 1 |
| Proximal Control of UAVs with Federated Learning for Human-Robot Collaborative Domains |  | 0 |
| Towards Universal Soccer Video Understanding | Code | 3 |
| OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions | Code | 0 |
| Multilevel semantic and adaptive actionness learning for weakly supervised temporal action localization | Code | 0 |
| ACE: Action Concept Enhancement of Video-Language Models in Procedural Videos |  | 0 |
| IMUVIE: Pickup Timeline Action Localization via Motion Movies |  | 0 |
| Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors | Code | 0 |
| Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition | Code | 1 |
| AM Flow: Adapters for Temporal Processing in Action Recognition |  | 0 |
| Learning Video Representations without Natural Videos |  | 0 |
| YourSkatingCoach: A Figure Skating Video Benchmark for Fine-Grained Element Analysis |  | 0 |
| Are Visual-Language Models Effective in Action Recognition? A Comparative Study |  | 0 |
| Dual-Model Distillation for Efficient Action Classification with Hybrid Edge-Cloud Solution |  | 0 |
| Multi class activity classification in videos using Motion History Image generation | Code | 0 |
| Language-Assisted Human Part Motion Learning for Skeleton-Based Temporal Action Segmentation | Code | 0 |
| CrossFi: A Cross Domain Wi-Fi Sensing Framework Based on Siamese Network | Code | 1 |
| Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization | Code | 1 |
| EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition | Code | 1 |
| Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective |  | 0 |
| Do You Act Like You Talk? Exploring Pose-based Driver Action Classification with Speech Recognition Networks | Code | 0 |
| Open Vocabulary Multi-Label Video Classification |  | 0 |
| Dark Transformer: A Video Transformer for Action Recognition in the Dark |  | 0 |
| EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding | Code | 1 |
| Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition |  | 0 |
| Learning Correlation Structures for Vision Transformers |  | 0 |
| Enhancing Video Transformers for Action Understanding with VLM-aided Training |  | 0 |
| InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | Code | 7 |
| Finding the Missing Data: A BERT-inspired Approach Against Package Loss in Wireless Sensing | Code | 1 |
| VideoMamba: State Space Model for Efficient Video Understanding | Code | 5 |
| Classification of Tennis Actions Using Deep Learning |  | 0 |
| Robustness Evaluation of Machine Learning Models for Robot Arm Action Recognition in Noisy Environments |  | 0 |
| OmniVec2 - A Novel Transformer based Network for Large Scale Multimodal and Multitask Learning |  | 0 |

No leaderboard results yet.