| SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis | Jun 9, 2025 | Action ClassificationBenchmarking | —Unverified | 0 |
| From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos | Jun 5, 2025 | Action ClassificationComposed Video Retrieval (CoVR) | CodeCode Available | 0 |
| Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition | May 29, 2025 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding | May 22, 2025 | Action ClassificationAutomatic Speech Recognition | CodeCode Available | 0 |
| Mouse Lockbox Dataset: Behavior Recognition for Mice Solving Lockboxes | May 21, 2025 | Action ClassificationPose Tracking | —Unverified | 0 |
| Domain Adaptation of VLM for Soccer Video Understanding | May 20, 2025 | Action ClassificationDomain Adaptation | —Unverified | 0 |
| CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition | Mar 30, 2025 | Action ClassificationAction Recognition | —Unverified | 0 |
| OwlSight: A Robust Illumination Adaptation Framework for Dark Video Human Action Recognition | Mar 30, 2025 | Action ClassificationAction Recognition | —Unverified | 0 |
| Make Your Training Flexible: Towards Deployment-Efficient Video Models | Mar 18, 2025 | Action ClassificationZero-Shot Video Retrieval | CodeCode Available | 1 |
| Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues | Feb 1, 2025 | Action ClassificationAction Localization | —Unverified | 0 |
| DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification | Jan 1, 2025 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| BoxMAC -- A Boxing Dataset for Multi-label Action Classification | Dec 24, 2024 | Action Classification | —Unverified | 0 |
| FACTS: Fine-Grained Action Classification for Tactical Sports | Dec 21, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| Scaling 4D Representations | Dec 19, 2024 | Action ClassificationCamera Pose Estimation | —Unverified | 0 |
| Stitch Contrast and Segment_Learning a Human Action Segmentation Model Using Trimmed Skeleton Videos | Dec 19, 2024 | Action ClassificationAction Localization | —Unverified | 0 |
| Temporal Action Localization with Cross Layer Task Decoupling and Refinement | Dec 12, 2024 | Action ClassificationAction Localization | CodeCode Available | 1 |
| Mining Limited Data Sufficiently: A BERT-inspired Approach for CSI Time Series Application in Wireless Communication and Sensing | Dec 9, 2024 | Action ClassificationDeep Learning | —Unverified | 0 |
| KNN-MMD: Cross Domain Wireless Sensing via Local Distribution Alignment | Dec 6, 2024 | Action ClassificationAction Classification (1-shot) | CodeCode Available | 1 |
| Proximal Control of UAVs with Federated Learning for Human-Robot Collaborative Domains | Dec 3, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| Towards Universal Soccer Video Understanding | Dec 2, 2024 | Action ClassificationSports Understanding | CodeCode Available | 3 |
| OccludeNet: A Causal Journey into Mixed-View Actor-Centric Video Action Recognition under Occlusions | Nov 24, 2024 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Multilevel semantic and adaptive actionness learning for weakly supervised temporal action localization | Nov 24, 2024 | Action ClassificationAction Localization | CodeCode Available | 0 |
| ACE: Action Concept Enhancement of Video-Language Models in Procedural Videos | Nov 23, 2024 | Action ClassificationClassification | —Unverified | 0 |
| IMUVIE: Pickup Timeline Action Localization via Motion Movies | Nov 19, 2024 | Action ClassificationAction Localization | —Unverified | 0 |
| Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors | Nov 17, 2024 | Action ClassificationSegmentation | CodeCode Available | 0 |
| Autoregressive Adaptive Hypergraph Transformer for Skeleton-based Activity Recognition | Nov 8, 2024 | Action ClassificationActivity Recognition | CodeCode Available | 1 |
| AM Flow: Adapters for Temporal Processing in Action Recognition | Nov 4, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| Learning Video Representations without Natural Videos | Oct 31, 2024 | Action ClassificationDiversity | —Unverified | 0 |
| YourSkatingCoach: A Figure Skating Video Benchmark for Fine-Grained Element Analysis | Oct 27, 2024 | Action Classification | —Unverified | 0 |
| Are Visual-Language Models Effective in Action Recognition? A Comparative Study | Oct 22, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| Dual-Model Distillation for Efficient Action Classification with Hybrid Edge-Cloud Solution | Oct 16, 2024 | Action Classification | —Unverified | 0 |
| Multi class activity classification in videos using Motion History Image generation | Oct 13, 2024 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Language-Assisted Human Part Motion Learning for Skeleton-Based Temporal Action Segmentation | Oct 8, 2024 | Action ClassificationAction Segmentation | CodeCode Available | 0 |
| CrossFi: A Cross Domain Wi-Fi Sensing Framework Based on Siamese Network | Aug 20, 2024 | Action ClassificationAction Classification (1-shot) | CodeCode Available | 1 |
| Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization | Aug 12, 2024 | Action ClassificationAction Localization | CodeCode Available | 1 |
| EPAM-Net: An Efficient Pose-driven Attention-guided Multimodal Network for Video Action Recognition | Aug 10, 2024 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Self-Supervised Video Representation Learning in a Heuristic Decoupled Perspective | Jul 19, 2024 | Action ClassificationContrastive Learning | —Unverified | 0 |
| Do You Act Like You Talk? Exploring Pose-based Driver Action Classification with Speech Recognition Networks | Jul 15, 2024 | Action ClassificationData Augmentation | CodeCode Available | 0 |
| Open Vocabulary Multi-Label Video Classification | Jul 12, 2024 | Action ClassificationClassification | —Unverified | 0 |
| Dark Transformer: A Video Transformer for Action Recognition in the Dark | Jun 25, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding | Jun 13, 2024 | Action ClassificationAction Localization | CodeCode Available | 1 |
| Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition | Apr 30, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| Learning Correlation Structures for Vision Transformers | Apr 5, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| Enhancing Video Transformers for Action Understanding with VLM-aided Training | Mar 24, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | Mar 22, 2024 | Action ClassificationAction Recognition | CodeCode Available | 7 |
| Finding the Missing Data: A BERT-inspired Approach Against Package Loss in Wireless Sensing | Mar 19, 2024 | Action ClassificationDeep Learning | CodeCode Available | 1 |
| VideoMamba: State Space Model for Efficient Video Understanding | Mar 11, 2024 | Action ClassificationMamba | CodeCode Available | 5 |
| Classification of Tennis Actions Using Deep Learning | Feb 4, 2024 | Action ClassificationClassification | —Unverified | 0 |
| Robustness Evaluation of Machine Learning Models for Robot Arm Action Recognition in Noisy Environments | Jan 17, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| OmniVec2 - A Novel Transformer based Network for Large Scale Multimodal and Multitask Learning | Jan 1, 2024 | 3D Point Cloud ClassificationAction Classification | —Unverified | 0 |