A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains Jul 17, 2025 Action Recognition Hand-Object Interaction Detection
— Unverified 0Zero-shot Skeleton-based Action Recognition with Prototype-guided Feature Alignment Jul 1, 2025 Action Recognition One-Shot 3D Action Recognition
Code Code Available 1EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception Jun 26, 2025 Action Recognition Active Speaker Localization
— Unverified 0Feature Hallucination for Self-supervised Action Recognition Jun 25, 2025 Action Recognition Hallucination
— Unverified 0CARMA: Context-Aware Situational Grounding of Human-Robot Group Interactions by Combining Vision-Language Models with Object and Action Recognition Jun 25, 2025 Action Recognition Decision Making
— Unverified 0Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition Jun 23, 2025 Action Recognition Skeleton Based Action Recognition
— Unverified 0Adapting Vision-Language Models for Evaluating World Models Jun 22, 2025 Action Recognition Multimodal Reasoning
— Unverified 0Active Multimodal Distillation for Few-shot Action Recognition Jun 16, 2025 Action Recognition Few-Shot action recognition
— Unverified 0Time-Unified Diffusion Policy with Action Discrimination for Robotic Manipulation Jun 11, 2025 Action Generation Action Recognition
— Unverified 0An Effective End-to-End Solution for Multimodal Action Recognition Jun 11, 2025 Action Recognition Computational Efficiency
— Unverified 0HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person Scenarios Jun 11, 2025 Action Recognition Action Segmentation
Code Code Available 0ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding Jun 2, 2025 Action Recognition Video Understanding
— Unverified 0EPFL-Smart-Kitchen-30: Densely annotated cooking dataset with 3D kinematics to challenge video and language models Jun 2, 2025 Action Recognition Action Segmentation
Code Code Available 1A Review on Coarse to Fine-Grained Animal Action Recognition Jun 1, 2025 Action Recognition Animal Action Recognition
— Unverified 03D Skeleton-Based Action Recognition: A Review Jun 1, 2025 Action Recognition Data Augmentation
— Unverified 0EgoExOR: An Ego-Exo-Centric Operating Room Dataset for Surgical Activity Understanding May 30, 2025 Action Recognition Graph Generation
Code Code Available 1Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition May 29, 2025 Action Classification Action Recognition
Code Code Available 0The Role of Video Generation in Enhancing Data-Limited Action Understanding May 26, 2025 Action Recognition Action Understanding
— Unverified 0PHI: Bridging Domain Shift in Long-Term Action Quality Assessment via Progressive Hierarchical Instruction May 26, 2025 Action Quality Assessment Action Recognition
Code Code Available 0Temporal Consistency Constrained Transferable Adversarial Attacks with Background Mixup for Action Recognition May 23, 2025 Action Recognition Adversarial Attack
Code Code Available 0Leveraging Foundation Models for Multimodal Graph-Based Action Recognition May 21, 2025 Action Recognition Graph Attention
— Unverified 0Egocentric Action-aware Inertial Localization in Point Clouds May 20, 2025 Action Recognition
Code Code Available 0Are Spatial-Temporal Graph Convolution Networks for Human Action Recognition Over-Parameterized? May 15, 2025 Action Recognition Temporal Action Localization
Code Code Available 0Mission Balance: Generating Under-represented Class Samples using Video Diffusion Models May 14, 2025 Action Recognition
Code Code Available 0Reinforcement Learning meets Masked Video Modeling : Trajectory-Guided Adaptive Token Selection May 13, 2025 Action Recognition Optical Flow Estimation
— Unverified 0A Short Overview of Multi-Modal Wi-Fi Sensing May 10, 2025 Action Recognition Crowd Counting
— Unverified 0Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition May 9, 2025 Action Recognition cross-modal alignment
Code Code Available 0DetReIDX: A Stress-Test Dataset for Real-World UAV-Based Person Recognition May 7, 2025 Action Recognition Federated Lifelong Person ReID
Code Code Available 0Direct Motion Models for Assessing Generated Videos Apr 30, 2025 Action Recognition
— Unverified 0Talk is Not Always Cheap: Promoting Wireless Sensing Models with Text Prompts Apr 20, 2025 Action Localization Action Recognition
Code Code Available 0Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization Apr 19, 2025 Action Recognition Activity Recognition
— Unverified 0Are you SURE? Enhancing Multimodal Pretraining with Missing Modalities through Uncertainty Estimation Apr 18, 2025 Action Recognition Genre classification
— Unverified 0PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition Apr 17, 2025 Action Recognition Action Understanding
— Unverified 0SkeletonX: Data-Efficient Skeleton-based Action Recognition via Cross-sample Feature Aggregation Apr 16, 2025 Action Recognition One-Shot 3D Action Recognition
Code Code Available 1H-MoRe: Learning Human-centric Motion Representation for Action Analysis Apr 14, 2025 Action Analysis Action Recognition
Code Code Available 0Knowledge Distillation for Multimodal Egocentric Action Recognition Robust to Missing Modalities Apr 11, 2025 Action Recognition Knowledge Distillation
— Unverified 0Towards Micro-Action Recognition with Limited Annotations: An Asynchronous Pseudo Labeling and Training Approach Apr 10, 2025 Action Recognition Micro-Action Recognition
— Unverified 0Temporal Alignment-Free Video Matching for Few-shot Action Recognition Apr 8, 2025 Action Recognition Few-Shot action recognition
Code Code Available 1MultiSensor-Home: A Wide-area Multi-modal Multi-view Dataset for Action Recognition and Transformer-based Sensor Fusion Apr 3, 2025 Action Recognition Human Detection
Code Code Available 0MultiTSF: Transformer-based Sensor Fusion for Human-Centric Multi-view and Multi-modal Action Recognition Apr 3, 2025 Action Recognition Human Detection
— Unverified 0LSC-ADL: An Activity of Daily Living (ADL)-Annotated Lifelog Dataset Generated via Semi-Automatic Clustering Apr 2, 2025 Action Recognition Activity Recognition
— Unverified 0Is Temporal Prompting All We Need For Limited Labeled Action Recognition? Apr 2, 2025 Action Recognition All
— Unverified 0CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition Mar 30, 2025 Action Classification Action Recognition
— Unverified 0OwlSight: A Robust Illumination Adaptation Framework for Dark Video Human Action Recognition Mar 30, 2025 Action Classification Action Recognition
— Unverified 0Action Recognition in Real-World Ambient Assisted Living Environment Mar 29, 2025 Action Recognition Computational Efficiency
Code Code Available 0ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection Mar 28, 2025 Action Recognition Human-Object Interaction Detection
— Unverified 0Siformer: Feature-isolated Transformer for Efficient Skeleton-based Sign Language Recognition Mar 26, 2025 Action Recognition Computational Efficiency
Code Code Available 1fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models Mar 25, 2025 Action Recognition Surgical phase recognition
— Unverified 0Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings Mar 25, 2025 4k Action Recognition
Code Code Available 2LLaVAction: evaluating and training multi-modal large language models for action recognition Mar 24, 2025 Action Recognition Action Understanding
Code Code Available 2