SOTAVerified

Action Understanding

Papers

Showing 2650 of 88 papers

TitleStatusHype
Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action UnderstandingCode1
Human Action Segmentation With Hierarchical Supervoxel Consistency0
HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding0
Impact of Large Language Model Assistance on Patients Reading Clinical Notes: A Mixed-Methods Study0
Intra- and Inter-Action Understanding via Temporal Action Parsing0
Invisible-to-Visible: Privacy-Aware Human Instance Segmentation using Airborne Ultrasound via Collaborative Learning Variational Autoencoder0
JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection0
Kantian Deontology Meets AI Alignment: Towards Morally Grounded Fairness Metrics0
MacDiff: Unified Skeleton Modeling with Masked Conditional Diffusion0
MMAct: A Large-Scale Dataset for Cross Modal Human Action Understanding0
mRI: Multi-modal 3D Human Pose Estimation Dataset using mmWave, RGB-D, and Inertial Sensors0
Multitask Learning in Minimally Invasive Surgical Vision: A Review0
PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition0
PKU-MMD: A Large Scale Benchmark for Continuous Multi-Modal Human Action Understanding0
Probing Fine-Grained Action Understanding and Cross-View Generalization of Foundation Models0
Region-aware Image-based Human Action Retrieval with Transformers0
RoboAct-CLIP: Video-Driven Pre-training of Atomic Action Understanding for Robotics0
Scene Understanding for Autonomous Manipulation with Deep Learning0
ScreenLLM: Stateful Screen Schema for Efficient Action Understanding and Prediction0
Self-supervised Discovery of Human Actons from Long Kinematic Videos0
Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning0
STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding0
The SkatingVerse Workshop & Challenge: Methods and Results0
Action Understanding with Multiple Classes of Actors0
Actor and Action Modular Network for Text-based Video Segmentation0
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.