SOTAVerified

Action Understanding

Papers

Showing 26-50 of 88 papers

Title | Status | Hype
Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning | - | 0
Enhancing Video Transformers for Action Understanding with VLM-aided Training | - | 0
Impact of Large Language Model Assistance on Patients Reading Clinical Notes: A Mixed-Methods Study | - | 0
Multitask Learning in Minimally Invasive Surgical Vision: A Review | - | 0
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional Sports | Code | 1
FineSports: A Multi-person Hierarchical Sports Video Dataset for Fine-grained Action Understanding | Code | 1
Open-Vocabulary Video Relation Extraction | Code | 1
Towards Weakly Supervised End-to-end Learning for Long-video Action Recognition | - | 0
Kantian Deontology Meets AI Alignment: Towards Morally Grounded Fairness Metrics | - | 0
Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action Understanding | Code | 1
Memory-and-Anticipation Transformer for Online Action Understanding | Code | 1
Prompted Contrast with Masked Motion Modeling: Towards Versatile 3D Action Representation Learning | Code | 1
Paxion: Patching Action Knowledge in Video-Language Foundation Models | Code | 1
Comparing Machines and Children: Using Developmental Psychology Experiments to Assess the Strengths and Weaknesses of LaMDA Responses | - | 0
ATTACH Dataset: Annotated Two-Handed Assembly Actions for Human Action Understanding | - | 0
From Isolated Islands to Pangea: Unifying Semantic Space for Human Action Understanding | - | 0
mRI: Multi-modal 3D Human Pose Estimation Dataset using mmWave, RGB-D, and Inertial Sensors | - | 0
ACT-Thor: A Controlled Benchmark for Embodied Action Understanding in Simulated Environments | Code | 0
Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions | Code | 1
Action Quality Assessment with Temporal Parsing Transformer | Code | 1
Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos | Code | 0
Invisible-to-Visible: Privacy-Aware Human Instance Segmentation using Airborne Ultrasound via Collaborative Learning Variational Autoencoder | - | 0
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos | Code | 1
Domain Knowledge-Informed Self-Supervised Representations for Workout Form Assessment | Code | 1
Towards Tokenized Human Dynamics Representation | Code | 1
Page 2 of 4

No leaderboard results yet.