SOTAVerified

Action Understanding

Papers

Showing 150 of 88 papers

TitleStatusHype
LLaVAction: evaluating and training multi-modal large language models for action recognitionCode2
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow UnderstandingCode2
Paxion: Patching Action Knowledge in Video-Language Foundation ModelsCode1
Home Action Genome: Cooperative Compositional Action UnderstandingCode1
PIANO: A Parametric Hand Bone Model from Magnetic Resonance ImagingCode1
F^3Set: Towards Analyzing Fast, Frequent, and Fine-grained Events from VideosCode1
YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific VideosCode1
Open-Vocabulary Video Relation ExtractionCode1
LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task ActivitiesCode1
Temporal Relational Modeling with Self-Supervision for Action SegmentationCode1
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional VideosCode1
Towards Tokenized Human Dynamics RepresentationCode1
Unified Multi-modal Unsupervised Representation Learning for Skeleton-based Action UnderstandingCode1
Memory-and-Anticipation Transformer for Online Action UnderstandingCode1
Video Pose Distillation for Few-Shot, Fine-Grained Sports Action RecognitionCode1
Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic ActionsCode1
Action Quality Assessment with Temporal Parsing TransformerCode1
Prompted Contrast with Masked Motion Modeling: Towards Versatile 3D Action Representation LearningCode1
Detailed 2D-3D Joint Representation for Human-Object InteractionCode1
FineParser: A Fine-grained Spatio-temporal Action Parser for Human-centric Action Quality AssessmentCode1
FineSports: A Multi-person Hierarchical Sports Video Dataset for Fine-grained Action UnderstandingCode1
Domain Knowledge-Informed Self-Supervised Representations for Workout Form AssessmentCode1
SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning StabilizationCode1
EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action UnderstandingCode1
Language-Assisted Skeleton Action Understanding for Skeleton-Based Temporal Action SegmentationCode1
Sports-QA: A Large-Scale Video Question Answering Benchmark for Complex and Professional SportsCode1
Few-Shot Fine-Grained Action Recognition via Bidirectional Attention and Contrastive Meta-LearningCode0
Action Recognition with Trajectory-Pooled Deep-Convolutional DescriptorsCode0
Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled VideosCode0
ACT-Thor: A Controlled Benchmark for Embodied Action Understanding in Simulated EnvironmentsCode0
Win-Fail Action RecognitionCode0
Online Spatiotemporal Action Detection and Prediction via Causal RepresentationsCode0
LLaVA-Pose: Enhancing Human Pose and Action Understanding via Keypoint-Integrated Instruction TuningCode0
Video Action UnderstandingCode0
Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and BeyondCode0
mRI: Multi-modal 3D Human Pose Estimation Dataset using mmWave, RGB-D, and Inertial Sensors0
Multitask Learning in Minimally Invasive Surgical Vision: A Review0
PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition0
PKU-MMD: A Large Scale Benchmark for Continuous Multi-Modal Human Action Understanding0
Probing Fine-Grained Action Understanding and Cross-View Generalization of Foundation Models0
Region-aware Image-based Human Action Retrieval with Transformers0
RoboAct-CLIP: Video-Driven Pre-training of Atomic Action Understanding for Robotics0
Scene Understanding for Autonomous Manipulation with Deep Learning0
ScreenLLM: Stateful Screen Schema for Efficient Action Understanding and Prediction0
Self-supervised Discovery of Human Actons from Long Kinematic Videos0
Social-MAE: Social Masked Autoencoder for Multi-person Motion Representation Learning0
STPro: Spatial and Temporal Progressive Learning for Weakly Supervised Spatio-Temporal Grounding0
Theory of Minds: Understanding Behavior in Groups Through Inverse Planning0
The Role of Video Generation in Enhancing Data-Limited Action Understanding0
About Time: Advances, Challenges, and Outlooks of Action Understanding0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.