SOTAVerified

Action Understanding

Papers

Showing 110 of 88 papers

TitleStatusHype
LLaVA-Pose: Enhancing Human Pose and Action Understanding via Keypoint-Integrated Instruction TuningCode0
The Role of Video Generation in Enhancing Data-Limited Action Understanding0
PCBEAR: Pose Concept Bottleneck for Explainable Action Recognition0
F^3Set: Towards Analyzing Fast, Frequent, and Fine-grained Events from VideosCode1
RoboAct-CLIP: Video-Driven Pre-training of Atomic Action Understanding for Robotics0
Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery0
ScreenLLM: Stateful Screen Schema for Efficient Action Understanding and Prediction0
LLaVAction: evaluating and training multi-modal large language models for action recognitionCode2
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models0
HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding0
Show:102550
← PrevPage 1 of 9Next →

No leaderboard results yet.