SOTAVerified

Action Parsing

Action parsing is the task of assigning an action label to each frame of a video, or to a still image, describing the action it shows.

Papers

Showing 1–10 of 15 papers

Title | Status | Hype
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation | - | 0
Action parsing using context features | - | 0
Part-level Action Parsing via a Pose-guided Coarse-to-Fine Framework | - | 0
Technical Report: Disentangled Action Parsing Networks for Accurate Part-level Action Parsing | - | 0
A Baseline Framework for Part-level Action Parsing and Action Recognition | - | 0
Learning Knowledge Graph-based World Models of Textual Environments | - | 0
SSCAP: Self-supervised Co-occurrence Action Parsing for Unsupervised Temporal Action Segmentation | - | 0
Modeling Worlds in Text | Code | 1
Intra- and Inter-Action Understanding via Temporal Action Parsing | - | 0
Frontal Low-rank Random Tensors for Fine-grained Action Segmentation | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Seq2Seq | Set accuracy | 18.1 | - | Unverified
2 | CALM | Set accuracy | 13.79 | - | Unverified