SOTAVerified

Action Parsing

Action parsing is the task of assigning a label to each frame of a video, or to a still image, that describes the action shown in it.

Papers

Showing 1–10 of 15 papers

Title | Status | Hype
Modeling Worlds in Text | Code | 1
Local Temporal Bilinear Pooling for Fine-grained Action Parsing | Code | 1
Frontal Low-rank Random Tensors for Fine-grained Action Segmentation | Code | 0
An Expressive Deep Model for Human Action Parsing from A Single Image | | 0
DAP3D-Net: Where, What and How Actions Occur in Videos? | | 0
IncSQL: Training Incremental Text-to-SQL Parsers with Non-Deterministic Oracles | | 0
Intra- and Inter-Action Understanding via Temporal Action Parsing | | 0
Learning Knowledge Graph-based World Models of Textual Environments | | 0
Part-level Action Parsing via a Pose-guided Coarse-to-Fine Framework | | 0
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Seq2Seq | Set accuracy | 18.1 | | Unverified
2 | CALM | Set accuracy | 13.79 | | Unverified