SOTAVerified

Procedure Step Recognition

Procedure Step Recognition (PSR) focuses on recognizing whether procedural steps have been completed correctly and in what order. Traditional action recognition has no measure of whether an action succeeded. PSR instead targets a more meaningful understanding of procedural videos: recognizing the outcome of a step (is it complete? is it correct?) is often more relevant than recognizing the (partial) execution of an action.

Therefore, the objective of PSR is to estimate all procedure steps correctly performed by a person up to time $t$, based on sensory inputs $X_t=(x_t, x_{t-1}, \dots, x_{t-h})$ and a descriptive set of the procedural actions to be performed, $\mathcal{P}=\{a_0, a_1, \dots, a_n\}$. Here, $h$ is the observation horizon and $n+1$ is the total number of actions $a_i\in \mathcal{P}$ covered in the procedure.
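The formulation above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method of any listed paper: `run_psr`, `toy_detector`, and the string-valued observations are hypothetical names and data invented for the example. The sketch keeps a sliding window of the last $h+1$ observations as $X_t$ and accumulates, in order of recognition, the steps from $\mathcal{P}$ that a detector reports as completed.

```python
from collections import deque
from typing import Callable, Deque, Iterable, List, Sequence, Set


def run_psr(
    stream: Iterable[str],
    procedure: Sequence[str],
    detect_completed: Callable[[Sequence[str]], Set[str]],
    horizon: int,
) -> List[List[str]]:
    """Return, for every time t, the steps estimated as completed up to t.

    Hypothetical helper: `detect_completed` stands in for any model that maps
    the window X_t to the set of steps whose completion it observes.
    """
    # X_t = (x_t, x_{t-1}, ..., x_{t-h}); maxlen drops the oldest observation.
    window: Deque[str] = deque(maxlen=horizon + 1)
    completed: List[str] = []  # kept in order of recognition
    estimates: List[List[str]] = []
    for x_t in stream:
        window.appendleft(x_t)  # newest observation first
        detected = detect_completed(tuple(window))
        for step in procedure:
            if step in detected and step not in completed:
                completed.append(step)
        estimates.append(list(completed))
    return estimates


def toy_detector(window: Sequence[str]) -> Set[str]:
    """Toy assumption: an observation is the name of a visibly completed step, or ""."""
    return {x for x in window if x}


procedure = ["a0", "a1", "a2"]
stream = ["", "a0", "", "a2", "a1"]  # step a2 is completed before a1
estimates = run_psr(stream, procedure, toy_detector, horizon=1)
print(estimates[-1])
```

In this toy run the final estimate lists `a2` before `a1`, so the recorded order exposes a deviation from the expected procedure order, which is the kind of outcome information PSR is meant to capture.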

Papers

Showing 1 of 1 papers

| Title | Status | Hype |
| --- | --- | --- |
| IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | B3 - Synthetic Only | Delay (seconds) | 49.5 | | Unverified |
| 2 | B3 | Delay (seconds) | 22.4 | | Unverified |
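This page does not define the Delay metric. As a rough sketch, assuming it is the mean time between a step's actual completion and the moment a system recognizes it, averaged over steps recognized at or after their completion, it could be computed as follows. The function `mean_delay` and all timestamps are hypothetical and are not taken from the benchmark.

```python
from typing import Dict, Optional


def mean_delay(
    completion_times: Dict[str, float],
    recognition_times: Dict[str, float],
) -> Optional[float]:
    """Mean recognition delay in seconds (assumed definition, see lead-in).

    Only steps that were recognized at or after their actual completion
    contribute; missed or prematurely recognized steps are skipped.
    """
    delays = [
        recognition_times[step] - done_at
        for step, done_at in completion_times.items()
        if step in recognition_times and recognition_times[step] >= done_at
    ]
    if not delays:
        return None  # nothing was correctly recognized
    return sum(delays) / len(delays)


# Hypothetical timestamps in seconds; step a2 is never recognized.
completion_times = {"a0": 10.0, "a1": 20.0, "a2": 30.0}
recognition_times = {"a0": 12.0, "a1": 25.0}
delay = mean_delay(completion_times, recognition_times)
print(delay)
```

Under this assumed definition, a lower value means steps are recognized sooner after they are completed.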