Action Recognition In Videos
Action Recognition in Videos is a task in computer vision and pattern recognition where the goal is to identify and categorize human actions performed in a video sequence. The task involves analyzing the spatiotemporal dynamics of the actions and mapping them to a predefined set of action classes, such as running, jumping, or swimming.
Papers
Showing 1–10 of 124 papers
All datasetsJester (Gesture Recognition)PKU-MMDUCF101Something-Something V2Kinetics 400AVA v2.2FS-Something-Something V2-FullFS-Something-Something V2-SmallSports-1MTHUMOS14ActivityNetAVA v2.1
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | STM (16 frames, ImageNet pretraining) | Top-1 Accuracy | 64.2 | — | Unverified |
| 2 | CPNet Res34, 5 CP | Top-1 Accuracy | 57.65 | — | Unverified |
| 3 | 2-Stream TRN | Top-1 Accuracy | 55.52 | — | Unverified |
| 4 | DIN | Top-1 Accuracy | 34.11 | — | Unverified |