Action Recognition In Videos
Action Recognition in Videos is a task in computer vision and pattern recognition where the goal is to identify and categorize human actions performed in a video sequence. The task involves analyzing the spatiotemporal dynamics of the actions and mapping them to a predefined set of action classes, such as running, jumping, or swimming.
Papers
Showing 1–10 of 124 papers
All datasetsJester (Gesture Recognition)PKU-MMDUCF101Something-Something V2Kinetics 400AVA v2.2FS-Something-Something V2-FullFS-Something-Something V2-SmallSports-1MTHUMOS14ActivityNetAVA v2.1
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Florence | Top-1 Accuracy | 86.5 | — | Unverified |
| 2 | ActionCLIP (ViT-B/16) | Top-1 Accuracy | 83.8 | — | Unverified |
| 3 | Frozen Backbone, SwinV2-G-ext22K (Video-Swin) | Top-1 Accuracy | 81.7 | — | Unverified |