Temporal Action Localization
Temporal Action Localization aims to detect activities in the video stream and output beginning and end timestamps. It is closely related to Temporal Action Proposal Generation.
Papers
Showing 1–10 of 1477 papers
All datasetsTHUMOS14ActivityNet-1.3HACSFineActionMultiTHUMOSCrossTaskEPIC-KITCHENS-100MUSESActivityNet-1.2Ego4D MQ testEgo4D MQ valMEXaction2
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | RDFA-S6 (InternVideo2-6B) | Average-mAP | 45.8 | — | Unverified |
| 2 | ActionMamba(InternVideo2-6B) | Average-mAP | 44.56 | — | Unverified |
| 3 | DyFADet(VideoMAEv2) | Average-mAP | 44.3 | — | Unverified |
| 4 | InternVideo2-6B | Average-mAP | 43.3 | — | Unverified |
| 5 | TriDet (VideoMAEv2) | Average-mAP | 43.1 | — | Unverified |
| 6 | InternVideo2-1B | Average-mAP | 42.4 | — | Unverified |
| 7 | InternVideo | Average-mAP | 41.55 | — | Unverified |
| 8 | TriDet (SlowFast) | Average-mAP | 38.6 | — | Unverified |
| 9 | TriDet (I3D RGB) | Average-mAP | 36.8 | — | Unverified |
| 10 | TadTr (I3D RGB) | Average-mAP | 32.09 | — | Unverified |