SOTAVerified

Zero-Shot Action Recognition

Papers

Showing 110 of 83 papers

TitleStatusHype
The Role of Video Generation in Enhancing Data-Limited Action Understanding0
Can masking background and object reduce static bias for zero-shot action recognition?0
Semantic-guided Cross-Modal Prompt Learning for Skeleton-based Zero-shot Action Recognition0
Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIPCode1
LoCATe-GAT: Modeling Multi-Scale Local Context and Action Relationships for Zero-Shot Action RecognitionCode0
TDSM: Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action RecognitionCode1
Zero-Shot Action Recognition in Surveillance Videos0
Continual Learning Improves Zero-Shot Action Recognition0
Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment0
Text-Enhanced Zero-Shot Action Recognition: A training-free approach0
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OTI(ViT-L/14)Top-1 Accuracy92.8Unverified
2IMP-MoE-LTop-1 Accuracy91.5Unverified
3MOV (ViT-L/14)Top-1 Accuracy87.1Unverified
4VideoCoCaTop-1 Accuracy86.6Unverified
5BIKETop-1 Accuracy86.6Unverified
6Text4VisTop-1 Accuracy85.8Unverified
7TC-CLIPTop-1 Accuracy85.4Unverified
8EVA-CLIP-E/14+Top-1 Accuracy83.1Unverified
9MOV (ViT-B/16)Top-1 Accuracy82.6Unverified
10OSTTop-1 Accuracy79.7Unverified
#ModelMetricClaimedVerifiedStatus
1MOV (ViT-L/14)Top-1 Accuracy64.7Unverified
2OTI(ViT-L/14)Top-1 Accuracy64Unverified
3BIKETop-1 Accuracy61.4Unverified
4MOV (ViT-B/16)Top-1 Accuracy60.8Unverified
5IMP-MoE-LTop-1 Accuracy59.1Unverified
6VideoCoCaTop-1 Accuracy58.7Unverified
7Text4VisTop-1 Accuracy58.4Unverified
8TC-CLIPTop-1 Accuracy56Unverified
9OSTTop-1 Accuracy55.9Unverified
10MAXITop-1 Accuracy52.3Unverified
#ModelMetricClaimedVerifiedStatus
1TC-CLIPTop-1 Accuracy78.1Unverified
2IMP-MoE-LTop-1 Accuracy76.8Unverified
3OSTTop-1 Accuracy75.1Unverified
4MAXITop-1 Accuracy71.6Unverified
5OTI(ViT-L/14)Top-1 Accuracy70.6Unverified
6VideoCoCaTop-1 Accuracy70.1Unverified
7Text4VisTop-1 Accuracy68.9Unverified
8BIKETop-1 Accuracy68.5Unverified
9X-CLIPTop-1 Accuracy65.2Unverified
10LanguageBindTop-1 Accuracy64.1Unverified
#ModelMetricClaimedVerifiedStatus
1SPOTTop-1 Accuracy68.7Unverified
2CLASTERTop-1 Accuracy68.4Unverified
3ER-ZSARTop-1 Accuracy60.2Unverified
4ZSECOCTop-1 Accuracy59.8Unverified
5TS-GCNTop-1 Accuracy56.5Unverified
6SJE(Atrribute)Top-1 Accuracy47.5Unverified
7MTETop-1 Accuracy44.3Unverified
8ESZSLTop-1 Accuracy39.6Unverified
9SJE(Word Embedding)Top-1 Accuracy28.6Unverified
#ModelMetricClaimedVerifiedStatus
1BIKETop-1 Accuracy86.2Unverified
2Text4VisTop-1 Accuracy84.6Unverified
3LoCATe-GATTop-1 Accuracy73.8Unverified
4ResTTop-1 Accuracy32.5Unverified
5E2ETop-1 Accuracy26.6Unverified
#ModelMetricClaimedVerifiedStatus
1MSQNetmAP35.59Unverified
2VideoCoCamAP25.8Unverified
3MAXImAP23.8Unverified
4CLIP-Hitchhiker (ViT-B/16, 32 frames)mAP21.1Unverified
#ModelMetricClaimedVerifiedStatus
1MSQNetAccuracy75.33Unverified