SOTAVerified

Video Recognition

Video Recognition is a process of obtaining, processing, and analysing data that it receives from a visual source, specifically video.

Papers

Showing 3140 of 307 papers

TitleStatusHype
Prune Spatio-temporal Tokens by Semantic-aware Temporal AccumulationCode1
What Can Simple Arithmetic Operations Do for Temporal Modeling?Code1
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action RecognitionCode1
Implicit Temporal Modeling with Learnable Alignment for Video RecognitionCode1
Frame Flexible NetworkCode1
The effectiveness of MAE pre-pretraining for billion-scale pretrainingCode1
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language KnowledgeCode1
Making Vision Transformers Efficient from A Token Sparsification ViewCode1
Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight OptimizationCode1
Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge TransferringCode1
Show:102550
← PrevPage 4 of 31Next →

No leaderboard results yet.