SOTAVerified

Video Recognition

Video Recognition is a process of obtaining, processing, and analysing data that it receives from a visual source, specifically video.

Papers

Showing 5175 of 307 papers

TitleStatusHype
ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to VideoCode1
Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer LearningCode1
Phase-Specific Augmented Reality Guidance for Microscopic Cataract Surgery Using Long-Short Spatiotemporal Aggregation Transformer0
Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving0
Eventful Transformers: Leveraging Temporal Redundancy in Vision TransformersCode1
Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video RecognitionCode0
Audio-Visual Class-Incremental LearningCode1
Temporal-Distributed Backdoor Attack Against Video Based Action Recognition0
Audio-Visual Glance Network for Efficient Video Recognition0
Helping Hands: An Object-Aware Ego-Centric Video Recognition ModelCode1
On the Importance of Spatial Relations for Few-shot Action Recognition0
Orthogonal Temporal Interpolation for Zero-Shot Video RecognitionCode0
View while Moving: Efficient Video Recognition in Long-untrimmed Videos0
Prune Spatio-temporal Tokens by Semantic-aware Temporal AccumulationCode1
What Can Simple Arithmetic Operations Do for Temporal Modeling?Code1
Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action RecognitionCode1
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible AdapterCode0
Enhanced Multimodal Representation Learning with Cross-modal KD0
A two-way translation system of Chinese sign language based on computer vision0
Hiera: A Hierarchical Vision Transformer without the Bells-and-WhistlesCode0
Spatiotemporal Attention-based Semantic Compression for Real-time Video Recognition0
Inter-frame Accelerate Attack against Video Interpolation Models0
Multi-object Video Generation from Single Frame Layouts0
Implicit Temporal Modeling with Learnable Alignment for Video RecognitionCode1
Use Your Head: Improving Long-Tail Video RecognitionCode0
Show:102550
← PrevPage 3 of 13Next →

No leaderboard results yet.