SOTAVerified

Video Recognition

Video Recognition is the process of obtaining, processing, and analysing data from a visual source, specifically video.

Papers

Showing 26-50 of 307 papers

| Title | Status | Hype |
| --- | --- | --- |
| DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark | Code | 2 |
| Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions | - | 0 |
| No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding | Code | 1 |
| Transfer-LMR: Heavy-Tail Driving Behavior Recognition in Diverse Traffic Scenarios | - | 0 |
| Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition | - | 0 |
| VG4D: Vision-Language Model Goes 4D Video Recognition | Code | 1 |
| InternVideo2: Scaling Foundation Models for Multimodal Video Understanding | Code | 7 |
| Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation | Code | 2 |
| LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model | - | 0 |
| Don't Judge by the Look: Towards Motion Coherent Video Representation | Code | 0 |
| Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video Recognition | - | 0 |
| Hierarchical Augmentation and Distillation for Class Incremental Audio-Visual Video Recognition | Code | 0 |
| Motion Guided Token Compression for Efficient Masked Video Modeling | - | 0 |
| HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition | Code | 0 |
| Efficient Selective Audio Masked Multimodal Bottleneck Transformer for Audio-Video Classification | - | 0 |
| Video Recognition in Portrait Mode | Code | 1 |
| Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition | Code | 0 |
| LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer | Code | 0 |
| Adapting Short-Term Transformers for Action Detection in Untrimmed Videos | Code | 1 |
| DEVIAS: Learning Disentangled Video Representations of Action and Scene | Code | 1 |
| OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition | Code | 1 |
| Automated Sperm Assessment Framework and Neural Network Specialized for Sperm Video Recognition | Code | 0 |
| Object-centric Video Representation for Long-term Action Anticipation | Code | 0 |
| On the Relevance of Temporal Features for Medical Ultrasound Video Recognition | Code | 0 |
| Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and Data | Code | 1 |

No leaderboard results yet.