SOTAVerified

Video Recognition

Video Recognition is a process of obtaining, processing, and analysing data that it receives from a visual source, specifically video.

Papers

Showing 125 of 307 papers

TitleStatusHype
InternVideo2: Scaling Foundation Models for Multimodal Video UnderstandingCode7
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal RepresentationsCode5
Expanding Language-Image Pretrained Models for General Video RecognitionCode3
Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language ModelsCode2
Video Swin TransformerCode2
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT AdaptationCode2
AdaptFormer: Adapting Vision Transformers for Scalable Visual RecognitionCode2
Omni-sourced Webly-supervised Learning for Video RecognitionCode2
Uni-AdaFocus: Spatial-temporal Dynamic Computation for Video RecognitionCode2
X3D: Expanding Architectures for Efficient Video RecognitionCode2
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo BenchmarkCode2
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge DeviceCode2
Revisiting Classifier: Transferring Vision-Language Models for Video RecognitionCode2
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?Code2
Adaptive Focus for Efficient Video RecognitionCode1
Cluster and Aggregate: Face Recognition with Large Probe SetCode1
Depth Guided Adaptive Meta-Fusion Network for Few-shot Video RecognitionCode1
Adapting Short-Term Transformers for Action Detection in Untrimmed VideosCode1
Clean-Label Backdoor Attacks on Video Recognition ModelsCode1
Clockwork Convnets for Video Semantic SegmentationCode1
DEVIAS: Learning Disentangled Video Representations of Action and SceneCode1
Boosting the Transferability of Video Adversarial Examples via Temporal TranslationCode1
AdaMML: Adaptive Multi-Modal Learning for Efficient Video RecognitionCode1
Attacking Video Recognition Models with Bullet-Screen CommentsCode1
Building an Open-Vocabulary Video CLIP Model with Better Architectures, Optimization and DataCode1
Show:102550
← PrevPage 1 of 13Next →

No leaderboard results yet.