SOTAVerified

Video Recognition

Video Recognition is a process of obtaining, processing, and analysing data that it receives from a visual source, specifically video.

Papers

Showing 101150 of 307 papers

TitleStatusHype
FrameExit: Conditional Early Exiting for Efficient Video RecognitionCode1
Frame Flexible NetworkCode1
Frozen CLIP Models are Efficient Video LearnersCode1
Adapting Short-Term Transformers for Action Detection in Untrimmed VideosCode1
AdaFocusV3: On Unified Spatial-temporal Dynamic Video RecognitionCode1
Generalized Few-Shot Video Classification with Video Retrieval and Feature GenerationCode1
Audio-Visual Class-Incremental LearningCode1
0-MMS: Zero-Shot Multi-Motion Segmentation With A Monocular Event CameraCode1
Temporal-attentive Covariance Pooling Networks for Video RecognitionCode1
Glance and Focus Networks for Dynamic Visual RecognitionCode1
Group Contextualization for Video RecognitionCode1
VLG: General Video Recognition with Web Textual KnowledgeCode1
Demonstration of Vector Flow Imaging using Convolutional Neural Networks0
Image and Video Mining through Online Learning0
Action Keypoint Network for Efficient Video Recognition0
Deep Networks With Large Output Spaces0
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition0
Higher-order Network for Action Recognition0
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition0
Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition0
Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data Is Continuous and Weakly Labelled0
Defending Against Multiple and Unforeseen Adversarial Videos0
Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions0
Audio-Visual Glance Network for Efficient Video Recognition0
DeepGamble: Towards unlocking real-time player intelligence using multi-layer instance segmentation and attribute detection0
Motion Guided Token Compression for Efficient Masked Video Modeling0
MRET: Multi-resolution Transformer for Video Quality Assessment0
Audio-Visual Fusion Layers for Event Type Aware Video Recognition0
GTM: Gray Temporal Model for Video Recognition0
DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments0
Adaptive Detrending to Accelerate Convolutional Gated Recurrent Unit Training for Contextual Video Recognition0
MultAV: Multiplicative Adversarial Videos0
Multi-Fiber Networks for Video Recognition0
Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning0
Cross-Modal Transferable Adversarial Attacks from Images to Videos0
Generating Videos with Scene Dynamics0
Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition0
Correlation Net: Spatiotemporal multimodal deep learning for action recognition0
Convolutional Neural Network on Three Orthogonal Planes for Dynamic Texture Classification0
Gameplay Highlights Generation0
A two-way translation system of Chinese sign language based on computer vision0
Morph: Flexible Acceleration for 3D CNN-based Video Understanding0
Condensing a Sequence to One Informative Frame for Video Recognition0
Action Detail Matters: Refining Video Recognition with Local Action Queries0
M&M Mix: A Multimodal Multiview Transformer Ensemble0
FlowGraph2Text: Automatic Sentence Skeleton Compilation for Procedural Text Generation0
Flow-Distilled IP Two-Stream Networks for Compressed Video Action Recognition0
Compositional Few-Shot Recognition with Primitive Discovery and Enhancing0
Fast Retinomorphic Event Stream for Video Recognition and Reinforcement Learning0
Attention Transfer from Web Images for Video Recognition0
Show:102550
← PrevPage 3 of 7Next →

No leaderboard results yet.