SOTAVerified

Video Recognition

Video Recognition is a process of obtaining, processing, and analysing data that it receives from a visual source, specifically video.

Papers

Showing 151200 of 307 papers

TitleStatusHype
GTM: Gray Temporal Model for Video Recognition0
Boosting the Transferability of Video Adversarial Examples via Temporal TranslationCode1
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge DeviceCode2
QTTNet: Quantized Tensor Train Neural Networks for 3D Object and Video Recognition.Code0
Unsupervised 3D Pose Estimation for Hierarchical Dance Video RecognitionCode1
Large-vocabulary Audio-visual Speech Recognition in Noisy Environments0
Revisiting 3D ResNets for Video RecognitionCode0
Towards Learning a Vocabulary of Visual Concepts and Operators using Deep Neural Networks0
Searching for Two-Stream Models in Multivariate Space for Video Recognition0
Reinforcement Learning Based Sparse Black-box Adversarial Attack on Video Recognition Models0
Dynamic Network Quantization for Efficient Video InferenceCode1
Towards Efficient Tensor Decomposition-Based DNN Model Compression with Optimization Framework0
Inter-intra Variant Dual Representations forSelf-supervised Video RecognitionCode0
Can An Image Classifier Suffice For Action Recognition?Code1
Video Swin TransformerCode2
TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?Code1
Towards Long-Form Video UnderstandingCode1
Self-supervised Video Representation Learning with Cross-Stream Prototypical ContrastingCode1
PyKale: Knowledge-Aware Machine Learning from Multiple Sources in PythonCode1
VidHarm: A Clip Based Dataset for Harmful Content Detection0
Space-time Mixing Attention for Video TransformerCode1
Continual 3D Convolutional Neural Networks for Real-time Processing of VideosCode1
DSANet: Dynamic Segment Aggregation Network for Video-Level Representation LearningCode1
Sharing Pain: Using Pain Domain Transfer for Video Recognition of Low Grade Orthopedic Pain in HorsesCode1
AdaMML: Adaptive Multi-Modal Learning for Efficient Video RecognitionCode1
Adaptive Focus for Efficient Video RecognitionCode1
VideoLT: Large-scale Long-tailed Video RecognitionCode1
Motion-Augmented Self-Training for Video Recognition at Smaller Scale0
FrameExit: Conditional Early Exiting for Efficient Video RecognitionCode1
The Influence of Audio on Video Memorability with an Audio Gestalt Regulated Video Memorability System0
Multiscale Vision TransformersCode1
HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition0
Towards Extremely Compact RNNs for Video Recognition with Fully Decomposed Hierarchical Tucker Structure0
On the Pitfalls of Learning with Limited Data: A Facial Expression Recognition Case Study0
Visual Semantic Role Labeling for Video UnderstandingCode1
Multiview Pseudo-Labeling for Semi-supervised Learning from Video0
Recognizing Actions in Videos from Unseen Viewpoints0
Learning Versatile Neural Architectures by Propagating Network CodesCode1
MoViNets: Mobile Video Networks for Efficient Video RecognitionCode1
PatchNet -- Short-range Template Matching for Efficient Video ProcessingCode1
Video Transformer NetworkCode0
Piano Skills AssessmentCode1
Multi-Modal Multi-Action Video RecognitionCode0
Interactive Prototype Learning for Egocentric Action Recognition0
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition0
DeepGamble: Towards unlocking real-time player intelligence using multi-layer instance segmentation and attribute detection0
MVFNet: Multi-View Fusion Network for Efficient Video RecognitionCode1
Overcomplete Representations Against Adversarial VideosCode0
Learning Equivariant RepresentationsCode1
Open-Ended Multi-Modal Relational Reasoning for Video Question AnsweringCode0
Show:102550
← PrevPage 4 of 7Next →

No leaderboard results yet.