SOTAVerified

Video Recognition

Video Recognition is a process of obtaining, processing, and analysing data that it receives from a visual source, specifically video.

Papers

Showing 101150 of 307 papers

TitleStatusHype
FrameExit: Conditional Early Exiting for Efficient Video RecognitionCode1
Frame Flexible NetworkCode1
Frozen CLIP Models are Efficient Video LearnersCode1
Adapting Short-Term Transformers for Action Detection in Untrimmed VideosCode1
Space-time Mixing Attention for Video TransformerCode1
Generalized Few-Shot Video Classification with Video Retrieval and Feature GenerationCode1
Audio-Visual Class-Incremental LearningCode1
BASKET: A Large-Scale Video Dataset for Fine-Grained Skill EstimationCode1
SVFormer: Semi-supervised Video Transformer for Action RecognitionCode1
Glance and Focus Networks for Dynamic Visual RecognitionCode1
Group Contextualization for Video RecognitionCode1
TSM: Temporal Shift Module for Efficient Video UnderstandingCode1
Automated Sperm Assessment Framework and Neural Network Specialized for Sperm Video RecognitionCode0
Training Kinetics in 15 Minutes: Large-scale Distributed Training on VideosCode0
Inter-intra Variant Dual Representations forSelf-supervised Video RecognitionCode0
Audiovisual SlowFast Networks for Video RecognitionCode0
Hierarchical Augmentation and Distillation for Class Incremental Audio-Visual Video RecognitionCode0
testRNN: Coverage-guided Testing on Recurrent Neural NetworksCode0
Hiera: A Hierarchical Vision Transformer without the Bells-and-WhistlesCode0
Heuristic Black-box Adversarial Attacks on Video Recognition ModelsCode0
Coverage Guided Testing for Recurrent Neural NetworksCode0
Tiny Updater: Towards Efficient Neural Network-Driven Software UpdatingCode0
HaltingVT: Adaptive Token Halting Transformer for Efficient Video RecognitionCode0
Temporal Modeling Approaches for Large-scale Youtube-8M Video UnderstandingCode0
Temporal superimposed crossover module for effective continuous sign languageCode0
ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video RecognitionCode0
GenRec: Unifying Video Generation and Recognition with Diffusion ModelsCode0
Spatial-temporal Concept based Explanation of 3D ConvNetsCode0
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible AdapterCode0
Gate-Shift-Fuse for Video Action RecognitionCode0
Sparse Black-box Video Attack with Reinforcement LearningCode0
FAR: Fourier Aerial Video RecognitionCode0
Flow-Guided Feature Aggregation for Video Object DetectionCode0
Collaborative Spatiotemporal Feature Learning for Video Action RecognitionCode0
Sequence Level Semantics Aggregation for Video Object DetectionCode0
Should I take a walk? Estimating Energy Expenditure from Video DataCode0
Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a VideoCode0
Collaborative Spatio-temporal Feature Learning for Video Action RecognitionCode0
QTTNet: Quantized Tensor Train Neural Networks for 3D Object and Video Recognition.Code0
Excitation Dropout: Encouraging Plasticity in Deep Neural NetworksCode0
Revisiting 3D ResNets for Video RecognitionCode0
PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video RecognitionCode0
Efficient Robustness Assessment via Adversarial Spatial-Temporal Focus on VideosCode0
Overcomplete Representations Against Adversarial VideosCode0
A^2-Nets: Double Attention NetworksCode0
On the Relevance of Temporal Features for Medical Ultrasound Video RecognitionCode0
Open-Ended Multi-Modal Relational Reasoning for Video Question AnsweringCode0
Object State Change Classification in Egocentric Videos using the Divided Space-Time Attention MechanismCode0
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action RecognitionCode0
Learning to Localize Temporal Events in Large-scale Video DataCode0
Show:102550
← PrevPage 3 of 7Next →

No leaderboard results yet.