SOTAVerified

Video Recognition

Video Recognition is the process of obtaining, processing, and analysing data from a visual source, specifically video.

Papers

Showing 101–150 of 307 papers

Title | Status | Hype
FrameExit: Conditional Early Exiting for Efficient Video Recognition | Code | 1
Frame Flexible Network | Code | 1
Frozen CLIP Models are Efficient Video Learners | Code | 1
Adapting Short-Term Transformers for Action Detection in Untrimmed Videos | Code | 1
Temporal-attentive Covariance Pooling Networks for Video Recognition | Code | 1
Generalized Few-Shot Video Classification with Video Retrieval and Feature Generation | Code | 1
VG4D: Vision-Language Model Goes 4D Video Recognition | Code | 1
TAM: Temporal Adaptive Module for Video Recognition | Code | 1
DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning | Code | 1
Glance and Focus Networks for Dynamic Visual Recognition | Code | 1
AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition | Code | 1
TSM: Temporal Shift Module for Efficient Video Understanding | Code | 1
Automated Sperm Assessment Framework and Neural Network Specialized for Sperm Video Recognition | Code | 0
Unleashing the Power of CNN and Transformer for Balanced RGB-Event Video Recognition | Code | 0
Inter-intra Variant Dual Representations for Self-supervised Video Recognition | Code | 0
Audiovisual SlowFast Networks for Video Recognition | Code | 0
Hierarchical Augmentation and Distillation for Class Incremental Audio-Visual Video Recognition | Code | 0
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | Code | 0
Heuristic Black-box Adversarial Attacks on Video Recognition Models | Code | 0
Tiny Updater: Towards Efficient Neural Network-Driven Software Updating | Code | 0
HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition | Code | 0
testRNN: Coverage-guided Testing on Recurrent Neural Networks | Code | 0
Use Your Head: Improving Long-Tail Video Recognition | Code | 0
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter | Code | 0
GenRec: Unifying Video Generation and Recognition with Diffusion Models | Code | 0
ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video Recognition | Code | 0
Gate-Shift-Fuse for Video Action Recognition | Code | 0
Sparse Black-box Video Attack with Reinforcement Learning | Code | 0
Spatial-temporal Concept based Explanation of 3D ConvNets | Code | 0
FAR: Fourier Aerial Video Recognition | Code | 0
Flow-Guided Feature Aggregation for Video Object Detection | Code | 0
Should I take a walk? Estimating Energy Expenditure from Video Data | Code | 0
Collaborative Spatiotemporal Feature Learning for Video Action Recognition | Code | 0
Sequence Level Semantics Aggregation for Video Object Detection | Code | 0
Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding | Code | 0
Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a Video | Code | 0
Collaborative Spatio-temporal Feature Learning for Video Action Recognition | Code | 0
QTTNet: Quantized Tensor Train Neural Networks for 3D Object and Video Recognition | Code | 0
Revisiting 3D ResNets for Video Recognition | Code | 0
Excitation Dropout: Encouraging Plasticity in Deep Neural Networks | Code | 0
PosMLP-Video: Spatial and Temporal Relative Position Encoding for Efficient Video Recognition | Code | 0
Efficient Robustness Assessment via Adversarial Spatial-Temporal Focus on Videos | Code | 0
A^2-Nets: Double Attention Networks | Code | 0
Overcomplete Representations Against Adversarial Videos | Code | 0
On the Relevance of Temporal Features for Medical Ultrasound Video Recognition | Code | 0
Open-Ended Multi-Modal Relational Reasoning for Video Question Answering | Code | 0
Optimization Planning for 3D ConvNets | Code | 0
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition | Code | 0
Learning to Localize Temporal Events in Large-scale Video Data | Code | 0
Learning Spatio-Temporal Representation with Local and Global Diffusion | Code | 0

No leaderboard results yet.