SOTAVerified

Video Recognition

Video Recognition is a process of obtaining, processing, and analysing data that it receives from a visual source, specifically video.

Papers

Showing 151200 of 307 papers

TitleStatusHype
Audio-Visual Glance Network for Efficient Video Recognition0
Orthogonal Temporal Interpolation for Zero-Shot Video RecognitionCode0
On the Importance of Spatial Relations for Few-shot Action Recognition0
View while Moving: Efficient Video Recognition in Long-untrimmed Videos0
TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible AdapterCode0
Enhanced Multimodal Representation Learning with Cross-modal KD0
A two-way translation system of Chinese sign language based on computer vision0
Hiera: A Hierarchical Vision Transformer without the Bells-and-WhistlesCode0
Spatiotemporal Attention-based Semantic Compression for Real-time Video Recognition0
Inter-frame Accelerate Attack against Video Interpolation Models0
Multi-object Video Generation from Single Frame Layouts0
Use Your Head: Improving Long-Tail Video RecognitionCode0
Efficient Decision-based Black-box Patch Attacks on Video Recognition0
Video Action Recognition with Attentive Semantic Units0
MRET: Multi-resolution Transformer for Video Quality Assessment0
Maximizing Spatio-Temporal Entropy of Deep 3D CNNs for Efficient Video RecognitionCode0
Video4MRI: An Empirical Study on Brain Magnetic Resonance Image Analytics with CNN-based Video Classification Frameworks0
Efficient Robustness Assessment via Adversarial Spatial-Temporal Focus on VideosCode0
Tiny Updater: Towards Efficient Neural Network-Driven Software UpdatingCode0
Algorithm and Hardware Co-Design of Energy-Efficient LSTM Networks for Video Recognition with Hierarchical Tucker Tensor Decomposition0
Temporal superimposed crossover module for effective continuous sign languageCode0
REST: REtrieve & Self-Train for generative action recognition0
On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition0
Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling0
Efficient Attention-free Video Shift Transformers0
Adaptive occlusion sensitivity analysis for visually explaining video recognition networksCode0
Object State Change Classification in Egocentric Videos using the Divided Space-Time Attention MechanismCode0
NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition0
Temporal Saliency Query Network for Efficient Video Recognition0
Is an Object-Centric Video Representation Beneficial for Transfer?0
VidConv: A modernized 2D ConvNet for Efficient Video RecognitionCode0
EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2022: Team HNU-FPV Technical Report0
Exploring Temporally Dynamic Data Augmentation for Video Recognition0
M&M Mix: A Multimodal Multiview Transformer Ensemble0
MLP-3D: A MLP-like 3D Architecture with Grouped Time MixingCode0
Spatial-temporal Concept based Explanation of 3D ConvNetsCode0
Noise-Tolerant Learning for Audio-Visual Action Recognition0
Class-Incremental Learning for Action Recognition in Videos0
FAR: Fourier Aerial Video RecognitionCode0
Gate-Shift-Fuse for Video Action RecognitionCode0
Audio-Visual Fusion Layers for Event Type Aware Video Recognition0
Should I take a walk? Estimating Energy Expenditure from Video DataCode0
Action Keypoint Network for Efficient Video Recognition0
Condensing a Sequence to One Informative Frame for Video Recognition0
Optimization Planning for 3D ConvNetsCode0
Improving Video Model Transfer With Dynamic Representation Learning0
Recurring the Transformer for Video Action Recognition0
Cross-Modal Transferable Adversarial Attacks from Images to Videos0
Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search0
ST-ABN: Visual Explanation Taking into Account Spatio-temporal Information for Video RecognitionCode0
Show:102550
← PrevPage 4 of 7Next →

No leaderboard results yet.