| LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model | Mar 18, 2024 | Adversarial AttackStyle Transfer | —Unverified | 0 |
| Scheduled Differentiable Architecture Search for Visual Recognition | Sep 23, 2019 | Video Recognition | —Unverified | 0 |
| 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition | Dec 29, 2020 | Action RecognitionPolicy Gradient Methods | —Unverified | 0 |
| Maximum A Posteriori Estimation of Distances Between Deep Features in Still-to-Video Face Recognition | Aug 26, 2017 | Face RecognitionVideo Recognition | —Unverified | 0 |
| MeMSVD: Long-Range Temporal Structure Capturing Using Incremental SVD | Jun 11, 2024 | Video RecognitionVideo Understanding | —Unverified | 0 |
| The Influence of Audio on Video Memorability with an Audio Gestalt Regulated Video Memorability System | Apr 23, 2021 | Multimodal Deep LearningVideo Recognition | —Unverified | 0 |
| Mimic The Raw Domain: Accelerating Action Recognition in the Compressed Domain | Nov 19, 2019 | Action RecognitionVideo Recognition | —Unverified | 0 |
| M&M Mix: A Multimodal Multiview Transformer Ensemble | Jun 20, 2022 | Action RecognitionVideo Recognition | —Unverified | 0 |
| Modeling Sub-Event Dynamics in First-Person Action Recognition | Jul 1, 2017 | Action RecognitionTemporal Action Localization | —Unverified | 0 |
| Morph: Flexible Acceleration for 3D CNN-based Video Understanding | Oct 16, 2018 | MORPHVideo Recognition | —Unverified | 0 |
| Motion-Augmented Self-Training for Video Recognition at Smaller Scale | May 4, 2021 | Action RecognitionOptical Flow Estimation | —Unverified | 0 |
| Motion Guided Token Compression for Efficient Masked Video Modeling | Jan 10, 2024 | Video CompressionVideo Recognition | —Unverified | 0 |
| Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling | Aug 25, 2022 | Video Recognition | —Unverified | 0 |
| MRET: Multi-resolution Transformer for Video Quality Assessment | Mar 13, 2023 | Video Quality AssessmentVideo Recognition | —Unverified | 0 |
| MultAV: Multiplicative Adversarial Videos | Sep 17, 2020 | Adversarial AttackVideo Recognition | —Unverified | 0 |
| Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition | Jul 31, 2019 | Action RecognitionGeneral Classification | —Unverified | 0 |
| Multi-Fiber Networks for Video Recognition | Jul 30, 2018 | Action ClassificationAction Recognition | —Unverified | 0 |
| Multi Modal Convolutional Neural Networks for Brain Tumor Segmentation | Sep 17, 2018 | Brain Tumor SegmentationSegmentation | —Unverified | 0 |
| Multimodal Transfer Deep Learning with Applications in Audio-Visual Recognition | Dec 9, 2014 | Deep LearningVideo Recognition | —Unverified | 0 |
| Multi-object Video Generation from Single Frame Layouts | May 6, 2023 | Image GenerationObject | —Unverified | 0 |
| Multiview Pseudo-Labeling for Semi-supervised Learning from Video | Apr 1, 2021 | Representation LearningVideo Recognition | —Unverified | 0 |
| Noise-Tolerant Learning for Audio-Visual Action Recognition | May 16, 2022 | Action RecognitionNoise Estimation | —Unverified | 0 |
| Non-local NetVLAD Encoding for Video Classification | Sep 29, 2018 | ClassificationGeneral Classification | —Unverified | 0 |
| Towards Efficient Tensor Decomposition-Based DNN Model Compression with Optimization Framework | Jul 26, 2021 | image-classificationImage Classification | —Unverified | 0 |
| NSNet: Non-saliency Suppression Sampler for Efficient Video Recognition | Jul 21, 2022 | Action RecognitionVideo Classification | —Unverified | 0 |
| Towards Extremely Compact RNNs for Video Recognition with Fully Decomposed Hierarchical Tucker Structure | Apr 12, 2021 | Tensor DecompositionVideo Recognition | —Unverified | 0 |
| Towards Learning a Vocabulary of Visual Concepts and Operators using Deep Neural Networks | Sep 1, 2021 | Video Recognition | —Unverified | 0 |
| Adaptive Detrending to Accelerate Convolutional Gated Recurrent Unit Training for Contextual Video Recognition | May 24, 2017 | Video Recognition | —Unverified | 0 |
| On the Importance of Spatial Relations for Few-shot Action Recognition | Aug 14, 2023 | Action RecognitionFew-Shot action recognition | —Unverified | 0 |
| On the Pitfalls of Learning with Limited Data: A Facial Expression Recognition Case Study | Apr 2, 2021 | Data AugmentationDeep Learning | —Unverified | 0 |
| On the Surprising Effectiveness of Transformers in Low-Labeled Video Recognition | Sep 15, 2022 | image-classificationImage Classification | —Unverified | 0 |
| PA3D: Pose-Action 3D Machine for Video Recognition | Jun 1, 2019 | Action RecognitionOptical Flow Estimation | —Unverified | 0 |
| Towards Scalable Modeling of Compressed Videos for Efficient Action Recognition | Mar 17, 2025 | Action RecognitionVideo Recognition | —Unverified | 0 |
| Transfer Learning for Video Recognition with Scarce Training Data for Deep Convolutional Neural Network | Sep 15, 2014 | 4kTransfer Learning | —Unverified | 0 |
| Attention Distillation for Learning Video Representations | Apr 5, 2019 | Action RecognitionVideo Recognition | —Unverified | 0 |
| Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video Recognition | Feb 29, 2024 | Transfer LearningVideo Recognition | —Unverified | 0 |
| Phase-Specific Augmented Reality Guidance for Microscopic Cataract Surgery Using Long-Short Spatiotemporal Aggregation Transformer | Sep 11, 2023 | Multi-Task LearningVideo Recognition | —Unverified | 0 |
| Transfer-LMR: Heavy-Tail Driving Behavior Recognition in Diverse Traffic Scenarios | May 8, 2024 | Video Recognition | —Unverified | 0 |
| Action Keypoint Network for Efficient Video Recognition | Jan 17, 2022 | Action RecognitionPoint Cloud Classification | —Unverified | 0 |
| A^2-Nets: Double Attention Networks | Oct 27, 2018 | 3D Absolute Human Pose EstimationAction Classification | —Unverified | 0 |
| Purification Of Contaminated Convolutional Neural Networks Via Robust Recovery: An Approach with Theoretical Guarantee in One-Hidden-Layer Case | Jul 4, 2024 | image-classificationImage Classification | —Unverified | 0 |
| PV-NAS: Practical Neural Architecture Search for Video Recognition | Nov 2, 2020 | Neural Architecture SearchVideo Recognition | —Unverified | 0 |
| Action Detail Matters: Refining Video Recognition with Local Action Queries | Jan 1, 2025 | Action RecognitionTemporal Action Localization | —Unverified | 0 |
| A Simple and Efficient Baseline for Video Action Recognition | Mar 2, 2025 | Action RecognitionFine-grained Action Recognition | —Unverified | 0 |
| Attention Transfer from Web Images for Video Recognition | Aug 3, 2017 | Action RecognitionTemporal Action Localization | —Unverified | 0 |
| A two-way translation system of Chinese sign language based on computer vision | Jun 3, 2023 | SentenceSign Language Recognition | —Unverified | 0 |
| HaltingVT: Adaptive Token Halting Transformer for Efficient Video Recognition | Jan 10, 2024 | Action RecognitionAction Recognition In Videos | CodeCode Available | 0 |
| A^2-Nets: Double Attention Networks | Dec 1, 2018 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Heuristic Black-box Adversarial Attacks on Video Recognition Models | Nov 21, 2019 | Adversarial AttackVideo Recognition | CodeCode Available | 0 |
| Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | Jun 1, 2023 | Action ClassificationAction Recognition | CodeCode Available | 0 |