| CA^2ST: Cross-Attention in Audio, Space, and Time for Holistic Video Recognition | Mar 30, 2025 | Action Classification, Action Recognition | —Unverified | 0 | 0 |
| Reinforcement Learning Based Sparse Black-box Adversarial Attack on Video Recognition Models | Aug 29, 2021 | Adversarial Attack, reinforcement-learning | —Unverified | 0 | 0 |
| REST: REtrieve & Self-Train for generative action recognition | Sep 29, 2022 | Action Recognition, Caption Generation | —Unverified | 0 | 0 |
| Class-Incremental Learning for Action Recognition in Videos | Mar 25, 2022 | Action Recognition, Action Recognition In Videos | —Unverified | 0 | 0 |
| Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving | Sep 8, 2023 | All, Autonomous Driving | —Unverified | 0 | 0 |
| Retro-Actions: Learning 'Close' by Time-Reversing 'Open' Videos | Sep 20, 2019 | Data Augmentation, Video Recognition | —Unverified | 0 | 0 |
| Multi-Task Learning of Generalizable Representations for Video Action Recognition | Nov 20, 2018 | Action Recognition, Multi-Task Learning | —Unverified | 0 | 0 |
| Compositional Few-Shot Recognition with Primitive Discovery and Enhancing | May 12, 2020 | Few-Shot Image Classification, Few-Shot Learning | —Unverified | 0 | 0 |
| Condensing a Sequence to One Informative Frame for Video Recognition | Jan 11, 2022 | Motion Estimation, valid | —Unverified | 0 | 0 |
| Convolutional Neural Network on Three Orthogonal Planes for Dynamic Texture Classification | Mar 16, 2017 | General Classification, Retrieval | —Unverified | 0 | 0 |
| Correlation Net: Spatiotemporal multimodal deep learning for action recognition | Jul 22, 2018 | Action Recognition, Deep Learning | —Unverified | 0 | 0 |
| Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition | Apr 30, 2024 | Action Classification, Action Recognition | —Unverified | 0 | 0 |
| Cross-Modal Transferable Adversarial Attacks from Images to Videos | Dec 10, 2021 | Video Recognition | —Unverified | 0 | 0 |
| DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments | Dec 28, 2024 | Action Localization, Action Recognition | —Unverified | 0 | 0 |
| DeepGamble: Towards unlocking real-time player intelligence using multi-layer instance segmentation and attribute detection | Dec 14, 2020 | Attribute, Instance Segmentation | —Unverified | 0 | 0 |
| Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data Is Continuous and Weakly Labelled | Jun 1, 2016 | Sign Language Recognition, Video Recognition | —Unverified | 0 | 0 |
| Deep Networks With Large Output Spaces | Dec 23, 2014 | Video Recognition | —Unverified | 0 | 0 |
| Defending Against Multiple and Unforeseen Adversarial Videos | Sep 11, 2020 | Adversarial Robustness, General Classification | —Unverified | 0 | 0 |
| A Novel Audio-Visual Information Fusion System for Mental Disorders Detection | Sep 3, 2024 | EEG, Video Recognition | —Unverified | 0 | 0 |
| Demonstration of Vector Flow Imaging using Convolutional Neural Networks | Mar 11, 2019 | Optical Flow Estimation, Video Recognition | —Unverified | 0 | 0 |
| Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification | Aug 12, 2017 | Action Classification, General Classification | —Unverified | 0 | 0 |
| Design Light-weight 3D Convolutional Networks for Video Recognition Temporal Residual, Fully Separable Block, and Fast Algorithm | May 31, 2019 | Video Recognition | —Unverified | 0 | 0 |
| V4D: 4D Convolutional Neural Networks for Video-level Representation Learning | May 1, 2020 | Representation Learning, Video Recognition | —Unverified | 0 | 0 |
| AdaFrame: Adaptive Frame Selection for Fast Video Recognition | Nov 29, 2018 | Policy Gradient Methods, Video Recognition | —Unverified | 0 | 0 |
| Searching for Two-Stream Models in Multivariate Space for Video Recognition | Aug 30, 2021 | Neural Architecture Search, Video Recognition | —Unverified | 0 | 0 |
| DistInit: Learning Video Representations Without a Single Labeled Video | Jan 26, 2019 | Action Recognition, Temporal Action Localization | —Unverified | 0 | 0 |
| Annotation-Efficient Untrimmed Video Action Recognition | Nov 30, 2020 | Action Recognition, Contrastive Learning | —Unverified | 0 | 0 |
| Early Detection of Injuries in MLB Pitchers from Video | Apr 18, 2019 | Video Recognition | —Unverified | 0 | 0 |
| Efficient Attention-free Video Shift Transformers | Aug 23, 2022 | Action Recognition, Video Recognition | —Unverified | 0 | 0 |
| Efficient Decision-based Black-box Patch Attacks on Video Recognition | Mar 21, 2023 | Video Recognition | —Unverified | 0 | 0 |
| View while Moving: Efficient Video Recognition in Long-untrimmed Videos | Aug 9, 2023 | Video Recognition | —Unverified | 0 | 0 |
| Efficient Selective Audio Masked Multimodal Bottleneck Transformer for Audio-Video Classification | Jan 8, 2024 | Action Recognition, Contrastive Learning | —Unverified | 0 | 0 |
| Enhanced Multimodal Representation Learning with Cross-modal KD | Jun 13, 2023 | Contrastive Learning, Emotion Classification | —Unverified | 0 | 0 |
| Enhance Visual Recognition under Adverse Conditions via Deep Networks | Dec 20, 2017 | Data Augmentation, Image Restoration | —Unverified | 0 | 0 |
| EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2022: Team HNU-FPV Technical Report | Jul 7, 2022 | Action Recognition, Domain Adaptation | —Unverified | 0 | 0 |
| Alignment Distances on Systems of Bags | Jun 14, 2017 | Descriptive, Dictionary Learning | —Unverified | 0 | 0 |
| Explainable Deep Learning for Video Recognition Tasks: A Framework & Recommendations | Sep 7, 2019 | Deep Learning, Video Recognition | —Unverified | 0 | 0 |
| Exploiting Images for Video Recognition with Hierarchical Generative Adversarial Networks | May 11, 2018 | Domain Adaptation, Video Recognition | —Unverified | 0 | 0 |
| Exploring Temporally Dynamic Data Augmentation for Video Recognition | Jun 30, 2022 | Action Localization, Action Segmentation | —Unverified | 0 | 0 |
| Fast Adversarial Training with Weak-to-Strong Spatial-Temporal Consistency in the Frequency Domain on Videos | Apr 21, 2025 | Adversarial Robustness, Video Recognition | —Unverified | 0 | 0 |
| Video4MRI: An Empirical Study on Brain Magnetic Resonance Image Analytics with CNN-based Video Classification Frameworks | Feb 24, 2023 | Classification, Data Augmentation | —Unverified | 0 | 0 |
| Fast Retinomorphic Event Stream for Video Recognition and Reinforcement Learning | May 16, 2018 | Action Recognition, Atari Games | —Unverified | 0 | 0 |
| Flow-Distilled IP Two-Stream Networks for Compressed Video Action Recognition | Dec 10, 2019 | Action Recognition, Optical Flow Estimation | —Unverified | 0 | 0 |
| FlowGraph2Text: Automatic Sentence Skeleton Compilation for Procedural Text Generation | Jun 1, 2014 | Sentence, Text Generation | —Unverified | 0 | 0 |
| Spatiotemporal Attention-based Semantic Compression for Real-time Video Recognition | May 22, 2023 | Action Recognition, Decoder | —Unverified | 0 | 0 |
| Gameplay Highlights Generation | May 12, 2025 | Event Detection, Highlight Detection | —Unverified | 0 | 0 |
| Generating Videos with Scene Dynamics | Sep 8, 2016 | Action Classification, Future prediction | —Unverified | 0 | 0 |
| Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning | Jun 1, 2018 | Action Recognition, Representation Learning | —Unverified | 0 | 0 |
| Standardization Trends on Safety and Trustworthiness Technology for Advanced AI | Oct 29, 2024 | Video Recognition | —Unverified | 0 | 0 |
| GTM: Gray Temporal Model for Video Recognition | Oct 20, 2021 | Action Recognition, model | —Unverified | 0 | 0 |