| FrameExit: Conditional Early Exiting for Efficient Video Recognition | Apr 27, 2021 | Video Recognition, Video Understanding | Code Available | 1 |
| Frame Flexible Network | Mar 26, 2023 | Video Recognition | Code Available | 1 |
| Frozen CLIP Models are Efficient Video Learners | Aug 6, 2022 | Action Classification, Decoder | Code Available | 1 |
| Adapting Short-Term Transformers for Action Detection in Untrimmed Videos | Dec 4, 2023 | Action Detection, Video Recognition | Code Available | 1 |
| Long Movie Clip Classification with State-Space Video Models | Apr 4, 2022 | Classification, Decoder | Code Available | 1 |
| Generalized Few-Shot Video Classification with Video Retrieval and Feature Generation | Jul 9, 2020 | Few-Shot Image Classification, Few-Shot Learning | Code Available | 1 |
| Audio-Visual Class-Incremental Learning | Aug 21, 2023 | class-incremental learning, Class Incremental Learning | Code Available | 1 |
| Real-time Online Video Detection with Temporal Smoothing Transformers | Sep 19, 2022 | Action Anticipation, Action Detection | Code Available | 1 |
| Temporal-attentive Covariance Pooling Networks for Video Recognition | Oct 27, 2021 | Video Recognition | Code Available | 1 |
| Glance and Focus Networks for Dynamic Visual Recognition | Jan 9, 2022 | image-classification, Image Classification | Code Available | 1 |
| Group Contextualization for Video Recognition | Mar 18, 2022 | Action Recognition, Egocentric Activity Recognition | Code Available | 1 |
| VLG: General Video Recognition with Web Textual Knowledge | Dec 3, 2022 | Video Recognition | Code Available | 1 |
| Demonstration of Vector Flow Imaging using Convolutional Neural Networks | Mar 11, 2019 | Optical Flow Estimation, Video Recognition | —Unverified | 0 |
| Image and Video Mining through Online Learning | Sep 9, 2016 | Action Recognition, Active Learning | —Unverified | 0 |
| Action Keypoint Network for Efficient Video Recognition | Jan 17, 2022 | Action Recognition, Point Cloud Classification | —Unverified | 0 |
| Deep Networks With Large Output Spaces | Dec 23, 2014 | Video Recognition | —Unverified | 0 |
| HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition | Apr 20, 2021 | Video Recognition | —Unverified | 0 |
| Higher-order Network for Action Recognition | Nov 19, 2018 | Action Recognition, General Classification | —Unverified | 0 |
| 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition | Dec 29, 2020 | Action Recognition, Policy Gradient Methods | —Unverified | 0 |
| Multi-Fiber Networks for Video Recognition | Jul 30, 2018 | Action Classification, Action Recognition | —Unverified | 0 |
| Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data Is Continuous and Weakly Labelled | Jun 1, 2016 | Sign Language Recognition, Video Recognition | —Unverified | 0 |
| Defending Against Multiple and Unforeseen Adversarial Videos | Sep 11, 2020 | Adversarial Robustness, General Classification | —Unverified | 0 |
| Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions | May 28, 2024 | Action Recognition, Video Recognition | —Unverified | 0 |
| MRET: Multi-resolution Transformer for Video Quality Assessment | Mar 13, 2023 | Video Quality Assessment, Video Recognition | —Unverified | 0 |
| Audio-Visual Glance Network for Efficient Video Recognition | Aug 18, 2023 | Video Recognition, Video Understanding | —Unverified | 0 |
| DeepGamble: Towards unlocking real-time player intelligence using multi-layer instance segmentation and attribute detection | Dec 14, 2020 | Attribute, Instance Segmentation | —Unverified | 0 |
| MultAV: Multiplicative Adversarial Videos | Sep 17, 2020 | Adversarial Attack, Video Recognition | —Unverified | 0 |
| Audio-Visual Fusion Layers for Event Type Aware Video Recognition | Feb 12, 2022 | Multi-Task Learning, Video Recognition | —Unverified | 0 |
| GTM: Gray Temporal Model for Video Recognition | Oct 20, 2021 | Action Recognition, model | —Unverified | 0 |
| DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments | Dec 28, 2024 | Action Localization, Action Recognition | —Unverified | 0 |
| Adaptive Detrending to Accelerate Convolutional Gated Recurrent Unit Training for Contextual Video Recognition | May 24, 2017 | Video Recognition | —Unverified | 0 |
| Multi-Agent Reinforcement Learning Based Frame Sampling for Effective Untrimmed Video Recognition | Jul 31, 2019 | Action Recognition, General Classification | —Unverified | 0 |
| Multi Modal Convolutional Neural Networks for Brain Tumor Segmentation | Sep 17, 2018 | Brain Tumor Segmentation, Segmentation | —Unverified | 0 |
| Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning | Jun 1, 2018 | Action Recognition, Representation Learning | —Unverified | 0 |
| Cross-Modal Transferable Adversarial Attacks from Images to Videos | Dec 10, 2021 | Video Recognition | —Unverified | 0 |
| Generating Videos with Scene Dynamics | Sep 8, 2016 | Action Classification, Future prediction | —Unverified | 0 |
| Cross-Block Fine-Grained Semantic Cascade for Skeleton-Based Sports Action Recognition | Apr 30, 2024 | Action Classification, Action Recognition | —Unverified | 0 |
| Morph: Flexible Acceleration for 3D CNN-based Video Understanding | Oct 16, 2018 | MORPH, Video Recognition | —Unverified | 0 |
| Correlation Net: Spatiotemporal multimodal deep learning for action recognition | Jul 22, 2018 | Action Recognition, Deep Learning | —Unverified | 0 |
| Convolutional Neural Network on Three Orthogonal Planes for Dynamic Texture Classification | Mar 16, 2017 | General Classification, Retrieval | —Unverified | 0 |
| Gameplay Highlights Generation | May 12, 2025 | Event Detection, Highlight Detection | —Unverified | 0 |
| A two-way translation system of Chinese sign language based on computer vision | Jun 3, 2023 | Sentence, Sign Language Recognition | —Unverified | 0 |
| Motion-Augmented Self-Training for Video Recognition at Smaller Scale | May 4, 2021 | Action Recognition, Optical Flow Estimation | —Unverified | 0 |
| Condensing a Sequence to One Informative Frame for Video Recognition | Jan 11, 2022 | Motion Estimation, valid | —Unverified | 0 |
| Action Detail Matters: Refining Video Recognition with Local Action Queries | Jan 1, 2025 | Action Recognition, Temporal Action Localization | —Unverified | 0 |
| Modeling Sub-Event Dynamics in First-Person Action Recognition | Jul 1, 2017 | Action Recognition, Temporal Action Localization | —Unverified | 0 |
| FlowGraph2Text: Automatic Sentence Skeleton Compilation for Procedural Text Generation | Jun 1, 2014 | Sentence, Text Generation | —Unverified | 0 |
| Flow-Distilled IP Two-Stream Networks for Compressed Video Action Recognition | Dec 10, 2019 | Action Recognition, Optical Flow Estimation | —Unverified | 0 |
| Compositional Few-Shot Recognition with Primitive Discovery and Enhancing | May 12, 2020 | Few-Shot Image Classification, Few-Shot Learning | —Unverified | 0 |
| Fast Retinomorphic Event Stream for Video Recognition and Reinforcement Learning | May 16, 2018 | Action Recognition, Atari Games | —Unverified | 0 |