| GTM: Gray Temporal Model for Video Recognition | Oct 20, 2021 | Action Recognition, model | Unverified | 0 |
| QTTNet: Quantized Tensor Train Neural Networks for 3D Object and Video Recognition | Sep 20, 2021 | Quantization, Video Recognition | Code Available | 0 |
| Large-vocabulary Audio-visual Speech Recognition in Noisy Environments | Sep 10, 2021 | Audio-Visual Speech Recognition, Lipreading | Unverified | 0 |
| Revisiting 3D ResNets for Video Recognition | Sep 3, 2021 | Action Classification, Contrastive Learning | Code Available | 0 |
| Towards Learning a Vocabulary of Visual Concepts and Operators using Deep Neural Networks | Sep 1, 2021 | Video Recognition | Unverified | 0 |
| Searching for Two-Stream Models in Multivariate Space for Video Recognition | Aug 30, 2021 | Neural Architecture Search, Video Recognition | Unverified | 0 |
| Reinforcement Learning Based Sparse Black-box Adversarial Attack on Video Recognition Models | Aug 29, 2021 | Adversarial Attack, reinforcement-learning | Unverified | 0 |
| Towards Efficient Tensor Decomposition-Based DNN Model Compression with Optimization Framework | Jul 26, 2021 | image-classification, Image Classification | Unverified | 0 |
| Inter-intra Variant Dual Representations for Self-supervised Video Recognition | Jul 2, 2021 | Contrastive Learning, Representation Learning | Code Available | 0 |
| VidHarm: A Clip Based Dataset for Harmful Content Detection | Jun 15, 2021 | Video Recognition | Unverified | 0 |
| Motion-Augmented Self-Training for Video Recognition at Smaller Scale | May 4, 2021 | Action Recognition, Optical Flow Estimation | Unverified | 0 |
| The Influence of Audio on Video Memorability with an Audio Gestalt Regulated Video Memorability System | Apr 23, 2021 | Multimodal Deep Learning, Video Recognition | Unverified | 0 |
| HCMS: Hierarchical and Conditional Modality Selection for Efficient Video Recognition | Apr 20, 2021 | Video Recognition | Unverified | 0 |
| Towards Extremely Compact RNNs for Video Recognition with Fully Decomposed Hierarchical Tucker Structure | Apr 12, 2021 | Tensor Decomposition, Video Recognition | Unverified | 0 |
| On the Pitfalls of Learning with Limited Data: A Facial Expression Recognition Case Study | Apr 2, 2021 | Data Augmentation, Deep Learning | Unverified | 0 |
| Multiview Pseudo-Labeling for Semi-supervised Learning from Video | Apr 1, 2021 | Representation Learning, Video Recognition | Unverified | 0 |
| Recognizing Actions in Videos from Unseen Viewpoints | Mar 30, 2021 | Action Classification, Action Recognition | Unverified | 0 |
| Video Transformer Network | Feb 1, 2021 | Action Classification, Action Recognition | Code Available | 0 |
| Multi-Modal Multi-Action Video Recognition | Jan 1, 2021 | Relation, Video Recognition | Code Available | 0 |
| Interactive Prototype Learning for Egocentric Action Recognition | Jan 1, 2021 | Action Recognition, Object | Unverified | 0 |
| 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition | Dec 29, 2020 | Action Recognition, Policy Gradient Methods | Unverified | 0 |
| DeepGamble: Towards unlocking real-time player intelligence using multi-layer instance segmentation and attribute detection | Dec 14, 2020 | Attribute, Instance Segmentation | Unverified | 0 |
| Overcomplete Representations Against Adversarial Videos | Dec 8, 2020 | Adversarial Robustness, Decoder | Code Available | 0 |
| Open-Ended Multi-Modal Relational Reasoning for Video Question Answering | Dec 1, 2020 | Question Answering, Relational Reasoning | Code Available | 0 |
| Annotation-Efficient Untrimmed Video Action Recognition | Nov 30, 2020 | Action Recognition, Contrastive Learning | Unverified | 0 |
| 11 TeraFLOPs per second photonic convolutional accelerator for deep learning optical neural networks | Nov 14, 2020 | Board Games, Medical Diagnosis | Unverified | 0 |
| PV-NAS: Practical Neural Architecture Search for Video Recognition | Nov 2, 2020 | Neural Architecture Search, Video Recognition | Unverified | 0 |
| MultAV: Multiplicative Adversarial Videos | Sep 17, 2020 | Adversarial Attack, Video Recognition | Unverified | 0 |
| Defending Against Multiple and Unforeseen Adversarial Videos | Sep 11, 2020 | Adversarial Robustness, General Classification | Unverified | 0 |
| Kronecker CP Decomposition with Fast Multiplication for Compressing RNNs | Aug 21, 2020 | Tensor Decomposition, Video Recognition | Unverified | 0 |
| Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a Video | Aug 6, 2020 | Video Recognition | Code Available | 0 |
| Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition | Jun 1, 2020 | Few-Shot Learning, Long-tail Learning | Unverified | 0 |
| Compositional Few-Shot Recognition with Primitive Discovery and Enhancing | May 12, 2020 | Few-Shot Image Classification, Few-Shot Learning | Unverified | 0 |
| V4D: 4D Convolutional Neural Networks for Video-level Representation Learning | May 1, 2020 | Representation Learning, Video Recognition | Unverified | 0 |
| DriftNet: Aggressive Driving Behavior Classification using 3D EfficientNet Architecture | Apr 18, 2020 | Anomaly Detection, Classification | Code Available | 0 |
| BosphorusSign22k Sign Language Recognition Dataset | Apr 2, 2020 | Sign Language Production, Sign Language Recognition | Unverified | 0 |
| Symbiotic Attention with Privileged Information for Egocentric Action Recognition | Feb 8, 2020 | Action Recognition, Egocentric Activity Recognition | Unverified | 0 |
| Audiovisual SlowFast Networks for Video Recognition | Jan 23, 2020 | Action Classification, Video Recognition | Code Available | 0 |
| Sparse Black-box Video Attack with Reinforcement Learning | Jan 11, 2020 | reinforcement-learning, Reinforcement Learning | Code Available | 0 |
| Flow-Distilled IP Two-Stream Networks for Compressed Video Action Recognition | Dec 10, 2019 | Action Recognition, Optical Flow Estimation | Unverified | 0 |
| LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition | Dec 3, 2019 | Video Recognition | Unverified | 0 |
| Learning Efficient Video Representation with Video Shuffle Networks | Nov 26, 2019 | Video Recognition | Unverified | 0 |
| TEINet: Towards an Efficient Architecture for Video Recognition | Nov 21, 2019 | Action Recognition, Video Recognition | Unverified | 0 |
| Heuristic Black-box Adversarial Attacks on Video Recognition Models | Nov 21, 2019 | Adversarial Attack, Video Recognition | Code Available | 0 |
| Mimic The Raw Domain: Accelerating Action Recognition in the Compressed Domain | Nov 19, 2019 | Action Recognition, Video Recognition | Unverified | 0 |
| Coverage Guided Testing for Recurrent Neural Networks | Nov 5, 2019 | Defect Detection, Drug Discovery | Code Available | 0 |
| Learning to Localize Temporal Events in Large-scale Video Data | Oct 25, 2019 | Temporal Localization, Video Recognition | Code Available | 0 |
| Training Kinetics in 15 Minutes: Large-scale Distributed Training on Videos | Oct 1, 2019 | GPU, Video Recognition | Code Available | 0 |
| A Video Recognition Method by using Adaptive Structural Learning of Long Short Term Memory based Deep Belief Network | Sep 30, 2019 | Time Series, Time Series Analysis | Unverified | 0 |
| Scheduled Differentiable Architecture Search for Visual Recognition | Sep 23, 2019 | Video Recognition | Unverified | 0 |