| End-to-End Learning of Motion Representation for Video Understanding | Apr 2, 2018 | Action Recognition, Optical Flow Estimation | Code Available | 0 |
| Joint Event Detection and Description in Continuous Video Streams | Feb 28, 2018 | Dense Captioning, Dense Video Captioning | Code Available | 0 |
| Detect-and-Track: Efficient Pose Estimation in Videos | Dec 26, 2017 | Human Detection, Keypoint Estimation | Code Available | 0 |
| Attend and Interact: Higher-Order Object Interactions for Video Understanding | Nov 16, 2017 | Action Classification, Action Recognition | Unverified | 0 |
| Grounded Objects and Interactions for Video Captioning | Nov 16, 2017 | Object, Scene Understanding | Unverified | 0 |
| End-to-End Video Classification with Knowledge Graphs | Nov 6, 2017 | BIG-bench Machine Learning, Classification | Unverified | 0 |
| Scene-centric Joint Parsing of Cross-view Videos | Sep 16, 2017 | Video Understanding | Unverified | 0 |
| ElasticPlay: Interactive Video Summarization with Dynamic Time Budgets | Aug 23, 2017 | Video Summarization, Video Understanding | Unverified | 0 |
| Kill Two Birds With One Stone: Boosting Both Object Detection Accuracy and Speed With adaptive Patch-of-Interest Composition | Aug 12, 2017 | Object, object-detection | Unverified | 0 |
| Extensible Hierarchical Method of Detecting Interactive Actions for Video Understanding | Aug 11, 2017 | Action Detection, Action Recognition | Unverified | 0 |
| Unsupervised Video Understanding by Reconciliation of Posture Similarities | Aug 3, 2017 | Action Classification, Retrieval | Unverified | 0 |
| Multi-kernel learning of deep convolutional features for action recognition | Jul 21, 2017 | Action Recognition, Activity Recognition | Unverified | 0 |
| Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding | Jul 14, 2017 | Video Recognition, Video Understanding | Code Available | 0 |
| Cultivating DNN Diversity for Large Scale Video Labelling | Jul 13, 2017 | Diversity, Video Understanding | Unverified | 0 |
| Hierarchical Deep Recurrent Architecture for Video Understanding | Jul 11, 2017 | Classification, General Classification | Code Available | 0 |
| Video Representation Learning and Latent Concept Mining for Large-scale Multi-label Video Classification | Jul 5, 2017 | Attribute, General Classification | Code Available | 0 |
| Aggregating Frame-level Features for Large-Scale Video Classification | Jul 4, 2017 | Classification, General Classification | Unverified | 0 |
| Jointly Learning Energy Expenditures and Activities Using Egocentric Multimodal Signals | Jul 1, 2017 | Video Understanding | Unverified | 0 |
| Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos | Jul 1, 2017 | Action Recognition, Action Recognition In Videos | Unverified | 0 |
| Generating the Future With Adversarial Transformers | Jul 1, 2017 | Video Understanding | Unverified | 0 |
| The YouTube-8M Kaggle Competition: Challenges and Methods | Jun 28, 2017 | General Classification, Video Classification | Code Available | 0 |
| An Effective Way to Improve YouTube-8M Classification Accuracy in Google Cloud Platform | Jun 26, 2017 | Classification, Deep Learning | Unverified | 0 |
| YouTube-8M Video Understanding Challenge Approach and Applications | Jun 26, 2017 | Ensemble Learning, Video Understanding | Unverified | 0 |
| Learnable pooling with Context Gating for video classification | Jun 21, 2017 | Classification, Clustering | Code Available | 0 |
| The Monkeytyping Solution to the YouTube-8M Video Understanding Challenge | Jun 16, 2017 | General Classification, Video Classification | Code Available | 0 |
| Large-Scale YouTube-8M Video Understanding with Deep Neural Networks | Jun 14, 2017 | Classification, General Classification | Unverified | 0 |
| Deep Learning Methods for Efficient Large Scale Video Labeling | Jun 14, 2017 | Deep Learning, Video Understanding | Code Available | 0 |
| Learning without Prejudice: Avoiding Bias in Webly-Supervised Action Recognition | Jun 14, 2017 | Action Recognition, Optical Flow Estimation | Unverified | 0 |
| Action Understanding with Multiple Classes of Actors | Apr 27, 2017 | Action Recognition, Action Segmentation | Unverified | 0 |
| Video Object Segmentation using Supervoxel-Based Gerrymandering | Apr 18, 2017 | Object, Semantic Segmentation | Code Available | 0 |
| TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition | Mar 30, 2017 | Action Classification, Action Recognition | Code Available | 0 |
| Temporal Tessellation: A Unified Approach for Video Analysis | Dec 21, 2016 | Action Detection, Video Captioning | Code Available | 0 |
| Real-Time Video Highlights for Yahoo Esports | Nov 27, 2016 | CPU, Dota 2 | Unverified | 0 |
| Generating Videos with Scene Dynamics | Sep 8, 2016 | Action Classification, Future prediction | Unverified | 0 |
| VideoMCC: a New Benchmark for Video Comprehension | Jun 23, 2016 | Multiple-choice, Video Description | Unverified | 0 |
| Slicing Convolutional Neural Network for Crowd Video Understanding | Jun 1, 2016 | Attribute, Video Understanding | Unverified | 0 |
| MSR-VTT: A Large Video Description Dataset for Bridging Video and Language | Jun 1, 2016 | Image Captioning, Sentence | Unverified | 0 |
| Harnessing Object and Scene Semantics for Large-Scale Video Understanding | Jun 1, 2016 | Action Recognition, Clustering | Unverified | 0 |
| The THUMOS Challenge on Action Recognition for Videos "in the Wild" | Apr 21, 2016 | Action Classification, Action Recognition | Unverified | 0 |
| The Open World of Micro-Videos | Mar 31, 2016 | Diversity, TAG | Unverified | 0 |
| Actor-Action Semantic Segmentation with Grouping Process Models | Dec 30, 2015 | Semantic Segmentation, Video Understanding | Unverified | 0 |
| Mid-level Representation for Visual Recognition | Dec 23, 2015 | object-detection, Object Detection | Unverified | 0 |
| Fine-Grain Annotation of Cricket Videos | Nov 24, 2015 | Action Recognition, Retrieval | Unverified | 0 |
| Person Count Localization in Videos From Noisy Foreground and Detections | Jun 1, 2015 | Foreground Segmentation, Human Detection | Unverified | 0 |
| Unsupervised Object Discovery and Tracking in Video Collections | May 14, 2015 | Object, Object Discovery | Unverified | 0 |
| Learning from Multiple Sources for Video Summarisation | Jan 13, 2015 | Clustering, Video Understanding | Unverified | 0 |
| Pooled Motion Features for First-Person Videos | Dec 19, 2014 | Activity Recognition, Activity Recognition In Videos | Code Available | 0 |
| Weakly Supervised Multiclass Video Segmentation | Jun 1, 2014 | Segmentation, Semantic Similarity | Unverified | 0 |
| Grounding Action Descriptions in Videos | Jan 1, 2013 | Semantic Textual Similarity, Video Understanding | Unverified | 0 |