| Representation Flow for Action Recognition | Oct 2, 2018 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Learnable Pooling Methods for Video Classification | Oct 1, 2018 | ClassificationGeneral Classification | CodeCode Available | 0 |
| Non-local NetVLAD Encoding for Video Classification | Sep 29, 2018 | ClassificationGeneral Classification | —Unverified | 0 |
| Large-Scale Video Classification with Feature Space Augmentation coupled with Learned Label Relations and Ensembling | Sep 21, 2018 | General ClassificationVideo Classification | —Unverified | 0 |
| Label Denoising with Large Ensembles of Heterogeneous Neural Networks | Sep 12, 2018 | Data AugmentationDenoising | —Unverified | 0 |
| Localizing Moments in Video with Temporal Language | Sep 5, 2018 | Natural Language QueriesRetrieval | CodeCode Available | 0 |
| End-to-End Joint Semantic Segmentation of Actors and Actions in Video | Sep 1, 2018 | Action RecognitionSegmentation | —Unverified | 0 |
| Teaching Machines to Understand Baseball Games: Large-Scale Baseball Video Database for Multiple Video Understanding Tasks | Sep 1, 2018 | Video AlignmentVideo Recognition | —Unverified | 0 |
| Constrained-size Tensorflow Models for YouTube-8M Video Understanding Challenge | Aug 21, 2018 | Video Understanding | CodeCode Available | 0 |
| Diagnosing Error in Temporal Action Detectors | Jul 27, 2018 | Action LocalizationDiagnostic | CodeCode Available | 0 |
| Video Time: Properties, Encoders and Evaluation | Jul 18, 2018 | Video Understanding | —Unverified | 0 |
| Query-Conditioned Three-Player Adversarial Network for Video Summarization | Jul 17, 2018 | Generative Adversarial NetworkVideo Summarization | —Unverified | 0 |
| When Work Matters: Transforming Classical Network Structures to Graph CNN | Jul 7, 2018 | Graph ClassificationVideo Understanding | —Unverified | 0 |
| Long Activity Video Understanding using Functional Object-Oriented Network | Jul 3, 2018 | ObjectVideo Understanding | —Unverified | 0 |
| Deep Spatio-Temporal Random Fields for Efficient Video Segmentation | Jul 3, 2018 | Instance SegmentationSemantic Segmentation | —Unverified | 0 |
| Exploiting Spatial-Temporal Modelling and Multi-Modal Fusion for Human Action Recognition | Jun 27, 2018 | Action RecognitionTemporal Action Localization | —Unverified | 0 |
| Massively Parallel Video Networks | Jun 11, 2018 | Action RecognitionTemporal Action Localization | —Unverified | 0 |
| What Makes a Video a Video: Analyzing Temporal Information in Video Understanding Models and Datasets | Jun 1, 2018 | Video Understanding | —Unverified | 0 |
| Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning | Jun 1, 2018 | Action RecognitionRepresentation Learning | —Unverified | 0 |
| DenseImage Network: Video Spatial-Temporal Evolution Encoding and Understanding | May 19, 2018 | Action Recognition In VideosGesture Recognition | —Unverified | 0 |
| Fast Retinomorphic Event Stream for Video Recognition and Reinforcement Learning | May 16, 2018 | Action RecognitionAtari Games | —Unverified | 0 |
| Dilated Temporal Relational Adversarial Network for Generic Video Summarization | Apr 30, 2018 | Generative Adversarial NetworkVideo Summarization | —Unverified | 0 |
| Charades-Ego: A Large-Scale Dataset of Paired Third and First Person Videos | Apr 25, 2018 | General ClassificationVideo Classification | —Unverified | 0 |
| ECO: Efficient Convolutional Network for Online Video Understanding | Apr 24, 2018 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning | Apr 15, 2018 | Video CaptioningVideo Understanding | CodeCode Available | 0 |