| Representation Learning on Visual-Symbolic Graphs for Video Understanding | May 17, 2019 | Action ClassificationAction Detection | —Unverified | 0 |
| Video Instance Segmentation | May 12, 2019 | Instance SegmentationSegmentation | CodeCode Available | 2 |
| Large Scale Holistic Video Understanding | Apr 25, 2019 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Recurrent Space-time Graph Neural Networks | Apr 11, 2019 | Action RecognitionHuman-Object Interaction Detection | CodeCode Available | 0 |
| Constructing Hierarchical Q&A Datasets for Video Story Understanding | Apr 1, 2019 | Video Understanding | —Unverified | 0 |
| Wasserstein Dependency Measure for Representation Learning | Mar 28, 2019 | Object Recognitionreinforcement-learning | —Unverified | 0 |
| 4D Generic Video Object Proposals | Jan 26, 2019 | Instance SegmentationObject | CodeCode Available | 0 |
| DMC-Net: Generating Discriminative Motion Cues for Fast Compressed Video Action Recognition | Jan 11, 2019 | Action ClassificationAction Recognition | —Unverified | 0 |
| Future semantic segmentation of time-lapsed videos with large temporal displacement | Dec 27, 2018 | SegmentationSemantic Segmentation | —Unverified | 0 |
| Dynamic Graph Modules for Modeling Object-Object Interactions in Activity Recognition | Dec 13, 2018 | 3D Action RecognitionAction Recognition | —Unverified | 0 |
| Long-Term Feature Banks for Detailed Video Understanding | Dec 12, 2018 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| A Structured Model For Action Detection | Dec 9, 2018 | Action Detectionmodel | —Unverified | 0 |
| An Attempt towards Interpretable Audio-Visual Video Captioning | Dec 7, 2018 | Audio captioningAudio-Visual Video Captioning | —Unverified | 0 |
| The Visual Centrifuge: Model-Free Layered Video Representations | Dec 4, 2018 | Color Constancymodel | CodeCode Available | 0 |
| How to Make a BLT Sandwich? Learning to Reason towards Understanding Web Instructional Videos | Dec 2, 2018 | Logical ReasoningQuestion Answering | —Unverified | 0 |
| Self-Supervised Spatiotemporal Feature Learning via Video Rotation Prediction | Nov 28, 2018 | Action RecognitionPrediction | —Unverified | 0 |
| Integrated Object Detection and Tracking with Tracklet-Conditioned Detection | Nov 27, 2018 | Objectobject-detection | —Unverified | 0 |
| Efficient Video Understanding via Layered Multi Frame-Rate Analysis | Nov 24, 2018 | Autonomous DrivingVideo Understanding | —Unverified | 0 |
| TSM: Temporal Shift Module for Efficient Video Understanding | Nov 20, 2018 | 3D Action RecognitionAction Classification | CodeCode Available | 1 |
| NeXtVLAD: An Efficient Neural Network to Aggregate Frame-level Features for Large-scale Video Classification | Nov 12, 2018 | Efficient Neural NetworkGeneral Classification | CodeCode Available | 0 |
| Random Temporal Skipping for Multirate Video Analysis | Oct 30, 2018 | Action RecognitionOptical Flow Estimation | —Unverified | 0 |
| Morph: Flexible Acceleration for 3D CNN-based Video Understanding | Oct 16, 2018 | MORPHVideo Recognition | —Unverified | 0 |
| Unsupervised Adversarial Visual Level Domain Adaptation for Learning Video Object Detectors from Images | Oct 4, 2018 | Domain AdaptationImage-to-Image Translation | CodeCode Available | 0 |
| Representation Flow for Action Recognition | Oct 2, 2018 | Action ClassificationAction Recognition | CodeCode Available | 0 |
| Learnable Pooling Methods for Video Classification | Oct 1, 2018 | ClassificationGeneral Classification | CodeCode Available | 0 |