| Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers | Jun 9, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| CT-Net: Channel Tensorization Network for Video Classification | Jun 3, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Continual 3D Convolutional Neural Networks for Real-time Processing of Videos | May 31, 2021 | Action ClassificationVideo Recognition | CodeCode Available | 1 |
| Distributed Learning with Strategic Users: A Repeated Game Approach | May 21, 2021 | Action Classification | —Unverified | 0 |
| VPN++: Rethinking Video-Pose embeddings for understanding Activities of Daily Living | May 17, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Representation Learning via Global Temporal Alignment and Cycle-Consistency | May 11, 2021 | Action ClassificationDynamic Time Warping | CodeCode Available | 1 |
| Unsupervised Visual Representation Learning by Tracking Patches in Video | May 6, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| VidTr: Video Transformer Without Convolutions | Apr 23, 2021 | Action ClassificationAction Recognition | —Unverified | 0 |
| Multiscale Vision Transformers | Apr 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text | Apr 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |