| VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | Mar 23, 2022 | 4k, Action Classification | Code Available | 3 |
| Learning Video Representations from Large Language Models | Dec 8, 2022 | Action Classification, Action Recognition | Code Available | 2 |
| XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning | Nov 25, 2022 | Action Classification, Classification | Code Available | 1 |
| Self-supervised Video Transformer | Dec 2, 2021 | Action Classification, Action Recognition | Code Available | 1 |
| VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples | Mar 10, 2021 | Action Recognition, Contrastive Learning | Code Available | 1 |
| Self-supervised Co-training for Video Representation Learning | Oct 19, 2020 | Action Recognition, Contrastive Learning | Code Available | 1 |
| Spatiotemporal Contrastive Video Representation Learning | Aug 9, 2020 | Action Recognition, Contrastive Learning | Code Available | 1 |
| Language-based Action Concept Spaces Improve Video Self-Supervised Learning | Jul 20, 2023 | Action Recognition, Concept Alignment | Unverified | 0 |
| Vi2CLR: Video and Image for Visual Contrastive Learning of Representation | Jan 1, 2021 | Action Recognition, Clustering | Unverified | 0 |
| Contrast and Order Representations for Video Self-Supervised Learning | Jan 1, 2021 | Action Recognition, Self-Supervised Action Recognition Linear | Unverified | 0 |