| VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | Mar 23, 2022 | 4kAction Classification | CodeCode Available | 3 |
| VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking | Mar 29, 2023 | Action ClassificationAction Recognition | CodeCode Available | 2 |
| Audio-Visual Instance Discrimination with Cross-Modal Agreement | Apr 27, 2020 | Action RecognitionAudio Classification | CodeCode Available | 1 |
| Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning | Dec 8, 2022 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition | Dec 7, 2021 | Action RecognitionContrastive Learning | CodeCode Available | 1 |
| Contrastive Multiview Coding | Jun 13, 2019 | Contrastive LearningSelf-Supervised Action Recognition | CodeCode Available | 1 |
| EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens | Nov 19, 2022 | Action RecognitionObject State Change Classification | CodeCode Available | 1 |
| Learning the Predictability of the Future | Jun 19, 2021 | Representation LearningSelf-Supervised Action Recognition | CodeCode Available | 1 |
| Broaden Your Views for Self-Supervised Video Learning | Mar 30, 2021 | Audio ClassificationOptical Flow Estimation | CodeCode Available | 1 |
| Masked Motion Encoding for Self-Supervised Video Representation Learning | Oct 12, 2022 | MMEOptical Flow Estimation | CodeCode Available | 1 |