| Actor-agnostic Multi-label Action Recognition with Multi-modal Query | Jul 20, 2023 | Action Classification, Action Recognition | Code Available | 1 |
| Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting | Apr 6, 2023 | Action Recognition, Prompt Learning | Code Available | 1 |
| EVA-CLIP: Improved Training Techniques for CLIP at Scale | Mar 27, 2023 | Image Classification, Representation Learning | Code Available | 1 |
| MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge | Mar 15, 2023 | Action Recognition, Few-Shot action recognition | Code Available | 1 |
| A CLIP-Hitchhiker's Guide to Long Video Retrieval | May 17, 2022 | Retrieval, Video Retrieval | Code Available | 1 |
| MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval | Apr 26, 2022 | Action Recognition, Retrieval | Code Available | 1 |
| Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification | Mar 29, 2022 | Representation Learning, Video Classification | Code Available | 1 |
| Bridging Video-text Retrieval with Multiple Choice Questions | Jan 13, 2022 | Action Recognition, Linear evaluation | Code Available | 1 |
| Tell me what you see: A zero-shot action recognition method based on natural language descriptions | Dec 18, 2021 | Action Recognition, Descriptive | Code Available | 1 |
| ActionCLIP: A New Paradigm for Video Action Recognition | Sep 17, 2021 | Action Classification, Action Recognition | Code Available | 1 |