| Cross-Modal and Hierarchical Modeling of Video and Text | Oct 16, 2018 | Action RecognitionRetrieval | CodeCode Available | 0 | 5 |
| Orthogonal Temporal Interpolation for Zero-Shot Video Recognition | Aug 14, 2023 | Video RecognitionZero-Shot Action Recognition | CodeCode Available | 0 | 5 |
| Rethinking Zero-shot Action Recognition: Learning from Latent Atomic Actions | Mar 28, 2022 | Action RecognitionZero-Shot Action Recognition | CodeCode Available | 0 | 5 |
| Learning a Deep Embedding Model for Zero-Shot Learning | Nov 15, 2016 | Image CaptioningSentence | CodeCode Available | 0 | 5 |
| Label-Embedding for Image Classification | Mar 30, 2015 | AttributeClassification | CodeCode Available | 0 | 5 |
| InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation | Jul 13, 2023 | Action RecognitionContrastive Learning | CodeCode Available | 0 | 5 |
| Global Semantic Descriptors for Zero-Shot Action Recognition | Sep 24, 2022 | Action ClassificationAction Recognition | CodeCode Available | 0 | 5 |
| LoCATe-GAT: Modeling Multi-Scale Local Context and Action Relationships for Zero-Shot Action Recognition | Nov 27, 2024 | Action RecognitionGraph Attention | CodeCode Available | 0 | 5 |
| I Know the Relationships: Zero-Shot Action Recognition via Two-Stream Graph Convolutional Networks and Knowledge Graphs | Jul 17, 2019 | Action RecognitionAttribute | CodeCode Available | 0 | 5 |
| End-to-End Semantic Video Transformer for Zero-Shot Action Recognition | Mar 10, 2022 | Action RecognitionTemporal Action Localization | CodeCode Available | 0 | 5 |