| The Role of Video Generation in Enhancing Data-Limited Action Understanding | May 26, 2025 | Action Recognition, Action Understanding | Unverified | 0 |
| Can masking background and object reduce static bias for zero-shot action recognition? | Jan 22, 2025 | Action Recognition, Zero-Shot Action Recognition | Unverified | 0 |
| Semantic-guided Cross-Modal Prompt Learning for Skeleton-based Zero-shot Action Recognition | Jan 1, 2025 | Action Recognition, Computational Efficiency | Unverified | 0 |
| Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP | Dec 13, 2024 | Action Recognition, Text Augmentation | Code Available | 1 |
| LoCATe-GAT: Modeling Multi-Scale Local Context and Action Relationships for Zero-Shot Action Recognition | Nov 27, 2024 | Action Recognition, Graph Attention | Code Available | 0 |
| TDSM: Triplet Diffusion for Skeleton-Text Matching in Zero-Shot Action Recognition | Nov 16, 2024 | Action Recognition, Skeleton Based Action Recognition | Code Available | 1 |
| Zero-Shot Action Recognition in Surveillance Videos | Oct 28, 2024 | Action Recognition, Video Understanding | Unverified | 0 |
| Continual Learning Improves Zero-Shot Action Recognition | Oct 14, 2024 | Action Recognition, Continual Learning | Unverified | 0 |
| Zero-Shot Skeleton-based Action Recognition with Dual Visual-Text Alignment | Sep 22, 2024 | Action Recognition, Metric Learning | Unverified | 0 |
| Text-Enhanced Zero-Shot Action Recognition: A training-free approach | Aug 29, 2024 | Action Recognition, Temporal Action Localization | Unverified | 0 |
| Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognition | Jun 19, 2024 | Action Recognition, Skeleton Based Action Recognition | Code Available | 1 |
| An Information Compensation Framework for Zero-Shot Skeleton-based Action Recognition | Jun 2, 2024 | Action Recognition, Ensemble Learning | Unverified | 0 |
| A Cross-Dataset Study for Text-based 3D Human Motion Retrieval | May 27, 2024 | Action Recognition, Retrieval | Unverified | 0 |
| The impact of Compositionality in Zero-shot Multi-label action recognition for Object-based tasks | May 14, 2024 | Action Recognition, Action Recognition In Videos | Unverified | 0 |
| Leveraging Temporal Contextualization for Video Action Recognition | Apr 15, 2024 | Action Recognition, Temporal Action Localization | Code Available | 2 |
| Fine-Grained Side Information Guided Dual-Prompts for Zero-Shot Skeleton Action Recognition | Apr 11, 2024 | Action Recognition, Attribute | Unverified | 0 |
| ActionHub: A Large-scale Action Video Description Dataset for Zero-shot Action Recognition | Jan 22, 2024 | Action Recognition, Video Description | Unverified | 0 |
| EZ-CLIP: Efficient Zeroshot Video Action Recognition | Dec 13, 2023 | Action Recognition, GPU | Code Available | 1 |
| OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition | Nov 30, 2023 | Descriptive, Language Modelling | Code Available | 1 |
| LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment | Oct 3, 2023 | Audio Classification, Contrastive Learning | Code Available | 4 |
| Telling Stories for Common Sense Zero-Shot Action Recognition | Sep 29, 2023 | Action Recognition, Articles | Code Available | 0 |
| Orthogonal Temporal Interpolation for Zero-Shot Video Recognition | Aug 14, 2023 | Video Recognition, Zero-Shot Action Recognition | Code Available | 0 |
| Actor-agnostic Multi-label Action Recognition with Multi-modal Query | Jul 20, 2023 | Action Classification, Action Recognition | Code Available | 1 |
| InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation | Jul 13, 2023 | Action Recognition, Contrastive Learning | Code Available | 0 |
| Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception | May 10, 2023 | Classification, image-classification | Unverified | 0 |