| TSPE: Task-Specific Prompt Ensemble for Improved Zero-Shot Audio Classification | Dec 31, 2024 | Audio Classification, Classification | — Unverified | 0 |
| A sound description: Exploring prompt templates and class descriptions to enhance zero-shot audio classification | Sep 19, 2024 | Audio Classification, Classification | — Unverified | 0 |
| ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds | Sep 13, 2024 | Audio Classification, Descriptive | Code Available | 1 |
| Multi-label Zero-Shot Audio Classification with Temporal Attention | Aug 31, 2024 | Audio Classification, Classification | — Unverified | 0 |
| Enhancing Audio-Language Models through Self-Supervised Post-Training with Text-Audio Pairs | Aug 17, 2024 | Audio Classification, Contrastive Learning | Code Available | 0 |
| Investigating the Emergent Audio Classification Ability of ASR Foundation Models | Nov 15, 2023 | Audio Classification, Decoder | Code Available | 0 |
| CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models | Oct 12, 2023 | Attribute, Audio Classification | — Unverified | 0 |
| LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment | Oct 3, 2023 | Audio Classification, Contrastive Learning | Code Available | 4 |
| Exploring Meta Information for Audio-based Zero-shot Bird Classification | Sep 15, 2023 | Audio Classification, Zero-shot Audio Classification | Code Available | 0 |
| ImageBind: One Embedding Space To Bind Them All | May 9, 2023 | All, Cross-Modal Retrieval | Code Available | 5 |