| TSPE: Task-Specific Prompt Ensemble for Improved Zero-Shot Audio Classification | Dec 31, 2024 | Audio ClassificationClassification | —Unverified | 0 |
| A sound description: Exploring prompt templates and class descriptions to enhance zero-shot audio classification | Sep 19, 2024 | Audio ClassificationClassification | —Unverified | 0 |
| ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds | Sep 13, 2024 | Audio ClassificationDescriptive | CodeCode Available | 1 |
| Multi-label Zero-Shot Audio Classification with Temporal Attention | Aug 31, 2024 | Audio ClassificationClassification | —Unverified | 0 |
| Enhancing Audio-Language Models through Self-Supervised Post-Training with Text-Audio Pairs | Aug 17, 2024 | Audio ClassificationContrastive Learning | CodeCode Available | 0 |
| Investigating the Emergent Audio Classification Ability of ASR Foundation Models | Nov 15, 2023 | Audio ClassificationDecoder | CodeCode Available | 0 |
| CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models | Oct 12, 2023 | AttributeAudio Classification | —Unverified | 0 |
| LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment | Oct 3, 2023 | Audio ClassificationContrastive Learning | CodeCode Available | 4 |
| Exploring Meta Information for Audio-based Zero-shot Bird Classification | Sep 15, 2023 | Audio ClassificationZero-shot Audio Classification | CodeCode Available | 0 |
| ImageBind: One Embedding Space To Bind Them All | May 9, 2023 | AllCross-Modal Retrieval | CodeCode Available | 5 |
| WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research | Mar 30, 2023 | Audio captioningEvent Detection | CodeCode Available | 2 |
| Zero-Shot Audio Classification using Image Embeddings | Jun 10, 2022 | Audio ClassificationClassification | —Unverified | 0 |
| Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer | Jan 16, 2022 | Audio ClassificationAudio Tagging | —Unverified | 0 |
| Connecting the Dots between Audio and Text without Parallel Data through Visual Knowledge Transfer | Dec 16, 2021 | Audio ClassificationAudio Tagging | CodeCode Available | 1 |
| Sound-Guided Semantic Image Manipulation | Nov 30, 2021 | Audio Classificationimage-classification | CodeCode Available | 1 |
| Zero-Shot Audio Classification with Factored Linear and Nonlinear Acoustic-Semantic Projections | Nov 25, 2020 | Audio ClassificationClassification | —Unverified | 0 |
| Zero-Shot Audio Classification via Semantic Embeddings | Nov 24, 2020 | Audio ClassificationClassification | —Unverified | 0 |
| Zero-Shot Audio Classification Based on Class Label Embeddings | May 6, 2019 | Audio ClassificationClassification | —Unverified | 0 |