| Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context | Mar 19, 2025 | Audio captioningAudio Question Answering | CodeCode Available | 0 |
| Audiopedia: Audio QA with Knowledge | Dec 29, 2024 | Audio Question AnsweringEntity Linking | CodeCode Available | 0 |
| Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models | Sep 10, 2024 | Audio captioningAudio Question Answering | —Unverified | 0 |
| Audio Dialogues: Dialogues dataset for audio and music understanding | Apr 11, 2024 | Audio captioningAudio Question Answering | —Unverified | 0 |
| AQUALLM: Audio Question Answering Data Generation Using Large Language Models | Dec 28, 2023 | Audio Question AnsweringQuestion Answering | CodeCode Available | 0 |
| Attention-Based Methods For Audio Question Answering | May 31, 2023 | Audio Question AnsweringBinary Classification | —Unverified | 0 |
| Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering | Apr 20, 2022 | Audio Question AnsweringQuestion Answering | —Unverified | 0 |
| Temporal Reasoning via Audio Question Answering | Nov 21, 2019 | Audio Question AnsweringDiagnostic | CodeCode Available | 0 |