| ALLM4ADD: Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection | May 16, 2025 | Audio Deepfake DetectionAudio Question Answering | —Unverified | 0 |
| Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM? | May 14, 2025 | Audio Question AnsweringQuestion Answering | —Unverified | 0 |
| Multi-Domain Audio Question Answering Toward Acoustic Content Reasoning in The DCASE 2025 Challenge | May 12, 2025 | Audio Question AnsweringQuestion Answering | —Unverified | 0 |
| Kimi-Audio Technical Report | Apr 25, 2025 | Audio Question AnsweringQuestion Answering | CodeCode Available | 7 |
| Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context | Mar 19, 2025 | Audio captioningAudio Question Answering | CodeCode Available | 0 |
| Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering | Mar 14, 2025 | Audio Question AnsweringQuestion Answering | CodeCode Available | 3 |
| Audiopedia: Audio QA with Knowledge | Dec 29, 2024 | Audio Question AnsweringEntity Linking | CodeCode Available | 0 |
| Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models | Sep 10, 2024 | Audio captioningAudio Question Answering | —Unverified | 0 |
| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Jun 17, 2024 | Audio Question AnsweringInstruction Following | CodeCode Available | 2 |
| Audio Dialogues: Dialogues dataset for audio and music understanding | Apr 11, 2024 | Audio captioningAudio Question Answering | —Unverified | 0 |