| MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark | Jun 5, 2025 | RhythmSpoken Language Understanding | CodeCode Available | 7 |
| SyllableLM: Learning Coarse Semantic Units for Speech Language Models | Oct 5, 2024 | ClusteringLanguage Modeling | CodeCode Available | 2 |
| LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT | Oct 7, 2023 | Audio captioningAutomatic Speech Recognition | CodeCode Available | 2 |
| Using Speech Synthesis to Train End-to-End Spoken Language Understanding Models | Oct 21, 2019 | Data AugmentationNatural Language Understanding | CodeCode Available | 2 |
| Speech Model Pre-training for End-to-End Spoken Language Understanding | Apr 7, 2019 | Speech-to-TextSpoken Language Understanding | CodeCode Available | 2 |
| "Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding | May 21, 2025 | Machine UnlearningSpoken Language Understanding | CodeCode Available | 1 |
| LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams | Apr 24, 2025 | Long-Context UnderstandingSpoken Language Understanding | CodeCode Available | 1 |
| RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector | Dec 13, 2024 | In-Context LearningQuestion Answering | CodeCode Available | 1 |
| Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond | Aug 7, 2024 | BenchmarkingLanguage Identification | CodeCode Available | 1 |
| Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages | Apr 3, 2024 | Contrastive LearningMachine Translation | CodeCode Available | 1 |