| MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark | Jun 5, 2025 | Rhythm, Spoken Language Understanding | Code Available | 7 |
| SyllableLM: Learning Coarse Semantic Units for Speech Language Models | Oct 5, 2024 | Clustering, Language Modeling | Code Available | 2 |
| LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT | Oct 7, 2023 | Audio captioning, Automatic Speech Recognition | Code Available | 2 |
| Using Speech Synthesis to Train End-to-End Spoken Language Understanding Models | Oct 21, 2019 | Data Augmentation, Natural Language Understanding | Code Available | 2 |
| Speech Model Pre-training for End-to-End Spoken Language Understanding | Apr 7, 2019 | Speech-to-Text, Spoken Language Understanding | Code Available | 2 |
| "Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding | May 21, 2025 | Machine Unlearning, Spoken Language Understanding | Code Available | 1 |
| LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams | Apr 24, 2025 | Long-Context Understanding, Spoken Language Understanding | Code Available | 1 |
| RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector | Dec 13, 2024 | In-Context Learning, Question Answering | Code Available | 1 |
| Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond | Aug 7, 2024 | Benchmarking, Language Identification | Code Available | 1 |
| Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages | Apr 3, 2024 | Contrastive Learning, Machine Translation | Code Available | 1 |
| Improving fairness for spoken language understanding in atypical speech with Text-to-Speech | Nov 16, 2023 | Data Augmentation, Fairness | Code Available | 1 |
| BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing | Sep 2, 2023 | speech-recognition, Speech Recognition | Code Available | 1 |
| Joint Multiple Intent Detection and Slot Filling with Supervised Contrastive Learning and Self-Distillation | Aug 28, 2023 | Contrastive Learning, Intent Detection | Code Available | 1 |
| SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge? | Jun 14, 2023 | Natural Language Understanding, Self-Supervised Learning | Code Available | 1 |
| ITALIC: An Italian Intent Classification Dataset | Jun 14, 2023 | Classification, intent-classification | Code Available | 1 |
| OpenSLU: A Unified, Modularized, and Extensible Toolkit for Spoken Language Understanding | May 17, 2023 | Spoken Language Understanding | Code Available | 1 |
| Skit-S2I: An Indian Accented Speech to Intent dataset | Dec 26, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Comparative layer-wise analysis of self-supervised speech models | Nov 8, 2022 | speech-recognition, Speech Recognition | Code Available | 1 |
| T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5 | Nov 1, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings | Oct 23, 2022 | Acoustic Unit Discovery, Contrastive Learning | Code Available | 1 |
| ESPnet-ONNX: Bridging a Gap Between Research and Production | Sep 20, 2022 | Spoken Language Understanding | Code Available | 1 |
| Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding | May 2, 2022 | Contrastive Learning, Spoken Language Understanding | Code Available | 1 |
| WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models | Mar 29, 2022 | Few-Shot Learning, Language Modeling | Code Available | 1 |
| AISHELL-NER: Named Entity Recognition from Chinese Speech | Feb 17, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding | Dec 22, 2021 | Intent Detection, Semantic Frame Parsing | Code Available | 1 |