| MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark | Jun 5, 2025 | Rhythm, Spoken Language Understanding | Code Available | 7 |
| LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT | Oct 7, 2023 | Audio captioning, Automatic Speech Recognition | Code Available | 2 |
| Speech Model Pre-training for End-to-End Spoken Language Understanding | Apr 7, 2019 | Speech-to-Text, Spoken Language Understanding | Code Available | 2 |
| Using Speech Synthesis to Train End-to-End Spoken Language Understanding Models | Oct 21, 2019 | Data Augmentation, Natural Language Understanding | Code Available | 2 |
| SyllableLM: Learning Coarse Semantic Units for Speech Language Models | Oct 5, 2024 | Clustering, Language Modeling | Code Available | 2 |
| BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing | Sep 2, 2023 | speech-recognition, Speech Recognition | Code Available | 1 |
| A Survey on Spoken Language Understanding: Recent Advances and New Frontiers | Mar 4, 2021 | Spoken Language Understanding, Survey | Code Available | 1 |
| Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings | Oct 23, 2022 | Acoustic Unit Discovery, Contrastive Learning | Code Available | 1 |
| A Co-Interactive Transformer for Joint Slot Filling and Intent Detection | Oct 8, 2020 | Intent Detection, slot-filling | Code Available | 1 |
| A Label-Aware BERT Attention Network for Zero-Shot Multi-Intent Detection in Spoken Language Understanding | Nov 1, 2021 | Intent Detection, Spoken Language Understanding | Code Available | 1 |