| MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark | Jun 5, 2025 | RhythmSpoken Language Understanding | CodeCode Available | 7 |
| "KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding | May 26, 2025 | Kolmogorov-Arnold NetworksSpoken Language Understanding | CodeCode Available | 0 |
| ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs | May 26, 2025 | cross-modal alignmentEmotion Recognition | —Unverified | 0 |
| Exploring the Effect of Segmentation and Vocabulary Size on Speech Tokenization for Speech Language Models | May 23, 2025 | Speech TokenizationSpoken Language Understanding | —Unverified | 0 |
| "Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding | May 21, 2025 | Machine UnlearningSpoken Language Understanding | CodeCode Available | 1 |
| QUADS: QUAntized Distillation Framework for Efficient Speech Language Understanding | May 19, 2025 | QuantizationSpoken Language Understanding | CodeCode Available | 0 |
| Spoken Language Understanding on Unseen Tasks With In-Context Learning | May 12, 2025 | In-Context LearningSpoken Language Understanding | —Unverified | 0 |
| LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams | Apr 24, 2025 | Long-Context UnderstandingSpoken Language Understanding | CodeCode Available | 1 |
| Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks | Feb 19, 2025 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding | Jan 13, 2025 | Automatic Speech Recognitionintent-classification | CodeCode Available | 0 |
| Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Jan 10, 2025 | Automatic Speech RecognitionClassification | CodeCode Available | 0 |
| Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer | Jan 3, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An Overview and Discussion of the Suitability of Existing Speech Datasets to Train Machine Learning Models for Collective Problem Solving | Dec 24, 2024 | Decision MakingSpoken Language Understanding | —Unverified | 0 |
| RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector | Dec 13, 2024 | In-Context LearningQuestion Answering | CodeCode Available | 1 |
| A Survey on Speech Large Language Models | Oct 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding | Oct 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SyllableLM: Learning Coarse Semantic Units for Speech Language Models | Oct 5, 2024 | ClusteringLanguage Modeling | CodeCode Available | 2 |
| Speech Recognition Rescoring with Large Speech-Text Foundation Models | Sep 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder | Sep 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Increasing faithfulness in human-human dialog summarization with Spoken Language Understanding tasks | Sep 16, 2024 | Spoken Language Understanding | —Unverified | 0 |
| Clean Label Attacks against SLU Systems | Sep 13, 2024 | Data Poisoningspeech-recognition | —Unverified | 0 |
| WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding | Aug 29, 2024 | slot-fillingSlot Filling | —Unverified | 0 |
| Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond | Aug 7, 2024 | BenchmarkingLanguage Identification | CodeCode Available | 1 |
| Out-of-distribution generalisation in spoken language understanding | Jul 10, 2024 | Spoken Language Understanding | CodeCode Available | 0 |
| Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian Dialect | Jul 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |