| MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark | Jun 5, 2025 | RhythmSpoken Language Understanding | CodeCode Available | 7 |
| "KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding | May 26, 2025 | Kolmogorov-Arnold NetworksSpoken Language Understanding | CodeCode Available | 0 |
| ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs | May 26, 2025 | cross-modal alignmentEmotion Recognition | —Unverified | 0 |
| Exploring the Effect of Segmentation and Vocabulary Size on Speech Tokenization for Speech Language Models | May 23, 2025 | Speech TokenizationSpoken Language Understanding | —Unverified | 0 |
| "Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding | May 21, 2025 | Machine UnlearningSpoken Language Understanding | CodeCode Available | 1 |
| QUADS: QUAntized Distillation Framework for Efficient Speech Language Understanding | May 19, 2025 | QuantizationSpoken Language Understanding | CodeCode Available | 0 |
| Spoken Language Understanding on Unseen Tasks With In-Context Learning | May 12, 2025 | In-Context LearningSpoken Language Understanding | —Unverified | 0 |
| LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams | Apr 24, 2025 | Long-Context UnderstandingSpoken Language Understanding | CodeCode Available | 1 |
| Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks | Feb 19, 2025 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding | Jan 13, 2025 | Automatic Speech Recognitionintent-classification | CodeCode Available | 0 |
| Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Jan 10, 2025 | Automatic Speech RecognitionClassification | CodeCode Available | 0 |
| Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer | Jan 3, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An Overview and Discussion of the Suitability of Existing Speech Datasets to Train Machine Learning Models for Collective Problem Solving | Dec 24, 2024 | Decision MakingSpoken Language Understanding | —Unverified | 0 |
| RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector | Dec 13, 2024 | In-Context LearningQuestion Answering | CodeCode Available | 1 |
| A Survey on Speech Large Language Models | Oct 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding | Oct 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SyllableLM: Learning Coarse Semantic Units for Speech Language Models | Oct 5, 2024 | ClusteringLanguage Modeling | CodeCode Available | 2 |
| Speech Recognition Rescoring with Large Speech-Text Foundation Models | Sep 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder | Sep 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Increasing faithfulness in human-human dialog summarization with Spoken Language Understanding tasks | Sep 16, 2024 | Spoken Language Understanding | —Unverified | 0 |
| Clean Label Attacks against SLU Systems | Sep 13, 2024 | Data Poisoningspeech-recognition | —Unverified | 0 |
| WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding | Aug 29, 2024 | slot-fillingSlot Filling | —Unverified | 0 |
| Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond | Aug 7, 2024 | BenchmarkingLanguage Identification | CodeCode Available | 1 |
| Out-of-distribution generalisation in spoken language understanding | Jul 10, 2024 | Spoken Language Understanding | CodeCode Available | 0 |
| Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian Dialect | Jul 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding | Jun 21, 2024 | Cross-corpusDecoder | —Unverified | 0 |
| A Contrastive Learning Approach to Mitigate Bias in Speech Models | Jun 20, 2024 | Contrastive LearningSpoken Language Understanding | CodeCode Available | 0 |
| Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model | Jun 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding | Jun 17, 2024 | Self-Supervised LearningSpoken Language Understanding | —Unverified | 0 |
| CroPrompt: Cross-task Interactive Prompting for Zero-shot Spoken Language Understanding | Jun 15, 2024 | Intent Detectionslot-filling | —Unverified | 0 |
| On the Evaluation of Speech Foundation Models for Spoken Language Understanding | Jun 14, 2024 | BenchmarkingPrediction | —Unverified | 0 |
| DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding | Jun 13, 2024 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding | Jun 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning | May 31, 2024 | Contrastive LearningIntent Detection | —Unverified | 0 |
| Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | May 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| MSNER: A Multilingual Speech Dataset for Named Entity Recognition | May 19, 2024 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants | May 14, 2024 | Automatic Speech RecognitionDiversity | —Unverified | 0 |
| HC^2L: Hybrid and Cooperative Contrastive Learning for Cross-lingual Spoken Language Understanding | May 10, 2024 | Contrastive LearningSpoken Language Understanding | —Unverified | 0 |
| Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training | Apr 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages | Apr 3, 2024 | Contrastive LearningMachine Translation | CodeCode Available | 1 |
| Modeling Output-Level Task Relatedness in Multi-Task Learning with Feedback Mechanism | Apr 1, 2024 | Multi-Task LearningSpoken Language Understanding | —Unverified | 0 |
| New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark | Mar 28, 2024 | intent-classificationIntent Classification | CodeCode Available | 0 |
| Uni-MIS: United Multiple Intent Spoken Language Understanding via Multi-View Intent-Slot Interaction | Mar 24, 2024 | Intent Detectionslot-filling | CodeCode Available | 0 |
| Privacy-Preserving End-to-End Spoken Language Understanding | Mar 22, 2024 | Privacy Preservingspeech-recognition | —Unverified | 0 |
| Do Large Language Model Understand Multi-Intent Spoken Language ? | Mar 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| What has LeBenchmark Learnt about French Syntax? | Mar 4, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames | Feb 28, 2024 | DecoderGraph Attention | CodeCode Available | 0 |
| Evaluating and Improving Continual Learning in Spoken Language Understanding | Feb 16, 2024 | Continual LearningSpoken Language Understanding | —Unverified | 0 |
| The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese | Feb 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model | Feb 8, 2024 | modelSpoken Language Understanding | —Unverified | 0 |