Spoken Language Understanding

Papers


Showing 1–50 of 550 papers

Title	Date	Tasks	Status	Hype	Score
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark	Jun 5, 2025	Rhythm, Spoken Language Understanding	Code Available	7	5
Speech Model Pre-training for End-to-End Spoken Language Understanding	Apr 7, 2019	Speech-to-Text, Spoken Language Understanding	Code Available	2	5
Using Speech Synthesis to Train End-to-End Spoken Language Understanding Models	Oct 21, 2019	Data Augmentation, Natural Language Understanding	Code Available	2	5
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT	Oct 7, 2023	Audio captioning, Automatic Speech Recognition	Code Available	2	5
SyllableLM: Learning Coarse Semantic Units for Speech Language Models	Oct 5, 2024	Clustering, Language Modeling	Code Available	2	5
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?	Jun 14, 2023	Natural Language Understanding, Self-Supervised Learning	Code Available	1	5
SpeechBrain: A General-Purpose Speech Toolkit	Jun 8, 2021	Language Identification, Spoken Language Understanding	Code Available	1	5
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5	Nov 1, 2022	Language Modeling, Language Modelling	Code Available	1	5
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining	Oct 26, 2020	Language Modeling, Language Modelling	Code Available	1	5
Cross-lingual Spoken Language Understanding with Regularized Representation Alignment	Sep 30, 2020	Sentence, Spoken Language Understanding	Code Available	1	5
Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces	May 25, 2018	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs	Apr 7, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond	Aug 7, 2024	Benchmarking, Language Identification	Code Available	1	5
Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension	Apr 1, 2018	Question Answering, Reading Comprehension	Code Available	1	5
Skit-S2I: An Indian Accented Speech to Intent dataset	Dec 26, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings	Oct 23, 2022	Acoustic Unit Discovery, Contrastive Learning	Code Available	1	5
Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages	Apr 3, 2024	Contrastive Learning, Machine Translation	Code Available	1	5
Learning Spoken Language Representations with Neural Lattice Language Modeling	Jul 6, 2020	Intent Detection, Language Modeling	Code Available	1	5
A Co-Interactive Transformer for Joint Slot Filling and Intent Detection	Oct 8, 2020	Intent Detection, slot-filling	Code Available	1	5
Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation	Apr 16, 2019	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
OpenSLU: A Unified, Modularized, and Extensible Toolkit for Spoken Language Understanding	May 17, 2023	Spoken Language Understanding	Code Available	1	5
SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding	Oct 5, 2020	Language Modeling, Language Modelling	Code Available	1	5
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech	Nov 19, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
SLURP: A Spoken Language Understanding Resource Package	Nov 26, 2020	Intent Classification, Slot Filling	Code Available	1	5
ESPnet-ONNX: Bridging a Gap Between Research and Production	Sep 20, 2022	Spoken Language Understanding	Code Available	1	5
AISHELL-NER: Named Entity Recognition from Chinese Speech	Feb 17, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding	May 24, 2020	Spoken Language Understanding	Code Available	1	5
"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding	May 21, 2025	Machine Unlearning, Spoken Language Understanding	Code Available	1	5
Spoken Language Understanding on the Edge	Oct 30, 2018	Spoken Language Understanding	Code Available	1	5
A Label-Aware BERT Attention Network for Zero-Shot Multi-Intent Detection in Spoken Language Understanding	Nov 1, 2021	Intent Detection, Spoken Language Understanding	Code Available	1	5
Improving fairness for spoken language understanding in atypical speech with Text-to-Speech	Nov 16, 2023	Data Augmentation, Fairness	Code Available	1	5
Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding	Oct 8, 2020	Intent Detection, Sentence	Code Available	1	5
A Survey on Spoken Language Understanding: Recent Advances and New Frontiers	Mar 4, 2021	Spoken Language Understanding, Survey	Code Available	1	5
ITALIC: An Italian Intent Classification Dataset	Jun 14, 2023	Classification, intent-classification	Code Available	1	5
C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling	Dec 13, 2020	Data Augmentation, Diversity	Code Available	1	5
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification	Aug 5, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
Joint Multiple Intent Detection and Slot Filling with Supervised Contrastive Learning and Self-Distillation	Aug 28, 2023	Contrastive Learning, Intent Detection	Code Available	1	5
BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing	Sep 2, 2023	speech-recognition, Speech Recognition	Code Available	1	5
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech	Apr 23, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
Comparative layer-wise analysis of self-supervised speech models	Nov 8, 2022	speech-recognition, Speech Recognition	Code Available	1	5
Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding	May 2, 2022	Contrastive Learning, Spoken Language Understanding	Code Available	1	5
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams	Apr 24, 2025	Long-Context Understanding, Spoken Language Understanding	Code Available	1	5
Data Augmentation for Spoken Language Understanding via Pretrained Language Models	Apr 29, 2020	Data Augmentation, Spoken Language Understanding	Code Available	1	5
N-Best ASR Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses	Jun 11, 2021	Spoken Language Understanding	Code Available	1	5
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector	Dec 13, 2024	In-Context Learning, Question Answering	Code Available	1	5
RNN Transducer Models For Spoken Language Understanding	Apr 8, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	1	5
Adapting Pretrained Transformer to Lattices for Spoken Language Understanding	Nov 2, 2020	Natural Language Understanding, speech-recognition	Code Available	1	5
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet	Nov 29, 2021	Spoken Language Understanding, text-to-speech	Code Available	1	5
A Hierarchical Decoding Model For Spoken Language Understanding From Unaligned Data	Apr 9, 2019	Spoken Language Understanding	Code Available	1	5
SlotRefine: A Fast Non-Autoregressive Model for Joint Intent Detection and Slot Filling	Oct 6, 2020	Intent Detection, slot-filling	Code Available	1	5


Datasets: Fluent Speech Commands, Snips-SmartLights, Snips-SmartSpeaker, Spoken-SQuAD, Timers and Such

Benchmark Results

Fluent Speech Commands

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer + AMT, character-based)	Accuracy (%)	99.8	—	Unverified
2	UniverSLU	Accuracy (%)	99.8	—	Unverified
3	E2E SLP two-step	Accuracy (%)	99.7	—	Unverified
4	textual-kd-slu	Accuracy (%)	99.7	—	Unverified
5	Wav2Vec2.0-Classifier	Accuracy (%)	99.7	—	Unverified
6	Finstreder (Quartznet + AMT)	Accuracy (%)	99.7	—	Unverified
7	Wav2vec 2.0 SSL	Accuracy (%)	99.6	—	Unverified
8	Finstreder (Conformer)	Accuracy (%)	99.5	—	Unverified
9	AT-AT	Accuracy (%)	99.5	—	Unverified
10	BERT, AC Pretraining	Accuracy (%)	99.4	—	Unverified

Snips-SmartLights

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer, character-based)	Accuracy (%)	89	—	Unverified
2	Finstreder (Conformer)	Accuracy (%)	88	—	Unverified
3	AT-AT	Accuracy (%)	84.9	—	Unverified
4	Finstreder (Quartznet)	Accuracy (%)	84.8	—	Unverified
5	Snips	Accuracy (%)	84.2	—	Unverified
6	Google	Accuracy (%)	79.3	—	Unverified
7	Real + synthetic	Accuracy (%)	71.4	—	Unverified

Snips-SmartSpeaker

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer, character-based)	Accuracy-EN (%)	87.9	—	Unverified
2	Finstreder (Conformer)	Accuracy-EN (%)	80.4	—	Unverified
3	Finstreder (Quartznet)	Accuracy-EN (%)	77.6	—	Unverified
4	Snips	Accuracy-EN (%)	68.7	—	Unverified
5	Google	Accuracy-EN (%)	47.8	—	Unverified

Spoken-SQuAD

#	Model	Metric	Claimed	Verified	Status
1	ALBERT	F1 score	77.1	—	Unverified
2	SpeechBERT	F1 score	71.75	—	Unverified
3	QANet + GAN	F1 score	63.11	—	Unverified
4	Baseline	F1 score	58.71	—	Unverified

Timers and Such

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer)	Accuracy (%)	95.4	—	Unverified
2	Finstreder (Quartznet)	Accuracy (%)	90	—	Unverified
3	Baseline	Accuracy (%)	81.6	—	Unverified