Spoken Language Understanding

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 550 papers

Title	Date	Tasks	Status	Hype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark	Jun 5, 2025	RhythmSpoken Language Understanding	CodeCode Available	7
SyllableLM: Learning Coarse Semantic Units for Speech Language Models	Oct 5, 2024	ClusteringLanguage Modeling	CodeCode Available	2
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT	Oct 7, 2023	Audio captioningAutomatic Speech Recognition	CodeCode Available	2
Using Speech Synthesis to Train End-to-End Spoken Language Understanding Models	Oct 21, 2019	Data AugmentationNatural Language Understanding	CodeCode Available	2
Speech Model Pre-training for End-to-End Spoken Language Understanding	Apr 7, 2019	Speech-to-TextSpoken Language Understanding	CodeCode Available	2
"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding	May 21, 2025	Machine UnlearningSpoken Language Understanding	CodeCode Available	1
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams	Apr 24, 2025	Long-Context UnderstandingSpoken Language Understanding	CodeCode Available	1
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector	Dec 13, 2024	In-Context LearningQuestion Answering	CodeCode Available	1
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond	Aug 7, 2024	BenchmarkingLanguage Identification	CodeCode Available	1
Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages	Apr 3, 2024	Contrastive LearningMachine Translation	CodeCode Available	1
Improving fairness for spoken language understanding in atypical speech with Text-to-Speech	Nov 16, 2023	Data AugmentationFairness	CodeCode Available	1
BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing	Sep 2, 2023	speech-recognitionSpeech Recognition	CodeCode Available	1
Joint Multiple Intent Detection and Slot Filling with Supervised Contrastive Learning and Self-Distillation	Aug 28, 2023	Contrastive LearningIntent Detection	CodeCode Available	1
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?	Jun 14, 2023	Natural Language UnderstandingSelf-Supervised Learning	CodeCode Available	1
ITALIC: An Italian Intent Classification Dataset	Jun 14, 2023	Classificationintent-classification	CodeCode Available	1
OpenSLU: A Unified, Modularized, and Extensible Toolkit for Spoken Language Understanding	May 17, 2023	Spoken Language Understanding	CodeCode Available	1
Skit-S2I: An Indian Accented Speech to Intent dataset	Dec 26, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
Comparative layer-wise analysis of self-supervised speech models	Nov 8, 2022	speech-recognitionSpeech Recognition	CodeCode Available	1
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5	Nov 1, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings	Oct 23, 2022	Acoustic Unit DiscoveryContrastive Learning	CodeCode Available	1
ESPnet-ONNX: Bridging a Gap Between Research and Production	Sep 20, 2022	Spoken Language Understanding	CodeCode Available	1
Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding	May 2, 2022	Contrastive LearningSpoken Language Understanding	CodeCode Available	1
WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models	Mar 29, 2022	Few-Shot LearningLanguage Modeling	CodeCode Available	1
AISHELL-NER: Named Entity Recognition from Chinese Speech	Feb 17, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding	Dec 22, 2021	Intent DetectionSemantic Frame Parsing	CodeCode Available	1
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet	Nov 29, 2021	Spoken Language Understandingtext-to-speech	CodeCode Available	1
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech	Nov 19, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
A Label-Aware BERT Attention Network for Zero-Shot Multi-Intent Detection in Spoken Language Understanding	Nov 1, 2021	Intent DetectionSpoken Language Understanding	CodeCode Available	1
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification	Aug 5, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition	Jul 5, 2021	SegmentationSpecificity	CodeCode Available	1
N-Best ASR Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses	Jun 11, 2021	Spoken Language Understanding	CodeCode Available	1
SpeechBrain: A General-Purpose Speech Toolkit	Jun 8, 2021	Language IdentificationSpoken Language Understanding	CodeCode Available	1
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech	Apr 23, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
RNN Transducer Models For Spoken Language Understanding	Apr 8, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs	Apr 7, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
A Survey on Spoken Language Understanding: Recent Advances and New Frontiers	Mar 4, 2021	Spoken Language UnderstandingSurvey	CodeCode Available	1
C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling	Dec 13, 2020	Data AugmentationDiversity	CodeCode Available	1
Triplet Entropy Loss: Improving The Generalisation of Short Speech Language Identification Systems	Dec 3, 2020	Language IdentificationSpeech Language Identification	CodeCode Available	1
SLURP: A Spoken Language Understanding Resource Package	Nov 26, 2020	Intent ClassificationSlot Filling	CodeCode Available	1
Adapting Pretrained Transformer to Lattices for Spoken Language Understanding	Nov 2, 2020	Natural Language Understandingspeech-recognition	CodeCode Available	1
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining	Oct 26, 2020	Language ModelingLanguage Modelling	CodeCode Available	1
A Co-Interactive Transformer for Joint Slot Filling and Intent Detection	Oct 8, 2020	Intent Detectionslot-filling	CodeCode Available	1
Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding	Oct 8, 2020	Intent DetectionSentence	CodeCode Available	1
SlotRefine: A Fast Non-Autoregressive Model for Joint Intent Detection and Slot Filling	Oct 6, 2020	Intent Detectionslot-filling	CodeCode Available	1
SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding	Oct 5, 2020	Language ModelingLanguage Modelling	CodeCode Available	1
Cross-lingual Spoken Language Understanding with Regularized Representation Alignment	Sep 30, 2020	SentenceSpoken Language Understanding	CodeCode Available	1
Learning Spoken Language Representations with Neural Lattice Language Modeling	Jul 6, 2020	Intent DetectionLanguage Modeling	CodeCode Available	1
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding	May 24, 2020	Spoken Language Understanding	CodeCode Available	1
Data Augmentation for Spoken Language Understanding via Pretrained Language Models	Apr 29, 2020	Data AugmentationSpoken Language Understanding	CodeCode Available	1
AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent Detection and Slot Filling	Apr 21, 2020	Intent DetectionSemantic Frame Parsing	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 11Next →

All datasets Fluent Speech Commands Snips-SmartLights Snips-SmartSpeaker Spoken-SQuAD Timers and Such

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer + AMT, character-based)	Accuracy (%)	99.8	—	Unverified
2	UniverSLU	Accuracy (%)	99.8	—	Unverified
3	E2E SLP two-step	Accuracy (%)	99.7	—	Unverified
4	textual-kd-slu	Accuracy (%)	99.7	—	Unverified
5	Wav2Vec2.0-Classifier	Accuracy (%)	99.7	—	Unverified
6	Finstreder (Quartznet + AMT)	Accuracy (%)	99.7	—	Unverified
7	Wav2vec 2.0 SSL	Accuracy (%)	99.6	—	Unverified
8	Finstreder (Conformer)	Accuracy (%)	99.5	—	Unverified
9	AT-AT	Accuracy (%)	99.5	—	Unverified
10	BERT, AC Pretraining	Accuracy (%)	99.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer, character-based)	Accuracy (%)	89	—	Unverified
2	Finstreder (Conformer)	Accuracy (%)	88	—	Unverified
3	AT-AT	Accuracy (%)	84.9	—	Unverified
4	Finstreder (Quartznet)	Accuracy (%)	84.8	—	Unverified
5	Snips	Accuracy (%)	84.2	—	Unverified
6	Google	Accuracy (%)	79.3	—	Unverified
7	Real + synthetic	Accuracy (%)	71.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer, character-based)	Accuracy-EN (%)	87.9	—	Unverified
2	Finstreder (Conformer)	Accuracy-EN (%)	80.4	—	Unverified
3	Finstreder (Quartznet)	Accuracy-EN (%)	77.6	—	Unverified
4	Snips	Accuracy-EN (%)	68.7	—	Unverified
5	Google	Accuracy-EN (%)	47.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ALBERT	F1 score	77.1	—	Unverified
2	SpeechBERT	F1 score	71.75	—	Unverified
3	QANet + GAN	F1 score	63.11	—	Unverified
4	Baseline	F1 score	58.71	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer)	Accuracy (%)	95.4	—	Unverified
2	Finstreder (Quartznet)	Accuracy (%)	90	—	Unverified
3	Baseline	Accuracy (%)	81.6	—	Unverified