Spoken Language Understanding

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 550 papers

Title	Date	Tasks	Status	Hype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark	Jun 5, 2025	RhythmSpoken Language Understanding	CodeCode Available	7
"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding	May 26, 2025	Kolmogorov-Arnold NetworksSpoken Language Understanding	CodeCode Available	0
ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs	May 26, 2025	cross-modal alignmentEmotion Recognition	—Unverified	0
Exploring the Effect of Segmentation and Vocabulary Size on Speech Tokenization for Speech Language Models	May 23, 2025	Speech TokenizationSpoken Language Understanding	—Unverified	0
"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding	May 21, 2025	Machine UnlearningSpoken Language Understanding	CodeCode Available	1
QUADS: QUAntized Distillation Framework for Efficient Speech Language Understanding	May 19, 2025	QuantizationSpoken Language Understanding	CodeCode Available	0
Spoken Language Understanding on Unseen Tasks With In-Context Learning	May 12, 2025	In-Context LearningSpoken Language Understanding	—Unverified	0
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams	Apr 24, 2025	Long-Context UnderstandingSpoken Language Understanding	CodeCode Available	1
Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks	Feb 19, 2025	Automatic Speech Recognitionspeech-recognition	CodeCode Available	0
Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding	Jan 13, 2025	Automatic Speech Recognitionintent-classification	CodeCode Available	0
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding	Jan 10, 2025	Automatic Speech RecognitionClassification	CodeCode Available	0
Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer	Jan 3, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
An Overview and Discussion of the Suitability of Existing Speech Datasets to Train Machine Learning Models for Collective Problem Solving	Dec 24, 2024	Decision MakingSpoken Language Understanding	—Unverified	0
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector	Dec 13, 2024	In-Context LearningQuestion Answering	CodeCode Available	1
A Survey on Speech Large Language Models	Oct 24, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding	Oct 21, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
SyllableLM: Learning Coarse Semantic Units for Speech Language Models	Oct 5, 2024	ClusteringLanguage Modeling	CodeCode Available	2
Speech Recognition Rescoring with Large Speech-Text Foundation Models	Sep 25, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder	Sep 21, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Increasing faithfulness in human-human dialog summarization with Spoken Language Understanding tasks	Sep 16, 2024	Spoken Language Understanding	—Unverified	0
Clean Label Attacks against SLU Systems	Sep 13, 2024	Data Poisoningspeech-recognition	—Unverified	0
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding	Aug 29, 2024	slot-fillingSlot Filling	—Unverified	0
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond	Aug 7, 2024	BenchmarkingLanguage Identification	CodeCode Available	1
Out-of-distribution generalisation in spoken language understanding	Jul 10, 2024	Spoken Language Understanding	CodeCode Available	0
Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian Dialect	Jul 5, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0

Show:10 25 50

← PrevPage 1 of 22Next →

All datasets Fluent Speech Commands Snips-SmartLights Snips-SmartSpeaker Spoken-SQuAD Timers and Such

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer + AMT, character-based)	Accuracy (%)	99.8	—	Unverified
2	UniverSLU	Accuracy (%)	99.8	—	Unverified
3	E2E SLP two-step	Accuracy (%)	99.7	—	Unverified
4	textual-kd-slu	Accuracy (%)	99.7	—	Unverified
5	Wav2Vec2.0-Classifier	Accuracy (%)	99.7	—	Unverified
6	Finstreder (Quartznet + AMT)	Accuracy (%)	99.7	—	Unverified
7	Wav2vec 2.0 SSL	Accuracy (%)	99.6	—	Unverified
8	Finstreder (Conformer)	Accuracy (%)	99.5	—	Unverified
9	AT-AT	Accuracy (%)	99.5	—	Unverified
10	BERT, AC Pretraining	Accuracy (%)	99.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer, character-based)	Accuracy (%)	89	—	Unverified
2	Finstreder (Conformer)	Accuracy (%)	88	—	Unverified
3	AT-AT	Accuracy (%)	84.9	—	Unverified
4	Finstreder (Quartznet)	Accuracy (%)	84.8	—	Unverified
5	Snips	Accuracy (%)	84.2	—	Unverified
6	Google	Accuracy (%)	79.3	—	Unverified
7	Real + synthetic	Accuracy (%)	71.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer, character-based)	Accuracy-EN (%)	87.9	—	Unverified
2	Finstreder (Conformer)	Accuracy-EN (%)	80.4	—	Unverified
3	Finstreder (Quartznet)	Accuracy-EN (%)	77.6	—	Unverified
4	Snips	Accuracy-EN (%)	68.7	—	Unverified
5	Google	Accuracy-EN (%)	47.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ALBERT	F1 score	77.1	—	Unverified
2	SpeechBERT	F1 score	71.75	—	Unverified
3	QANet + GAN	F1 score	63.11	—	Unverified
4	Baseline	F1 score	58.71	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer)	Accuracy (%)	95.4	—	Unverified
2	Finstreder (Quartznet)	Accuracy (%)	90	—	Unverified
3	Baseline	Accuracy (%)	81.6	—	Unverified