Spoken Language Understanding

Papers

Showing 1–50 of 550 papers

Title	Date	Tasks	Status	Hype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark	Jun 5, 2025	Rhythm, Spoken Language Understanding	Code Available	7
"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding	May 26, 2025	Kolmogorov-Arnold Networks, Spoken Language Understanding	Code Available	0
ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs	May 26, 2025	cross-modal alignment, Emotion Recognition	Unverified	0
Exploring the Effect of Segmentation and Vocabulary Size on Speech Tokenization for Speech Language Models	May 23, 2025	Speech Tokenization, Spoken Language Understanding	Unverified	0
"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding	May 21, 2025	Machine Unlearning, Spoken Language Understanding	Code Available	1
QUADS: QUAntized Distillation Framework for Efficient Speech Language Understanding	May 19, 2025	Quantization, Spoken Language Understanding	Code Available	0
Spoken Language Understanding on Unseen Tasks With In-Context Learning	May 12, 2025	In-Context Learning, Spoken Language Understanding	Unverified	0
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams	Apr 24, 2025	Long-Context Understanding, Spoken Language Understanding	Code Available	1
Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks	Feb 19, 2025	Automatic Speech Recognition, speech-recognition	Code Available	0
Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding	Jan 13, 2025	Automatic Speech Recognition, intent-classification	Code Available	0
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding	Jan 10, 2025	Automatic Speech Recognition, Classification	Code Available	0
Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer	Jan 3, 2025	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
An Overview and Discussion of the Suitability of Existing Speech Datasets to Train Machine Learning Models for Collective Problem Solving	Dec 24, 2024	Decision Making, Spoken Language Understanding	Unverified	0
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector	Dec 13, 2024	In-Context Learning, Question Answering	Code Available	1
A Survey on Speech Large Language Models	Oct 24, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding	Oct 21, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
SyllableLM: Learning Coarse Semantic Units for Speech Language Models	Oct 5, 2024	Clustering, Language Modeling	Code Available	2
Speech Recognition Rescoring with Large Speech-Text Foundation Models	Sep 25, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder	Sep 21, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Increasing faithfulness in human-human dialog summarization with Spoken Language Understanding tasks	Sep 16, 2024	Spoken Language Understanding	Unverified	0
Clean Label Attacks against SLU Systems	Sep 13, 2024	Data Poisoning, speech-recognition	Unverified	0
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding	Aug 29, 2024	slot-filling, Slot Filling	Unverified	0
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond	Aug 7, 2024	Benchmarking, Language Identification	Code Available	1
Out-of-distribution generalisation in spoken language understanding	Jul 10, 2024	Spoken Language Understanding	Code Available	0
Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian Dialect	Jul 5, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding	Jun 21, 2024	Cross-corpus, Decoder	Unverified	0
A Contrastive Learning Approach to Mitigate Bias in Speech Models	Jun 20, 2024	Contrastive Learning, Spoken Language Understanding	Code Available	0
Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model	Jun 18, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding	Jun 17, 2024	Self-Supervised Learning, Spoken Language Understanding	Unverified	0
CroPrompt: Cross-task Interactive Prompting for Zero-shot Spoken Language Understanding	Jun 15, 2024	Intent Detection, slot-filling	Unverified	0
On the Evaluation of Speech Foundation Models for Spoken Language Understanding	Jun 14, 2024	Benchmarking, Prediction	Unverified	0
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding	Jun 13, 2024	Instruction Following, Language Modeling	Unverified	0
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding	Jun 12, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning	May 31, 2024	Contrastive Learning, Intent Detection	Unverified	0
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding	May 23, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	0
MSNER: A Multilingual Speech Dataset for Named Entity Recognition	May 19, 2024	named-entity-recognition, Named Entity Recognition	Unverified	0
Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants	May 14, 2024	Automatic Speech Recognition, Diversity	Unverified	0
HC^2L: Hybrid and Cooperative Contrastive Learning for Cross-lingual Spoken Language Understanding	May 10, 2024	Contrastive Learning, Spoken Language Understanding	Unverified	0
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training	Apr 16, 2024	Language Modeling, Language Modelling	Code Available	0
Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages	Apr 3, 2024	Contrastive Learning, Machine Translation	Code Available	1
Modeling Output-Level Task Relatedness in Multi-Task Learning with Feedback Mechanism	Apr 1, 2024	Multi-Task Learning, Spoken Language Understanding	Unverified	0
New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark	Mar 28, 2024	intent-classification, Intent Classification	Code Available	0
Uni-MIS: United Multiple Intent Spoken Language Understanding via Multi-View Intent-Slot Interaction	Mar 24, 2024	Intent Detection, slot-filling	Code Available	0
Privacy-Preserving End-to-End Spoken Language Understanding	Mar 22, 2024	Privacy Preserving, speech-recognition	Unverified	0
Do Large Language Model Understand Multi-Intent Spoken Language ?	Mar 7, 2024	Language Modeling, Language Modelling	Code Available	0
What has LeBenchmark Learnt about French Syntax?	Mar 4, 2024	Automatic Speech Recognition, speech-recognition	Unverified	0
A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames	Feb 28, 2024	Decoder, Graph Attention	Code Available	0
Evaluating and Improving Continual Learning in Spoken Language Understanding	Feb 16, 2024	Continual Learning, Spoken Language Understanding	Unverified	0
The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese	Feb 12, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model	Feb 8, 2024	model, Spoken Language Understanding	Unverified	0


Datasets: Fluent Speech Commands, Snips-SmartLights, Snips-SmartSpeaker, Spoken-SQuAD, Timers and Such

Benchmark Results

Fluent Speech Commands

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer + AMT, character-based)	Accuracy (%)	99.8	—	Unverified
2	UniverSLU	Accuracy (%)	99.8	—	Unverified
3	E2E SLP two-step	Accuracy (%)	99.7	—	Unverified
4	textual-kd-slu	Accuracy (%)	99.7	—	Unverified
5	Wav2Vec2.0-Classifier	Accuracy (%)	99.7	—	Unverified
6	Finstreder (Quartznet + AMT)	Accuracy (%)	99.7	—	Unverified
7	Wav2vec 2.0 SSL	Accuracy (%)	99.6	—	Unverified
8	Finstreder (Conformer)	Accuracy (%)	99.5	—	Unverified
9	AT-AT	Accuracy (%)	99.5	—	Unverified
10	BERT, AC Pretraining	Accuracy (%)	99.4	—	Unverified

Snips-SmartLights

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer, character-based)	Accuracy (%)	89	—	Unverified
2	Finstreder (Conformer)	Accuracy (%)	88	—	Unverified
3	AT-AT	Accuracy (%)	84.9	—	Unverified
4	Finstreder (Quartznet)	Accuracy (%)	84.8	—	Unverified
5	Snips	Accuracy (%)	84.2	—	Unverified
6	Google	Accuracy (%)	79.3	—	Unverified
7	Real + synthetic	Accuracy (%)	71.4	—	Unverified

Snips-SmartSpeaker

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer, character-based)	Accuracy-EN (%)	87.9	—	Unverified
2	Finstreder (Conformer)	Accuracy-EN (%)	80.4	—	Unverified
3	Finstreder (Quartznet)	Accuracy-EN (%)	77.6	—	Unverified
4	Snips	Accuracy-EN (%)	68.7	—	Unverified
5	Google	Accuracy-EN (%)	47.8	—	Unverified

Spoken-SQuAD

#	Model	Metric	Claimed	Verified	Status
1	ALBERT	F1 score	77.1	—	Unverified
2	SpeechBERT	F1 score	71.75	—	Unverified
3	QANet + GAN	F1 score	63.11	—	Unverified
4	Baseline	F1 score	58.71	—	Unverified

Timers and Such

#	Model	Metric	Claimed	Verified	Status
1	Finstreder (Conformer)	Accuracy (%)	95.4	—	Unverified
2	Finstreder (Quartznet)	Accuracy (%)	90	—	Unverified
3	Baseline	Accuracy (%)	81.6	—	Unverified