SOTAVerified

Spoken Language Understanding

Papers

Showing 150 of 550 papers

TitleStatusHype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning BenchmarkCode7
"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language UnderstandingCode0
ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs0
Exploring the Effect of Segmentation and Vocabulary Size on Speech Tokenization for Speech Language Models0
"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language UnderstandingCode1
QUADS: QUAntized Distillation Framework for Efficient Speech Language UnderstandingCode0
Spoken Language Understanding on Unseen Tasks With In-Context Learning0
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live StreamsCode1
Measuring the Effect of Transcription Noise on Downstream Language Understanding TasksCode0
Joint Automatic Speech Recognition And Structure Learning For Better Speech UnderstandingCode0
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language UnderstandingCode0
Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer0
An Overview and Discussion of the Suitability of Existing Speech Datasets to Train Machine Learning Models for Collective Problem Solving0
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate SectorCode1
A Survey on Speech Large Language Models0
Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding0
SyllableLM: Learning Coarse Semantic Units for Speech Language ModelsCode2
Speech Recognition Rescoring with Large Speech-Text Foundation Models0
MultiMed: Multilingual Medical Speech Recognition via Attention Encoder DecoderCode0
Increasing faithfulness in human-human dialog summarization with Spoken Language Understanding tasks0
Clean Label Attacks against SLU Systems0
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding0
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and BeyondCode1
Out-of-distribution generalisation in spoken language understandingCode0
Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian DialectCode0
Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding0
A Contrastive Learning Approach to Mitigate Bias in Speech ModelsCode0
Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model0
A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding0
CroPrompt: Cross-task Interactive Prompting for Zero-shot Spoken Language Understanding0
On the Evaluation of Speech Foundation Models for Spoken Language Understanding0
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding0
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding0
Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning0
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language UnderstandingCode0
MSNER: A Multilingual Speech Dataset for Named Entity Recognition0
Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants0
HC^2L: Hybrid and Cooperative Contrastive Learning for Cross-lingual Spoken Language Understanding0
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional TrainingCode0
Large Language Models for Expansion of Spoken Language Understanding Systems to New LanguagesCode1
Modeling Output-Level Task Relatedness in Multi-Task Learning with Feedback Mechanism0
New Semantic Task for the French Spoken Language Understanding MEDIA BenchmarkCode0
Uni-MIS: United Multiple Intent Spoken Language Understanding via Multi-View Intent-Slot InteractionCode0
Privacy-Preserving End-to-End Spoken Language Understanding0
Do Large Language Model Understand Multi-Intent Spoken Language ?Code0
What has LeBenchmark Learnt about French Syntax?0
A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic FramesCode0
Evaluating and Improving Continual Learning in Spoken Language Understanding0
The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese0
Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model0
Show:102550
← PrevPage 1 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer + AMT, character-based)Accuracy (%)99.8Unverified
2UniverSLUAccuracy (%)99.8Unverified
3Finstreder (Quartznet + AMT)Accuracy (%)99.7Unverified
4textual-kd-sluAccuracy (%)99.7Unverified
5Wav2Vec2.0-ClassifierAccuracy (%)99.7Unverified
6E2E SLP two-stepAccuracy (%)99.7Unverified
7Wav2vec 2.0 SSLAccuracy (%)99.6Unverified
8Finstreder (Conformer)Accuracy (%)99.5Unverified
9AT-ATAccuracy (%)99.5Unverified
10BERT, AC PretrainingAccuracy (%)99.4Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer, character-based)Accuracy (%)89Unverified
2Finstreder (Conformer)Accuracy (%)88Unverified
3AT-ATAccuracy (%)84.9Unverified
4Finstreder (Quartznet)Accuracy (%)84.8Unverified
5SnipsAccuracy (%)84.2Unverified
6GoogleAccuracy (%)79.3Unverified
7Real + syntheticAccuracy (%)71.4Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer, character-based)Accuracy-EN (%)87.9Unverified
2Finstreder (Conformer)Accuracy-EN (%)80.4Unverified
3Finstreder (Quartznet)Accuracy-EN (%)77.6Unverified
4SnipsAccuracy-EN (%)68.7Unverified
5GoogleAccuracy-EN (%)47.8Unverified
#ModelMetricClaimedVerifiedStatus
1ALBERTF1 score77.1Unverified
2SpeechBERTF1 score71.75Unverified
3QANet + GANF1 score63.11Unverified
4BaselineF1 score58.71Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer)Accuracy (%)95.4Unverified
2Finstreder (Quartznet)Accuracy (%)90Unverified
3BaselineAccuracy (%)81.6Unverified