SOTAVerified

Spoken Language Understanding

Papers

Showing 125 of 550 papers

TitleStatusHype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning BenchmarkCode7
SyllableLM: Learning Coarse Semantic Units for Speech Language ModelsCode2
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPTCode2
Using Speech Synthesis to Train End-to-End Spoken Language Understanding ModelsCode2
Speech Model Pre-training for End-to-End Spoken Language UnderstandingCode2
"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language UnderstandingCode1
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live StreamsCode1
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate SectorCode1
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and BeyondCode1
Large Language Models for Expansion of Spoken Language Understanding Systems to New LanguagesCode1
Improving fairness for spoken language understanding in atypical speech with Text-to-SpeechCode1
BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation WritingCode1
Joint Multiple Intent Detection and Slot Filling with Supervised Contrastive Learning and Self-DistillationCode1
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?Code1
ITALIC: An Italian Intent Classification DatasetCode1
OpenSLU: A Unified, Modularized, and Extensible Toolkit for Spoken Language UnderstandingCode1
Skit-S2I: An Indian Accented Speech to Intent datasetCode1
Comparative layer-wise analysis of self-supervised speech modelsCode1
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5Code1
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddingsCode1
ESPnet-ONNX: Bridging a Gap Between Research and ProductionCode1
Contrastive Learning for Improving ASR Robustness in Spoken Language UnderstandingCode1
WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language ModelsCode1
AISHELL-NER: Named Entity Recognition from Chinese SpeechCode1
Text is no more Enough! A Benchmark for Profile-based Spoken Language UnderstandingCode1
Show:102550
← PrevPage 1 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer + AMT, character-based)Accuracy (%)99.8Unverified
2UniverSLUAccuracy (%)99.8Unverified
3E2E SLP two-stepAccuracy (%)99.7Unverified
4textual-kd-sluAccuracy (%)99.7Unverified
5Wav2Vec2.0-ClassifierAccuracy (%)99.7Unverified
6Finstreder (Quartznet + AMT)Accuracy (%)99.7Unverified
7Wav2vec 2.0 SSLAccuracy (%)99.6Unverified
8Finstreder (Conformer)Accuracy (%)99.5Unverified
9AT-ATAccuracy (%)99.5Unverified
10BERT, AC PretrainingAccuracy (%)99.4Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer, character-based)Accuracy (%)89Unverified
2Finstreder (Conformer)Accuracy (%)88Unverified
3AT-ATAccuracy (%)84.9Unverified
4Finstreder (Quartznet)Accuracy (%)84.8Unverified
5SnipsAccuracy (%)84.2Unverified
6GoogleAccuracy (%)79.3Unverified
7Real + syntheticAccuracy (%)71.4Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer, character-based)Accuracy-EN (%)87.9Unverified
2Finstreder (Conformer)Accuracy-EN (%)80.4Unverified
3Finstreder (Quartznet)Accuracy-EN (%)77.6Unverified
4SnipsAccuracy-EN (%)68.7Unverified
5GoogleAccuracy-EN (%)47.8Unverified
#ModelMetricClaimedVerifiedStatus
1ALBERTF1 score77.1Unverified
2SpeechBERTF1 score71.75Unverified
3QANet + GANF1 score63.11Unverified
4BaselineF1 score58.71Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer)Accuracy (%)95.4Unverified
2Finstreder (Quartznet)Accuracy (%)90Unverified
3BaselineAccuracy (%)81.6Unverified