SOTAVerified

Spoken Language Understanding

Papers

Showing 150 of 550 papers

Title | Status | Hype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark | Code | 7
SyllableLM: Learning Coarse Semantic Units for Speech Language Models | Code | 2
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT | Code | 2
Using Speech Synthesis to Train End-to-End Spoken Language Understanding Models | Code | 2
Speech Model Pre-training for End-to-End Spoken Language Understanding | Code | 2
"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding | Code | 1
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams | Code | 1
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate Sector | Code | 1
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond | Code | 1
Large Language Models for Expansion of Spoken Language Understanding Systems to New Languages | Code | 1
Improving fairness for spoken language understanding in atypical speech with Text-to-Speech | Code | 1
BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing | Code | 1
Joint Multiple Intent Detection and Slot Filling with Supervised Contrastive Learning and Self-Distillation | Code | 1
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge? | Code | 1
ITALIC: An Italian Intent Classification Dataset | Code | 1
OpenSLU: A Unified, Modularized, and Extensible Toolkit for Spoken Language Understanding | Code | 1
Skit-S2I: An Indian Accented Speech to Intent dataset | Code | 1
Comparative layer-wise analysis of self-supervised speech models | Code | 1
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5 | Code | 1
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings | Code | 1
ESPnet-ONNX: Bridging a Gap Between Research and Production | Code | 1
Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding | Code | 1
WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models | Code | 1
AISHELL-NER: Named Entity Recognition from Chinese Speech | Code | 1
Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding | Code | 1
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet | Code | 1
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech | Code | 1
A Label-Aware BERT Attention Network for Zero-Shot Multi-Intent Detection in Spoken Language Understanding | Code | 1
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification | Code | 1
What Helps Transformers Recognize Conversational Structure? Importance of Context, Punctuation, and Labels in Dialog Act Recognition | Code | 1
N-Best ASR Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses | Code | 1
SpeechBrain: A General-Purpose Speech Toolkit | Code | 1
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech | Code | 1
RNN Transducer Models For Spoken Language Understanding | Code | 1
Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs | Code | 1
A Survey on Spoken Language Understanding: Recent Advances and New Frontiers | Code | 1
C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling | Code | 1
Triplet Entropy Loss: Improving The Generalisation of Short Speech Language Identification Systems | Code | 1
SLURP: A Spoken Language Understanding Resource Package | Code | 1
Adapting Pretrained Transformer to Lattices for Spoken Language Understanding | Code | 1
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining | Code | 1
A Co-Interactive Transformer for Joint Slot Filling and Intent Detection | Code | 1
Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding | Code | 1
SlotRefine: A Fast Non-Autoregressive Model for Joint Intent Detection and Slot Filling | Code | 1
SPLAT: Speech-Language Joint Pre-Training for Spoken Language Understanding | Code | 1
Cross-lingual Spoken Language Understanding with Regularized Representation Alignment | Code | 1
Learning Spoken Language Representations with Neural Lattice Language Modeling | Code | 1
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding | Code | 1
Data Augmentation for Spoken Language Understanding via Pretrained Language Models | Code | 1
AGIF: An Adaptive Graph-Interactive Framework for Joint Multiple Intent Detection and Slot Filling | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Finstreder (Conformer + AMT, character-based) | Accuracy (%) | 99.8 |  | Unverified
2 | UniverSLU | Accuracy (%) | 99.8 |  | Unverified
3 | E2E SLP two-step | Accuracy (%) | 99.7 |  | Unverified
4 | textual-kd-slu | Accuracy (%) | 99.7 |  | Unverified
5 | Wav2Vec2.0-Classifier | Accuracy (%) | 99.7 |  | Unverified
6 | Finstreder (Quartznet + AMT) | Accuracy (%) | 99.7 |  | Unverified
7 | Wav2vec 2.0 SSL | Accuracy (%) | 99.6 |  | Unverified
8 | Finstreder (Conformer) | Accuracy (%) | 99.5 |  | Unverified
9 | AT-AT | Accuracy (%) | 99.5 |  | Unverified
10 | BERT, AC Pretraining | Accuracy (%) | 99.4 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Finstreder (Conformer, character-based) | Accuracy (%) | 89 |  | Unverified
2 | Finstreder (Conformer) | Accuracy (%) | 88 |  | Unverified
3 | AT-AT | Accuracy (%) | 84.9 |  | Unverified
4 | Finstreder (Quartznet) | Accuracy (%) | 84.8 |  | Unverified
5 | Snips | Accuracy (%) | 84.2 |  | Unverified
6 | Google | Accuracy (%) | 79.3 |  | Unverified
7 | Real + synthetic | Accuracy (%) | 71.4 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Finstreder (Conformer, character-based) | Accuracy-EN (%) | 87.9 |  | Unverified
2 | Finstreder (Conformer) | Accuracy-EN (%) | 80.4 |  | Unverified
3 | Finstreder (Quartznet) | Accuracy-EN (%) | 77.6 |  | Unverified
4 | Snips | Accuracy-EN (%) | 68.7 |  | Unverified
5 | Google | Accuracy-EN (%) | 47.8 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ALBERT | F1 score | 77.1 |  | Unverified
2 | SpeechBERT | F1 score | 71.75 |  | Unverified
3 | QANet + GAN | F1 score | 63.11 |  | Unverified
4 | Baseline | F1 score | 58.71 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Finstreder (Conformer) | Accuracy (%) | 95.4 |  | Unverified
2 | Finstreder (Quartznet) | Accuracy (%) | 90 |  | Unverified
3 | Baseline | Accuracy (%) | 81.6 |  | Unverified