SOTAVerified

Spoken Language Understanding

Papers

Showing 150 of 550 papers

TitleStatusHype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning BenchmarkCode7
Speech Model Pre-training for End-to-End Spoken Language UnderstandingCode2
Using Speech Synthesis to Train End-to-End Spoken Language Understanding ModelsCode2
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPTCode2
SyllableLM: Learning Coarse Semantic Units for Speech Language ModelsCode2
Spoken Language Understanding on the EdgeCode1
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and BeyondCode1
T5lephone: Bridging Speech and Text Self-supervised Models for Spoken Language Understanding via Phoneme level T5Code1
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model PretrainingCode1
N-Best ASR Transformer: Enhancing SLU Performance using Multiple ASR HypothesesCode1
Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible InputsCode1
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?Code1
ESPnet-ONNX: Bridging a Gap Between Research and ProductionCode1
Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening ComprehensionCode1
Skit-S2I: An Indian Accented Speech to Intent datasetCode1
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddingsCode1
Large Language Models for Expansion of Spoken Language Understanding Systems to New LanguagesCode1
Learning Spoken Language Representations with Neural Lattice Language ModelingCode1
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live StreamsCode1
Comparative layer-wise analysis of self-supervised speech modelsCode1
OpenSLU: A Unified, Modularized, and Extensible Toolkit for Spoken Language UnderstandingCode1
SPLAT: Speech-Language Joint Pre-Training for Spoken Language UnderstandingCode1
SLURP: A Spoken Language Understanding Resource PackageCode1
Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfacesCode1
SpeechBrain: A General-Purpose Speech ToolkitCode1
AISHELL-NER: Named Entity Recognition from Chinese SpeechCode1
ITALIC: An Italian Intent Classification DatasetCode1
"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language UnderstandingCode1
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnetCode1
A Label-Aware BERT Attention Network for Zero-Shot Multi-Intent Detection in Spoken Language UnderstandingCode1
C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot FillingCode1
Improving fairness for spoken language understanding in atypical speech with Text-to-SpeechCode1
A Survey on Spoken Language Understanding: Recent Advances and New FrontiersCode1
Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language UnderstandingCode1
Joint Multiple Intent Detection and Slot Filling with Supervised Contrastive Learning and Self-DistillationCode1
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent ClassificationCode1
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language UnderstandingCode1
BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation WritingCode1
A Co-Interactive Transformer for Joint Slot Filling and Intent DetectionCode1
Contrastive Learning for Improving ASR Robustness in Spoken Language UnderstandingCode1
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from SpeechCode1
Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain AdaptationCode1
Data Augmentation for Spoken Language Understanding via Pretrained Language ModelsCode1
Cross-lingual Spoken Language Understanding with Regularized Representation AlignmentCode1
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate SectorCode1
RNN Transducer Models For Spoken Language UnderstandingCode1
Adapting Pretrained Transformer to Lattices for Spoken Language UnderstandingCode1
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural SpeechCode1
A Hierarchical Decoding Model For Spoken Language Understanding From Unaligned DataCode1
SlotRefine: A Fast Non-Autoregressive Model for Joint Intent Detection and Slot FillingCode1
Show:102550
← PrevPage 1 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer + AMT, character-based)Accuracy (%)99.8Unverified
2UniverSLUAccuracy (%)99.8Unverified
3Finstreder (Quartznet + AMT)Accuracy (%)99.7Unverified
4textual-kd-sluAccuracy (%)99.7Unverified
5Wav2Vec2.0-ClassifierAccuracy (%)99.7Unverified
6E2E SLP two-stepAccuracy (%)99.7Unverified
7Wav2vec 2.0 SSLAccuracy (%)99.6Unverified
8Finstreder (Conformer)Accuracy (%)99.5Unverified
9AT-ATAccuracy (%)99.5Unverified
10BERT, AC PretrainingAccuracy (%)99.4Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer, character-based)Accuracy (%)89Unverified
2Finstreder (Conformer)Accuracy (%)88Unverified
3AT-ATAccuracy (%)84.9Unverified
4Finstreder (Quartznet)Accuracy (%)84.8Unverified
5SnipsAccuracy (%)84.2Unverified
6GoogleAccuracy (%)79.3Unverified
7Real + syntheticAccuracy (%)71.4Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer, character-based)Accuracy-EN (%)87.9Unverified
2Finstreder (Conformer)Accuracy-EN (%)80.4Unverified
3Finstreder (Quartznet)Accuracy-EN (%)77.6Unverified
4SnipsAccuracy-EN (%)68.7Unverified
5GoogleAccuracy-EN (%)47.8Unverified
#ModelMetricClaimedVerifiedStatus
1ALBERTF1 score77.1Unverified
2SpeechBERTF1 score71.75Unverified
3QANet + GANF1 score63.11Unverified
4BaselineF1 score58.71Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer)Accuracy (%)95.4Unverified
2Finstreder (Quartznet)Accuracy (%)90Unverified
3BaselineAccuracy (%)81.6Unverified