SOTAVerified

Spoken Language Understanding

Papers

Showing 125 of 550 papers

TitleStatusHype
MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning BenchmarkCode7
"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language UnderstandingCode0
ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs0
Exploring the Effect of Segmentation and Vocabulary Size on Speech Tokenization for Speech Language Models0
"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language UnderstandingCode1
QUADS: QUAntized Distillation Framework for Efficient Speech Language UnderstandingCode0
Spoken Language Understanding on Unseen Tasks With In-Context Learning0
LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live StreamsCode1
Measuring the Effect of Transcription Noise on Downstream Language Understanding TasksCode0
Joint Automatic Speech Recognition And Structure Learning For Better Speech UnderstandingCode0
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language UnderstandingCode0
Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer0
An Overview and Discussion of the Suitability of Existing Speech Datasets to Train Machine Learning Models for Collective Problem Solving0
RETQA: A Large-Scale Open-Domain Tabular Question Answering Dataset for Real Estate SectorCode1
A Survey on Speech Large Language Models0
Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding0
SyllableLM: Learning Coarse Semantic Units for Speech Language ModelsCode2
Speech Recognition Rescoring with Large Speech-Text Foundation Models0
MultiMed: Multilingual Medical Speech Recognition via Attention Encoder DecoderCode0
Increasing faithfulness in human-human dialog summarization with Spoken Language Understanding tasks0
Clean Label Attacks against SLU Systems0
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding0
Speech-MASSIVE: A Multilingual Speech Dataset for SLU and BeyondCode1
Out-of-distribution generalisation in spoken language understandingCode0
Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian DialectCode0
Show:102550
← PrevPage 1 of 22Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer + AMT, character-based)Accuracy (%)99.8Unverified
2UniverSLUAccuracy (%)99.8Unverified
3E2E SLP two-stepAccuracy (%)99.7Unverified
4textual-kd-sluAccuracy (%)99.7Unverified
5Wav2Vec2.0-ClassifierAccuracy (%)99.7Unverified
6Finstreder (Quartznet + AMT)Accuracy (%)99.7Unverified
7Wav2vec 2.0 SSLAccuracy (%)99.6Unverified
8Finstreder (Conformer)Accuracy (%)99.5Unverified
9AT-ATAccuracy (%)99.5Unverified
10BERT, AC PretrainingAccuracy (%)99.4Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer, character-based)Accuracy (%)89Unverified
2Finstreder (Conformer)Accuracy (%)88Unverified
3AT-ATAccuracy (%)84.9Unverified
4Finstreder (Quartznet)Accuracy (%)84.8Unverified
5SnipsAccuracy (%)84.2Unverified
6GoogleAccuracy (%)79.3Unverified
7Real + syntheticAccuracy (%)71.4Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer, character-based)Accuracy-EN (%)87.9Unverified
2Finstreder (Conformer)Accuracy-EN (%)80.4Unverified
3Finstreder (Quartznet)Accuracy-EN (%)77.6Unverified
4SnipsAccuracy-EN (%)68.7Unverified
5GoogleAccuracy-EN (%)47.8Unverified
#ModelMetricClaimedVerifiedStatus
1ALBERTF1 score77.1Unverified
2SpeechBERTF1 score71.75Unverified
3QANet + GANF1 score63.11Unverified
4BaselineF1 score58.71Unverified
#ModelMetricClaimedVerifiedStatus
1Finstreder (Conformer)Accuracy (%)95.4Unverified
2Finstreder (Quartznet)Accuracy (%)90Unverified
3BaselineAccuracy (%)81.6Unverified