SOTAVerified

Spoken Language Understanding

Papers

Showing 51–100 of 550 papers

Title | Status | Hype
Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation | Code | 1
A Hierarchical Decoding Model For Spoken Language Understanding From Unaligned Data | Code | 1
Spoken Language Understanding on the Edge | Code | 1
Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces | Code | 1
Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension | Code | 1
"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding | Code | 0
ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs | - | 0
Exploring the Effect of Segmentation and Vocabulary Size on Speech Tokenization for Speech Language Models | - | 0
QUADS: QUAntized Distillation Framework for Efficient Speech Language Understanding | Code | 0
Spoken Language Understanding on Unseen Tasks With In-Context Learning | - | 0
Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks | Code | 0
Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding | Code | 0
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Code | 0
Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer | - | 0
An Overview and Discussion of the Suitability of Existing Speech Datasets to Train Machine Learning Models for Collective Problem Solving | - | 0
A Survey on Speech Large Language Models | - | 0
Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding | - | 0
Speech Recognition Rescoring with Large Speech-Text Foundation Models | - | 0
MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder | Code | 0
Increasing faithfulness in human-human dialog summarization with Spoken Language Understanding tasks | - | 0
Clean Label Attacks against SLU Systems | - | 0
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding | - | 0
Out-of-distribution generalisation in spoken language understanding | Code | 0
Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian Dialect | Code | 0
Prompting Whisper for QA-driven Zero-shot End-to-end Spoken Language Understanding | - | 0
A Contrastive Learning Approach to Mitigate Bias in Speech Models | Code | 0
Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model | - | 0
A dual task learning approach to fine-tune a multilingual semantic speech encoder for Spoken Language Understanding | - | 0
CroPrompt: Cross-task Interactive Prompting for Zero-shot Spoken Language Understanding | - | 0
On the Evaluation of Speech Foundation Models for Spoken Language Understanding | - | 0
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding | - | 0
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding | - | 0
Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning | - | 0
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | Code | 0
MSNER: A Multilingual Speech Dataset for Named Entity Recognition | - | 0
Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants | - | 0
HC^2L: Hybrid and Cooperative Contrastive Learning for Cross-lingual Spoken Language Understanding | - | 0
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training | Code | 0
Modeling Output-Level Task Relatedness in Multi-Task Learning with Feedback Mechanism | - | 0
New Semantic Task for the French Spoken Language Understanding MEDIA Benchmark | Code | 0
Uni-MIS: United Multiple Intent Spoken Language Understanding via Multi-View Intent-Slot Interaction | Code | 0
Privacy-Preserving End-to-End Spoken Language Understanding | - | 0
Do Large Language Model Understand Multi-Intent Spoken Language ? | Code | 0
What has LeBenchmark Learnt about French Syntax? | - | 0
A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames | Code | 0
Evaluating and Improving Continual Learning in Spoken Language Understanding | - | 0
The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese | - | 0
Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model | - | 0
Pro-HAN: A Heterogeneous Graph Attention Network for Profile-Based Spoken Language Understanding | Code | 0
Learning Semantic Information from Raw Audio Signal Using Both Contextual and Phonetic Representations | - | 0
Page 2 of 11

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Finstreder (Conformer + AMT, character-based) | Accuracy (%) | 99.8 | - | Unverified
2 | UniverSLU | Accuracy (%) | 99.8 | - | Unverified
3 | Finstreder (Quartznet + AMT) | Accuracy (%) | 99.7 | - | Unverified
4 | textual-kd-slu | Accuracy (%) | 99.7 | - | Unverified
5 | Wav2Vec2.0-Classifier | Accuracy (%) | 99.7 | - | Unverified
6 | E2E SLP two-step | Accuracy (%) | 99.7 | - | Unverified
7 | Wav2vec 2.0 SSL | Accuracy (%) | 99.6 | - | Unverified
8 | Finstreder (Conformer) | Accuracy (%) | 99.5 | - | Unverified
9 | AT-AT | Accuracy (%) | 99.5 | - | Unverified
10 | BERT, AC Pretraining | Accuracy (%) | 99.4 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Finstreder (Conformer, character-based) | Accuracy (%) | 89 | - | Unverified
2 | Finstreder (Conformer) | Accuracy (%) | 88 | - | Unverified
3 | AT-AT | Accuracy (%) | 84.9 | - | Unverified
4 | Finstreder (Quartznet) | Accuracy (%) | 84.8 | - | Unverified
5 | Snips | Accuracy (%) | 84.2 | - | Unverified
6 | Google | Accuracy (%) | 79.3 | - | Unverified
7 | Real + synthetic | Accuracy (%) | 71.4 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Finstreder (Conformer, character-based) | Accuracy-EN (%) | 87.9 | - | Unverified
2 | Finstreder (Conformer) | Accuracy-EN (%) | 80.4 | - | Unverified
3 | Finstreder (Quartznet) | Accuracy-EN (%) | 77.6 | - | Unverified
4 | Snips | Accuracy-EN (%) | 68.7 | - | Unverified
5 | Google | Accuracy-EN (%) | 47.8 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ALBERT | F1 score | 77.1 | - | Unverified
2 | SpeechBERT | F1 score | 71.75 | - | Unverified
3 | QANet + GAN | F1 score | 63.11 | - | Unverified
4 | Baseline | F1 score | 58.71 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Finstreder (Conformer) | Accuracy (%) | 95.4 | - | Unverified
2 | Finstreder (Quartznet) | Accuracy (%) | 90 | - | Unverified
3 | Baseline | Accuracy (%) | 81.6 | - | Unverified