SOTAVerified

Automatic Speech Recognition

Papers

Showing 176–200 of 3174 papers

Title | Status | Hype
CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition | | 0
A Differentiable Alignment Framework for Sequence-to-Sequence Modeling via Optimal Transport | | 0
Data-Driven Mispronunciation Pattern Discovery for Robust Speech Recognition | | 0
Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language | Code | 1
When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation | | 0
Language Bias in Self-Supervised Learning For Automatic Speech Recognition | | 0
SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions | | 0
Cross-lingual Embedding Clustering for Hierarchical Softmax in Low-Resource Multilingual Speech Recognition | | 0
Classification Error Bound for Low Bayes Error Conditions in Machine Learning | | 0
SEAL: Speech Embedding Alignment Learning for Speech Large Language Model with Retrieval-Augmented Generation | | 0
The Multicultural Medical Assistant: Can LLMs Improve Medical ASR Errors Across Borders? | | 0
Speech Translation Refinement using Large Language Models | Code | 0
FireRedASR: Open-Source Industrial-Grade Mandarin Speech Recognition Models from Encoder-Decoder to LLM Integration | Code | 5
LoCoML: A Framework for Real-World ML Inference Pipelines | | 0
Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing | | 0
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction | Code | 1
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions | Code | 0
Investigation of Whisper ASR Hallucinations Induced by Non-Speech Audio | | 0
Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges | | 0
GEC-RAG: Improving Generative Error Correction via Retrieval-Augmented Generation for Automatic Speech Recognition Systems | | 0
A Benchmark of French ASR Systems Based on Error Severity | | 0
Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR | | 0
Automatic Speech Recognition for Sanskrit with Transfer Learning | | 0
Delayed Fusion: Integrating Large Language Models into First-Pass Decoding in End-to-end Speech Recognition | | 0
PIER: A Novel Metric for Evaluating What Matters in Code-Switching | Code | 0
Page 8 of 127

No leaderboard results yet.