SOTAVerified

Automatic Speech Recognition

Papers

Showing 251275 of 3174 papers

TitleStatusHype
Not All Errors Are Equal: Investigation of Speech Recognition Errors in Alzheimer's Disease Detection0
SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASRCode0
Comprehensive Audio Query Handling System with Integrated Expert Models and Contextual Understanding0
ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction0
GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken ChatbotCode7
Late fusion ensembles for speech recognition on diverse input audio representations0
A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario0
Empowering the Deaf and Hard of Hearing Community: Enhancing Video Captions Using Large Language Models0
How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario0
MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models0
Continual Learning in Machine Speech Chain Using Gradient Episodic Memory0
AMPS: ASR with Multimodal Paraphrase Supervision0
Aligning Pre-trained Models for Spoken Language Translation0
Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection0
k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning0
Disentangled-Transformer: An Explainable End-to-End Automatic Speech Recognition Model with Speech Content-Context Separation0
Scaling Speech-Text Pre-training with Synthetic Interleaved DataCode7
Towards Maximum Likelihood Training for Transducer-based Streaming Speech Recognition0
High-precision medical speech recognition through synthetic data and semantic correction: UNITED-MEDASR0
Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering0
Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge0
From Statistical Methods to Pre-Trained Models; A Survey on Automatic Speech Recognition for Resource Scarce Urdu Language0
CAFE A Novel Code switching Dataset for Algerian Dialect French and English0
Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM0
Whisper Finetuning on Nepali Language0
Show:102550
← PrevPage 11 of 127Next →

No leaderboard results yet.