SOTAVerified

Automatic Speech Recognition

Papers

Showing 76100 of 3174 papers

TitleStatusHype
Robust fine-tuning of speech recognition models via model merging: application to disordered speech0
Beyond Manual Transcripts: The Potential of Automated Speech Recognition Errors in Improving Alzheimer's Disease Detection0
Continuous Learning for Children's ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence0
Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition0
KIT's Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization0
Exploring Generative Error Correction for Dysarthric Speech RecognitionCode0
CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASRCode0
VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining0
LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic ContextCode0
CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-trainingCode11
Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across ModalitiesCode1
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game UnderstandingCode0
An Effective Training Framework for Light-Weight Automatic Speech Recognition Models0
Large Language Models based ASR Error Correction for Child Conversations0
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech RecognitionCode1
Word Level Timestamp Generation for Automatic Speech Recognition and TranslationCode0
From Weak Labels to Strong Results: Utilizing 5,000 Hours of Noisy Classroom Transcripts with Minimal Accurate Data0
Impact of Frame Rates on Speech Tokenizer: A Case Study on Mandarin and English0
In-Context Learning Boosts Speech Recognition via Human-like Adaptation to Speakers and Language Varieties0
Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource LanguagesCode0
PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech DialogsCode0
Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down0
KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 20250
Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR0
ASR-FAIRBENCH: Measuring and Benchmarking Equity Across Speech Recognition Systems0
Show:102550
← PrevPage 4 of 127Next →

No leaderboard results yet.