SOTAVerified

Automatic Speech Recognition

Papers

Showing 376400 of 3174 papers

TitleStatusHype
Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC0
Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction0
Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization0
Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit PoetryCode0
Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation0
BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation SystemCode0
Prompting Whisper for Improved Verbatim Transcription and End-to-end Miscue Detection0
NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding0
Advancing Hearing Assessment: An ASR-Based Frequency-Specific Speech Test for Diagnosing Presbycusis0
PSRB: A Comprehensive Benchmark for Evaluating Persian ASR Systems0
Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use0
Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation0
Beyond Manual Transcripts: The Potential of Automated Speech Recognition Errors in Improving Alzheimer's Disease Detection0
KIT's Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization0
Continuous Learning for Children's ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence0
Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition0
Robust fine-tuning of speech recognition models via model merging: application to disordered speech0
In-context Language Learning for Endangered Languages in Speech Recognition0
The NaijaVoices Dataset: Cultivating Large-Scale, High-Quality, Culturally-Rich Speech Data for African Languages0
Exploring Generative Error Correction for Dysarthric Speech RecognitionCode0
CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASRCode0
LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic ContextCode0
VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining0
An Effective Training Framework for Light-Weight Automatic Speech Recognition Models0
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game UnderstandingCode0
Show:102550
← PrevPage 16 of 127Next →

No leaderboard results yet.