SOTAVerified

Automatic Speech Recognition

Papers

Showing 801–825 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets | | 0 |
| Towards Unsupervised Speech Recognition Without Pronunciation Models | Code | 0 |
| Transformer-based Model for ASR N-Best Rescoring and Rewriting | | 0 |
| Tag and correct: high precision post-editing approach to correction of speech recognition errors | | 0 |
| Reading Miscue Detection in Primary School through Automatic Speech Recognition | | 0 |
| AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection | | 0 |
| Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter | | 0 |
| ASTRA: Aligning Speech and Text Representations for Asr without Sampling | | 0 |
| MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations | | 0 |
| Pitch-Aware RNN-T for Mandarin Chinese Mispronunciation Detection and Diagnosis | | 0 |
| LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR | | 0 |
| Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores | | 0 |
| To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation | Code | 0 |
| Flexible Multichannel Speech Enhancement for Noise-Robust Frontend | | 0 |
| Hypernetworks for Personalizing ASR to Atypical Speech | | 0 |
| Joint Beam Search Integrating CTC, Attention, and Transducer Decoders | | 0 |
| Text Injection for Neural Contextual Biasing | | 0 |
| Enhancing CTC-based speech recognition with diverse modeling units | | 0 |
| Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition | | 0 |
| Error-preserving Automatic Speech Recognition of Young English Learners' Language | Code | 0 |
| Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping | | 0 |
| Keyword-Guided Adaptation of Automatic Speech Recognition | | 0 |
| Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision | | 0 |
| Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach | | 0 |
| Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning | | 0 |

No leaderboard results yet.