SOTAVerified

Automatic Speech Recognition

Papers

Showing 626650 of 3174 papers

TitleStatusHype
Revisiting Acoustic Features for Robust ASR0
Spelling Correction through Rewriting of Non-Autoregressive ASR Lattices0
Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs0
Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM0
Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error CorrectionCode0
MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder0
A Multimodal Dense Retrieval Approach for Speech-Based Open-Domain Question Answering0
LM-assisted keyword biasing with Aho-Corasick algorithm for Transducer-based ASR0
Large Language Model Should Understand Pinyin for Chinese ASR Error Correction0
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper0
Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection0
Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech RecognitionCode0
Personalized Speech Recognition for Children with Test-Time Adaptation0
Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space0
ASR Benchmarking: Need for a More Representative Conversational DatasetCode0
META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR0
Chain-of-Thought Prompting for Speech Translation0
Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora0
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses0
WER We Stand: Benchmarking Urdu ASR Models0
Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text0
An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems0
Augmenting Automatic Speech Recognition Models with Disfluency Detection0
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models0
SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition0
Show:102550
← PrevPage 26 of 127Next →

No leaderboard results yet.