SOTAVerified

Automatic Speech Recognition

Papers

Showing 351375 of 3174 papers

TitleStatusHype
LM-assisted keyword biasing with Aho-Corasick algorithm for Transducer-based ASR0
A Multimodal Dense Retrieval Approach for Speech-Based Open-Domain Question Answering0
Personalized Speech Recognition for Children with Test-Time Adaptation0
Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space0
Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech RecognitionCode0
META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR0
Large Language Models are Strong Audio-Visual Speech Recognition LearnersCode2
ASR Benchmarking: Need for a More Representative Conversational DatasetCode0
WER We Stand: Benchmarking Urdu ASR Models0
Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text0
Chain-of-Thought Prompting for Speech Translation0
Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora0
M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses0
A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models0
SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition0
An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems0
Augmenting Automatic Speech Recognition Models with Disfluency Detection0
Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition0
ASR Error Correction using Large Language Models0
CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments0
Learnings from curating a trustworthy, well-annotated, and useful dataset of disordered English speech0
Exploring SSL Discrete Tokens for Multilingual ASR0
Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages0
LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation0
Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy?0
Show:102550
← PrevPage 15 of 127Next →

No leaderboard results yet.