SOTAVerified

Automatic Speech Recognition

Papers

Showing 551–600 of 3174 papers

Title | Status | Hype
Late fusion ensembles for speech recognition on diverse input audio representations | | 0
Empowering the Deaf and Hard of Hearing Community: Enhancing Video Captions Using Large Language Models | | 0
How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario | | 0
AMPS: ASR with Multimodal Paraphrase Supervision | | 0
Aligning Pre-trained Models for Spoken Language Translation | | 0
Continual Learning in Machine Speech Chain Using Gradient Episodic Memory | | 0
MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models | | 0
Disentangled-Transformer: An Explainable End-to-End Automatic Speech Recognition Model with Speech Content-Context Separation | | 0
Towards Maximum Likelihood Training for Transducer-based Streaming Speech Recognition | | 0
k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning | | 0
Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection | | 0
High-precision medical speech recognition through synthetic data and semantic correction: UNITED-MEDASR | | 0
Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering | | 0
Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge | | 0
Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM | | 0
From Statistical Methods to Pre-Trained Models; A Survey on Automatic Speech Recognition for Resource Scarce Urdu Language | | 0
CAFE A Novel Code switching Dataset for Algerian Dialect French and English | | 0
Whisper Finetuning on Nepali Language | | 0
Inter-linguistic Phonetic Composition (IPC): A Theoretical and Computational Approach to Enhance Second Language Pronunciation | | 0
Everyone deserves their voice to be heard: Analyzing Predictive Gender Bias in ASR Models Applied to Dutch Speech Data | | 0
Transferable Adversarial Attacks against ASR | | 0
DCF-DS: Deep Cascade Fusion of Diarization and Separation for Speech Recognition under Realistic Single-Channel Conditions | | 0
CTC-Assisted LLM-Based Contextual ASR | | 0
Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages | | 0
Enhancing AAC Software for Dysarthric Speakers in e-Health Settings: An Evaluation Using TORGO | | 0
Augmenting Polish Automatic Speech Recognition System With Synthetic Data | | 0
Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising | | 0
Joint Beamforming and Speaker-Attributed ASR for Real Distant-Microphone Meeting Transcription | | 0
Asynchronous Tool Usage for Real-Time Agents | | 0
Improving Speech-based Emotion Recognition with Contextual Utterance Analysis and LLMs | | 0
Evaluating and Improving Automatic Speech Recognition Systems for Korean Meteorological Experts | | 0
A Survey on Speech Large Language Models | | 0
ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams | | 0
Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models | | 0
DENOASR: Debiasing ASRs through Selective Denoising | | 0
Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap | | 0
Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding | | 0
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation | | 0
End-to-End Transformer-based Automatic Speech Recognition for Northern Kurdish: A Pioneering Approach | | 0
AC-Mix: Self-Supervised Adaptation for Low-Resource Automatic Speech Recognition using Agnostic Contrastive Mixup | | 0
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation | | 0
Roadmap towards Superhuman Speech Understanding using Large Language Models | | 0
Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR | | 0
Investigation of Speaker Representation for Target-Speaker Speech Processing | | 0
Automatic Speech Recognition with BERT and CTC Transformers: A Review | | 0
Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities | | 0
A two-stage transliteration approach to improve performance of a multilingual ASR | | 0
Advocating Character Error Rate for Multilingual ASR Evaluation | | 0
CR-CTC: Consistency regularization on CTC for improved speech recognition | | 0
Automatic Screening for Children with Speech Disorder using Automatic Speech Recognition: Opportunities and Challenges | | 0

No leaderboard results yet.