SOTAVerified

Automatic Speech Recognition

Papers

Showing 276300 of 3174 papers

TitleStatusHype
Inter-linguistic Phonetic Composition (IPC): A Theoretical and Computational Approach to Enhance Second Language Pronunciation0
XLSR-Mamba: A Dual-Column Bidirectional State Space Model for Spoofing Attack DetectionCode1
Everyone deserves their voice to be heard: Analyzing Predictive Gender Bias in ASR Models Applied to Dutch Speech Data0
Transferable Adversarial Attacks against ASR0
DCF-DS: Deep Cascade Fusion of Diarization and Separation for Speech Recognition under Realistic Single-Channel Conditions0
CTC-Assisted LLM-Based Contextual ASR0
Dialectal Coverage And Generalization in Arabic Speech RecognitionCode2
Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages0
Enhancing AAC Software for Dysarthric Speakers in e-Health Settings: An Evaluation Using TORGO0
Augmenting Polish Automatic Speech Recognition System With Synthetic Data0
Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising0
Joint Beamforming and Speaker-Attributed ASR for Real Distant-Microphone Meeting Transcription0
Asynchronous Tool Usage for Real-Time Agents0
Improving Speech-based Emotion Recognition with Contextual Utterance Analysis and LLMs0
emg2qwerty: A Large Dataset with Baselines for Touch Typing using Surface ElectromyographyCode2
A Survey on Speech Large Language Models0
Evaluating and Improving Automatic Speech Recognition Systems for Korean Meteorological Experts0
ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams0
Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models0
DENOASR: Debiasing ASRs through Selective Denoising0
VoiceBench: Benchmarking LLM-Based Voice AssistantsCode3
Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap0
Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding0
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation0
End-to-End Transformer-based Automatic Speech Recognition for Northern Kurdish: A Pioneering Approach0
Show:102550
← PrevPage 12 of 127Next →

No leaderboard results yet.