SOTAVerified

Automatic Speech Recognition

Papers

Showing 11511200 of 3174 papers

TitleStatusHype
Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation0
Entity Linking for Spoken Language0
Entity resolution for noisy ASR transcripts0
Environment-aware Reconfigurable Noise Suppression0
E-PUR: An Energy-Efficient Processing Unit for Recurrent Neural Networks0
Equivalence of Segmental and Neural Transducer Modeling: A Proof of Concept0
Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization0
Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition0
Error Correction in ASR using Sequence-to-Sequence Models0
Error Detection in Automatic Speech Recognition0
Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages0
Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass0
Blending LSTMs into CNNs0
ESPnet-se: end-to-end speech enhancement and separation toolkit designed for asr integration0
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding0
ESPnet-ST: All-in-One Speech Translation Toolkit0
Breaking Walls: Pioneering Automatic Speech Recognition for Central Kurdish: End-to-End Transformer Paradigm0
A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition0
Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks0
Etude de la performance des modèles acoustiques pour des voix de personnes âgées en vue de l'adaptation des systèmes de RAP (Assessment of the acoustic models performance in the ageing voice case for ASR system adaptation) [in French]0
Advancing Arabic Speech Recognition Through Large-Scale Weakly Supervised Learning0
EURO: ESPnet Unsupervised ASR Open-source Toolkit0
Euronews: a multilingual speech corpus for ASR0
Europarl-ST: A Multilingual Corpus For Speech Translation Of Parliamentary Debates0
Evaluating and Improving Automatic Speech Recognition Systems for Korean Meteorological Experts0
Evaluating and Improving Child-Directed Automatic Speech Recognition0
Evaluating and reducing the distance between synthetic and real speech distributions0
Evaluating ASR Confidence Scores for Automated Error Detection in User-Assisted Correction Interfaces0
Evaluating Automatic Speech Recognition Systems in Comparison With Human Perception Results Using Distinctive Feature Measures0
Evaluating Automatic Speech Recognition Quality and Its Impact on Counselor Utterance Coding0
Evaluating Automatic Speech Recognition in Translation0
Evaluating Automatic Speech Recognition in an Incremental Setting0
Enhancing AAC Software for Dysarthric Speakers in e-Health Settings: An Evaluation Using TORGO0
Evaluating Standard and Dialectal Frisian ASR: Multilingual Fine-tuning and Language Identification for Improved Low-resource Performance0
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric0
Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition0
Enhancements in statistical spoken language translation by de-normalization of ASR results0
Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective0
Evaluation of Automatic Speech Recognition for Conversational Speech in Dutch, English and German: What Goes Missing?0
BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators0
Evaluation of Off-the-shelf Speech Recognizers Across Diverse Dialogue Domains0
Evaluation of Off-the-shelf Speech Recognizers on Different Accents in a Dialogue Domain0
Evaluation of real-time transcriptions using end-to-end ASR models0
Evaluation of Speaker Anonymization on Emotional Speech0
Everyone deserves their voice to be heard: Analyzing Predictive Gender Bias in ASR Models Applied to Dutch Speech Data0
Everything Can Be Described in Words: A Simple Unified Multi-Modal Framework with Semantic and Temporal Alignment0
Evolutionary optimization of contexts for phonetic correction in speech recognition systems0
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 20240
Enhancement and Recognition of Reverberant and Noisy Speech by Extending Its Coherence0
Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization0
Show:102550
← PrevPage 24 of 64Next →

No leaderboard results yet.