SOTAVerified

Automatic Speech Recognition

Papers

Showing 301350 of 3174 papers

TitleStatusHype
End-to-End Speech Recognition and Disfluency RemovalCode1
End-to-End Speech Recognition from Federated Acoustic ModelsCode1
Audio-Visual Efficient Conformer for Robust Speech RecognitionCode1
ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of KaldiCode1
Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling InsightsCode1
Fast Development of ASR in African Languages using Self Supervised Speech Representation LearningCode1
CB-Conformer: Contextual biasing Conformer for biased word recognitionCode1
Improving Self-supervised Pre-training using Accent-Specific CodebooksCode1
FlanEC: Exploring Flan-T5 for Post-ASR Error CorrectionCode1
A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-SupervisionCode1
A context-aware knowledge transferring strategy for CTC-based ASRCode1
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASRCode1
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech RecognitionCode1
How2: A Large-scale Dataset for Multimodal Language UnderstandingCode1
A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applicationsCode1
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language ModelsCode1
An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems0
An Effective Training Framework for Light-Weight Automatic Speech Recognition Models0
A Deep Generative Acoustic Model for Compositional Automatic Speech Recognition0
An Effective, Performant Named Entity Recognition System for Noisy Business Telephone Conversation Transcripts0
An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement0
4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders0
A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis0
An Effective End-to-End Modeling Approach for Mispronunciation Detection0
An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features0
Transformer-based Cascaded Multimodal Speech Translation0
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition0
An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution0
A bandit approach to curriculum generation for automatic speech recognition0
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems0
A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR0
Adaptive Sparse and Monotonic Attention for Transformer-based Automatic Speech Recognition0
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering0
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings0
Anatomy of Industrial Scale Multilingual ASR0
An ASR-free Fluency Scoring Approach with Self-Supervised Learning0
Adaptive Multi-Corpora Language Model Training for Speech Recognition0
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation0
An ASR-Based Tutor for Learning to Read: How to Optimize Feedback to First Graders0
An Approach to Improve Robustness of NLP Systems against ASR Errors0
Adaptive Frequency Cepstral Coefficients for Word Mispronunciation Detection0
An analysis of incorporating an external language model into a sequence-to-sequence model0
An analysis of degenerating speech due to progressive dysarthria on ASR performance0
Adaptive Axonal Delays in feedforward spiking neural networks for accurate spoken word recognition0
ATCSpeech: a multilingual pilot-controller speech corpus from real Air Traffic Control environment0
Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions0
Analyzing the Quality and Stability of a Streaming End-to-End On-Device Speech Recognizer0
Adaptive Activation Network For Low Resource Multilingual Speech Recognition0
Analyzing the Performance of Automatic Speech Recognition for Ageing Voice: Does it Correlate with Dependency Level?0
Analyzing the Importance of Blank for CTC-Based Knowledge Distillation0
Show:102550
← PrevPage 7 of 64Next →

No leaderboard results yet.