SOTAVerified

Automatic Speech Recognition

Papers

Showing 2201-2250 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy | | 0 |
| Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition | | 0 |
| Speech Recognition by Simply Fine-tuning BERT | | 0 |
| BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge | | 0 |
| BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG data | Code | 1 |
| Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yoloxóchitl Mixtec | | 0 |
| Exploiting Beam Search Confidence for Energy-Efficient Speech Recognition | | 0 |
| Streaming Models for Joint Speech Recognition and Translation | | 0 |
| Arabic Speech Recognition by End-to-End, Modular Systems and Human | Code | 0 |
| Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition | | 0 |
| Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications | | 0 |
| An evaluation of word-level confidence estimation for end-to-end automatic speech recognition | | 0 |
| WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm | | 0 |
| Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings | | 0 |
| Why Does Decentralized Training Outperform Synchronous Training In The Large Batch Setting? | | 0 |
| Learning without Forgetting: Task Aware Multitask Learning for Multi-Modality Tasks | | 0 |
| NAS-Bench-ASR: Reproducible Neural Architecture Search for Speech Recognition | | 0 |
| Multi-channel Multi-frame ADL-MVDR for Target Speech Separation | | 0 |
| A Hierarchical Reasoning Graph Neural Network for The Automatic Scoring of Answer Transcriptions in Video Job Interviews | | 0 |
| Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition | | 0 |
| Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition | | 0 |
| Toward Streaming ASR with Non-Autoregressive Insertion-based Model | | 0 |
| User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis | | 0 |
| Exploring Transfer Learning For End-to-End Spoken Language Understanding | | 0 |
| A review of on-device fully neural end-to-end automatic speech recognition algorithms | | 0 |
| AV Taris: Online Audio-Visual Speech Recognition | Code | 1 |
| Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging | | 0 |
| Improved Robustness to Disfluencies in RNN-Transducer Based Speech Recognition | | 0 |
| On Knowledge Distillation for Direct Speech Translation | | 0 |
| Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition | | 0 |
| Using multiple ASR hypotheses to boost i18n NLU performance | | 0 |
| MLS: A Large-Scale Multilingual Dataset for Speech Research | Code | 0 |
| End to End ASR System with Automatic Punctuation Insertion | Code | 0 |
| ASR for Non-standardised Languages with Dialectal Variation: the case of Swiss German | | 0 |
| Sparse Transcription | | 0 |
| German-Arabic Speech-to-Speech Translation for Psychiatric Diagnosis | | 0 |
| End-to-End Automatic Speech Recognition for Gujarati | Code | 1 |
| On-Device detection of sentence completion for voice assistants with low-memory footprint | | 0 |
| A Comprehensive Evaluation of Incremental Speech Recognition and Diarization for Conversational AI | Code | 0 |
| The Indigenous Languages Technology project at NRC Canada: An empowerment-oriented approach to developing language software | | 0 |
| Multi-task Learning of Spoken Language Understanding by Integrating N-Best Hypotheses with Hierarchical Attention | | 0 |
| Attentively Embracing Noise for Robust Latent Representation in BERT | Code | 0 |
| 100,000 Podcasts: A Spoken English Document Corpus | | 0 |
| metaCAT: A Metadata-based Task-oriented Chatbot Annotation Tool | Code | 1 |
| Transformer-Transducers for Code-Switched Speech Recognition | | 0 |
| Improving accuracy of rare words for RNN-Transducer through unigram shallow fusion | | 0 |
| Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training | | 0 |
| Bootstrap an end-to-end ASR system by multilingual training, transfer learning, text-to-text mapping and synthetic audio | | 0 |
| Adam^+: A Stochastic Method with Adaptive Variance Reduction | | 0 |
| Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems | | 0 |
Page 45 of 64

No leaderboard results yet.