SOTAVerified

Automatic Speech Recognition

Papers

Showing 22512300 of 3174 papers

TitleStatusHype
Multimodal Punctuation Prediction with Contextual Dropout0
Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding0
Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-training and Its Application to Children's ASR0
Hybrid phonetic-neural model for correction in speech recognition systemsCode0
NUVA: A Naming Utterance Verifier for Aphasia Treatment0
Sparsification via Compressed Sensing for Automatic Speech Recognition0
Train your classifier first: Cascade Neural Networks Training from upper layers to lower layers0
Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced LanguagesCode0
A bandit approach to curriculum generation for automatic speech recognition0
Intermediate Loss Regularization for CTC-based Speech Recognition0
Multi-Task Self-Supervised Pre-Training for Music Classification0
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR0
Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy0
Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with SubwordsCode0
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition0
Speech Recognition by Simply Fine-tuning BERT0
BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge0
Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yoloxóchitl Mixtec0
Streaming Models for Joint Speech Recognition and Translation0
Exploiting Beam Search Confidence for Energy-Efficient Speech Recognition0
Arabic Speech Recognition by End-to-End, Modular Systems and HumanCode0
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition0
Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications0
An evaluation of word-level confidence estimation for end-to-end automatic speech recognition0
WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm0
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings0
Learning without Forgetting: Task Aware Multitask Learning for Multi-Modality Tasks0
Why Does Decentralized Training Outperform Synchronous Training In The Large Batch Setting?0
NAS-Bench-ASR: Reproducible Neural Architecture Search for Speech Recognition0
Multi-channel Multi-frame ADL-MVDR for Target Speech Separation0
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition0
A Hierarchical Reasoning Graph Neural Network for The Automatic Scoring of Answer Transcriptions in Video Job Interviews0
Adjust-free adversarial example generation in speech recognition using evolutionary multi-objective optimization under black-box condition0
Toward Streaming ASR with Non-Autoregressive Insertion-based Model0
User-friendly automatic transcription of low-resource languages: Plugging ESPnet into Elpis0
Exploring Transfer Learning For End-to-End Spoken Language Understanding0
A review of on-device fully neural end-to-end automatic speech recognition algorithms0
Less Is More: Improved RNN-T Decoding Using Limited Label Context and Path Merging0
Improved Robustness to Disfluencies in RNN-Transducer Based Speech Recognition0
On Knowledge Distillation for Direct Speech Translation0
Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition0
Using multiple ASR hypotheses to boost i18n NLU performance0
MLS: A Large-Scale Multilingual Dataset for Speech ResearchCode0
End to End ASR System with Automatic Punctuation InsertionCode0
The Indigenous Languages Technology project at NRC Canada: An empowerment-oriented approach to developing language software0
Attentively Embracing Noise for Robust Latent Representation in BERTCode0
A Comprehensive Evaluation of Incremental Speech Recognition and Diarization for Conversational AICode0
Multi-task Learning of Spoken Language Understanding by Integrating N-Best Hypotheses with Hierarchical Attention0
Sparse Transcription0
German-Arabic Speech-to-Speech Translation for Psychiatric Diagnosis0
Show:102550
← PrevPage 46 of 64Next →

No leaderboard results yet.