SOTAVerified

Automatic Speech Recognition

Papers

Showing 2351–2400 of 3174 papers

Title | Status | Hype
Multi-view Attention-based Speech Enhancement Model for Noise-robust Automatic Speech Recognition | | 0
Data augmentation using prosody and false starts to recognize non-native children's speech | Code | 0
Learned Transferable Architectures Can Surpass Hand-Designed Architectures for Large Scale Speech Recognition | | 0
Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts | | 0
Improving Tail Performance of a Deliberation E2E ASR Model Using a Large Text Corpus | | 0
Cross-Utterance Language Models with Acoustic Error Sampling | | 0
Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical Study | Code | 0
Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces | | 0
Sum-Product Networks for Robust Automatic Speaker Identification | Code | 1
Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition | | 0
MASRI-HEADSET: A Maltese Corpus for Speech Recognition | | 0
Large-scale Transfer Learning for Low-resource Spoken Language Understanding | | 0
Online Automatic Speech Recognition with Listen, Attend and Spell Model | | 0
Transfer Learning Approaches for Streaming End-to-End Speech Recognition System | | 0
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings | Code | 1
Transformer with Bidirectional Decoder for Speech Recognition | | 0
Subword Regularization: An Analysis of Scalability and Generalization for End-to-End Automatic Speech Recognition | | 0
LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition | | 0
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR | Code | 1
Word Error Rate Estimation Without ASR Output: e-WER2 | Code | 1
Deep Learning Based Dereverberation of Temporal Envelopes for Robust Speech Recognition | | 0
Pretraining Techniques for Sequence-to-Sequence Voice Conversion | Code | 1
Investigation of Speaker-adaptation methods in Transformer based ASR | | 0
Unsupervised Cross-Domain Singing Voice Conversion | | 0
Shouted Speech Compensation for Speaker Verification Robust to Vocal Effort Conditions | | 0
A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition | | 0
Iterative Compression of End-to-End ASR Model using AutoML | | 0
"This is Houston. Say again, please". The Behavox system for the Apollo-11 Fearless Steps Challenge (phase II) | | 0
Weakly Supervised Construction of ASR Systems with Massive Video Data | | 0
Modular End-to-end Automatic Speech Recognition Framework for Acoustic-to-word Model | | 0
Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones | | 0
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability | | 0
Exploiting Cross-Lingual Knowledge in Unsupervised Acoustic Modeling for Low-Resource Languages | | 0
Neural Kalman Filtering for Speech Enhancement | | 0
Effects of Language Relatedness for Cross-lingual Transfer Learning in Character-Based Language Models | | 0
Audio Adversarial Examples for Robust Hybrid CTC/Attention Speech Recognition | | 0
Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks | | 0
Towards an Automated SOAP Note: Classifying Utterances from Medical Conversations | | 0
SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems | | 0
Fast Transformers with Clustered Attention | Code | 2
Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters | | 0
Robust Prediction of Punctuation and Truecasing for Medical ASR | | 0
Deep Graph Random Process for Relational-Thinking-Based Speech Recognition | | 0
CUNI Neural ASR with Phoneme-Level Intermediate Step for Non-Native SLT at IWSLT 2020 | | 0
Large Vocabulary Read Speech Corpora for Four Ethiopian Languages: Amharic, Tigrigna, Oromo, and Wolaytta | | 0
End-to-End Offline Speech Translation System for IWSLT 2020 using Modality Agnostic Meta-Learning | | 0
The AFRL IWSLT 2020 Systems: Work-From-Home Edition | | 0
Towards Understanding ASR Error Correction for Medical Conversations | | 0
Start-Before-End and End-to-End: Neural Speech Translation by AppTek and RWTH Aachen University | | 0
Tigrinya Automatic Speech recognition with Morpheme based recognition units | | 0
Page 48 of 64

No leaderboard results yet.