SOTAVerified

Automatic Speech Recognition

Papers

Showing 251-300 of 3174 papers

Title | Status | Hype
Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning | Code | 1
WaveGuard: Understanding and Mitigating Audio Adversarial Examples | Code | 1
Transformer Language Models with LSTM-based Cross-utterance Information Representation | Code | 1
An Investigation of End-to-End Models for Robust Speech Recognition | Code | 1
Dompteur: Taming Audio Adversarial Examples | Code | 1
BembaSpeech: A Speech Recognition Corpus for the Bemba Language | Code | 1
BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG data | Code | 1
AV Taris: Online Audio-Visual Speech Recognition | Code | 1
metaCAT: A Metadata-based Task-oriented Chatbot Annotation Tool | Code | 1
End-to-End Automatic Speech Recognition for Gujarati | Code | 1
Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients | Code | 1
Improving RNN Transducer Based ASR with Auxiliary Tasks | Code | 1
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR | Code | 1
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation | Code | 1
Punctuation Restoration using Transformer Models for High-and Low-Resource Languages | Code | 1
Joint Masked CPC and CTC Training for ASR | Code | 1
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition | Code | 1
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition | Code | 1
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition | Code | 1
Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview | Code | 1
Representation Learning for Sequence Data with Deep Autoencoding Predictive Components | Code | 1
End-to-End Speech Recognition and Disfluency Removal | Code | 1
KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition | Code | 1
Sum-Product Networks for Robust Automatic Speaker Identification | Code | 1
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings | Code | 1
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR | Code | 1
Word Error Rate Estimation Without ASR Output: e-WER2 | Code | 1
Pretraining Techniques for Sequence-to-Sequence Voice Conversion | Code | 1
Automatic Speech Recognition Benchmark for Air-Traffic Communications | Code | 1
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos | Code | 1
Learning to Count Words in Fluent Speech enables Online Speech Recognition | Code | 1
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition | Code | 1
Adapting End-to-End Speech Recognition for Readable Subtitles | Code | 1
End-to-end Named Entity Recognition from English Speech | Code | 1
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR | Code | 1
Enhancing Monotonic Multihead Attention for Streaming ASR | Code | 1
Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition | Code | 1
Improved Noisy Student Training for Automatic Speech Recognition | Code | 1
CTC-synchronous Training for Monotonic Attention Model | Code | 1
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context | Code | 1
A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applications | Code | 1
ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers | Code | 1
Transformer based Grapheme-to-Phoneme Conversion | Code | 1
Multi-modal Dense Video Captioning | Code | 1
Morfessor EM+Prune: Improved Subword Segmentation with Expectation Maximization and Pruning | Code | 1
Natural Language Processing Advancements By Deep Learning: A Survey | Code | 1
Unsupervised pretraining transfers well across languages | Code | 1
Continuous speech separation: dataset and analysis | Code | 1
Common Voice: A Massively-Multilingual Speech Corpus | Code | 1
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition | Code | 1

No leaderboard results yet.