Joint Masked CPC and CTC Training for ASR Oct 30, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation Learning Oct 27, 2020 Emotion Recognition Representation Learning
Code Code Available 1Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition Oct 26, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition Oct 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition Oct 20, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Towards Resistant Audio Adversarial Examples Oct 14, 2020 Adversarial Attack speech-recognition
Code Code Available 1Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview Oct 14, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling Oct 8, 2020 Speech Recognition text-to-speech
Code Code Available 1Representation Learning for Sequence Data with Deep Autoencoding Predictive Components Oct 7, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Online Neural Networks for Change-Point Detection Oct 3, 2020 Change Point Detection speech-recognition
Code Code Available 1Differentiable Weighted Finite-State Transducers Oct 2, 2020 Handwriting Recognition speech-recognition
Code Code Available 1Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models Oct 1, 2020 Language Modeling Language Modelling
Code Code Available 1End-to-End Speech Recognition and Disfluency Removal Sep 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline Sep 22, 2020 speech-recognition Speech Recognition
Code Code Available 1Consecutive Decoding for Speech-to-text Translation Sep 21, 2020 Decoder Machine Translation
Code Code Available 1KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition Sep 7, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation Sep 6, 2020 Domain Adaptation speech-recognition
Code Code Available 1Any-to-Many Voice Conversion with Location-Relative Sequence-to-Sequence Modeling Sep 6, 2020 feature selection speech-recognition
Code Code Available 1Compiling ONNX Neural Network Models Using MLIR Aug 19, 2020 speech-recognition Speech Recognition
Code Code Available 1Computer-Generated Music for Tabletop Role-Playing Games Aug 16, 2020 speech-recognition Speech Recognition
Code Code Available 1Sum-Product Networks for Robust Automatic Speaker Identification Aug 13, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings Aug 11, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Distilling the Knowledge of BERT for Sequence-to-Sequence ASR Aug 9, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Word Error Rate Estimation Without ASR Output: e-WER2 Aug 8, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Pretraining Techniques for Sequence-to-Sequence Voice Conversion Aug 7, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Online Spatio-Temporal Learning in Deep Neural Networks Jul 24, 2020 Language Modelling speech-recognition
Code Code Available 1CoVoST 2 and Massively Multilingual Speech-to-Text Translation Jul 20, 2020 Machine Translation speech-recognition
Code Code Available 1Automatic Lyrics Transcription using Dilated Convolutional Neural Networks with Self-Attention Jul 13, 2020 Automatic Lyrics Transcription speech-recognition
Code Code Available 1TERA: Self-Supervised Learning of Transformer Encoder Representation for Speech Jul 12, 2020 Keyword Spotting Self-Supervised Learning
Code Code Available 1DARF: A data-reduced FADE version for simulations of speech recognition thresholds with real hearing aids Jul 10, 2020 Sentence speech-recognition
Code Code Available 1AdaScale SGD: A User-Friendly Algorithm for Distributed Training Jul 9, 2020 image-classification Image Classification
Code Code Available 1Unsupervised Cross-lingual Representation Learning for Speech Recognition Jun 24, 2020 Quantization Representation Learning
Code Code Available 1Automatic Speech Recognition Benchmark for Air-Traffic Communications Jun 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1AVLnet: Learning Audio-Visual Language Representations from Instructional Videos Jun 16, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Emotion Recognition in Audio and Video Using Deep Neural Networks Jun 15, 2020 Deep Learning Emotion Recognition
Code Code Available 1Learning to Count Words in Fluent Speech enables Online Speech Recognition Jun 8, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Improved acoustic word embeddings for zero-resource languages using multilingual transfer Jun 2, 2020 speech-recognition Speech Recognition
Code Code Available 1PolyDL: Polyhedral Optimizations for Creation of High Performance DL primitives Jun 2, 2020 speech-recognition Speech Recognition
Code Code Available 1Subword RNNLM Approximations for Out-Of-Vocabulary Keyword Search May 28, 2020 speech-recognition Speech Recognition
Code Code Available 1On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition May 28, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Adapting End-to-End Speech Recognition for Readable Subtitles May 25, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1End-to-end Named Entity Recognition from English Speech May 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR May 20, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition May 20, 2020 speech-recognition Speech Recognition
Code Code Available 1Improved Noisy Student Training for Automatic Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1GEV Beamforming Supported by DOA-based Masks Generated on Pairs of Microphones May 19, 2020 speech-recognition Speech Recognition
Code Code Available 1Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition May 19, 2020 Audio-Visual Speech Recognition speech-recognition
Code Code Available 1Enhancing Monotonic Multihead Attention for Streaming ASR May 19, 2020 All Automatic Speech Recognition
Code Code Available 1Speech Recognition and Multi-Speaker Diarization of Long Conversations May 16, 2020 Data Augmentation speaker-diarization
Code Code Available 1