End-to-End Speech Recognition from Federated Acoustic Models Apr 29, 2021 2k 4k
Using Radio Archives for Low-Resource Speech Recognition: Towards an Intelligent Virtual Assistant for Illiterate Users Apr 27, 2021 Language Identification Representation Learning
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech Apr 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Toolbox for Construction and Analysis of Speech Datasets Apr 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
RNN Transducer Models For Spoken Language Understanding Apr 8, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings Apr 8, 2021 Emotion Recognition Speech Emotion Recognition
Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible Inputs Apr 7, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Librispeech Transducer Model with Internal Language Model Prior Correction Apr 7, 2021 Language Modeling Language Modelling
Learning to Rank Microphones for Distant Speech Recognition Apr 6, 2021 channel selection Decoder
ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi Apr 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Keyword Transformer: A Self-Attention Model for Keyword Spotting Apr 1, 2021 Keyword Spotting Speech Recognition
Multilingual and code-switching ASR challenges for low resource Indian languages Apr 1, 2021 Automatic Speech Recognition (ASR) Sentence
Integer-only Zero-shot Quantization for Efficient Speech Recognition Mar 31, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
MediaSpeech: Multilanguage ASR Benchmark and Dataset Mar 30, 2021 speech-recognition Speech Recognition
Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays Mar 28, 2021 speech-recognition Speech Recognition
Leveraging pre-trained representations to improve access to untranscribed speech from endangered languages Mar 26, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Radically Old Way of Computing Spectra: Applications in End-to-End ASR Mar 25, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning Mar 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges Mar 8, 2021 Autonomous Vehicles image-classification
WaveGuard: Understanding and Mitigating Audio Adversarial Examples Mar 4, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
End-to-end Audio-visual Speech Recognition with Conformers Feb 12, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Transformer Language Models with LSTM-based Cross-utterance Information Representation Feb 12, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
An Investigation of End-to-End Models for Robust Speech Recognition Feb 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Dompteur: Taming Audio Adversarial Examples Feb 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
BembaSpeech: A Speech Recognition Corpus for the Bemba Language Feb 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG data Jan 28, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data Jan 19, 2021 Multi-Task Learning Representation Learning
Learning Efficient Representations for Keyword Spotting with Triplet Loss Jan 12, 2021 Classification Keyword Spotting
VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation Jan 2, 2021 Representation Learning speech-recognition
Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps Dec 29, 2020 All image-classification
Scalable Optical Learning Operator Dec 22, 2020 speech-recognition Speech Recognition
Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection Dec 14, 2020 DeepFake Detection Lipreading
AV Taris: Online Audio-Visual Speech Recognition Dec 14, 2020 Action Detection Activity Detection
DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization Dec 11, 2020 Diversity Quantization
Unified Streaming and Non-streaming Two-pass End-to-end Model for Speech Recognition Dec 10, 2020 Decoder Sentence
SpeakingFaces: A Large-Scale Multimodal Dataset of Voice Commands with Visual and Thermal Video Streams Dec 5, 2020 speech-recognition Speech Recognition
metaCAT: A Metadata-based Task-oriented Chatbot Annotation Tool Dec 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
End-to-End Automatic Speech Recognition for Gujarati Dec 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Deep Discriminative Feature Learning for Accent Recognition Nov 25, 2020 Face Recognition Speaker Identification
Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency Nov 16, 2020 Compressive Sensing Edge-computing
Learn an Effective Lip Reading Model without Pains Nov 15, 2020 Lipreading Lip Reading
Text Augmentation for Language Models in High Error Recognition Scenario Nov 11, 2020 Data Augmentation speech-recognition
Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients Nov 11, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Nanopore Base Calling on the Edge Nov 9, 2020 speech-recognition Speech Recognition
Improving RNN Transducer Based ASR with Auxiliary Tasks Nov 5, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays Nov 3, 2020 Diversity Noise Estimation
Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR Nov 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation Nov 2, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Adapting Pretrained Transformer to Lattices for Spoken Language Understanding Nov 2, 2020 Natural Language Understanding speech-recognition
Punctuation Restoration using Transformer Models for High-and Low-Resource Languages Nov 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)