How2: A Large-scale Dataset for Multimodal Language Understanding Nov 1, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation May 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition Apr 17, 2020 Audio-Visual Speech Recognition speech-recognition
Code Code Available 15 GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition Nov 8, 2023 CPU Decoder
Code Code Available 15 Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis May 9, 2022 Deep Learning Semantic Communication
Code Code Available 15 MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth Information Jun 4, 2023 Audio-Visual Speech Recognition speech-recognition
Code Code Available 15 Deep Speech: Scaling up end-to-end speech recognition Dec 17, 2014 Accented Speech Recognition Speech Recognition
Code Code Available 15 Deep Speech 2: End-to-End Speech Recognition in English and Mandarin Dec 8, 2015 Accented Speech Recognition Noisy Speech Recognition
Code Code Available 15 metaCAT: A Metadata-based Task-oriented Chatbot Annotation Tool Dec 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Deep transfer operator learning for partial differential equations under conditional shift Apr 20, 2022 Domain Adaptation Operator learning
Code Code Available 15 Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text Apr 3, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models Dec 5, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Adaptation of Whisper models to child speech recognition Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition Aug 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition Feb 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Advancing Test-Time Adaptation in Wild Acoustic Test Settings Oct 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention model for articulatory features detection Jul 2, 2019 Manner Of Articulation Detection model
Code Code Available 15 Graph Convolutions Enrich the Self-Attention in Transformers! Dec 7, 2023 Clone Detection
Code Code Available 15 Attention-Based Models for Speech Recognition Jun 24, 2015 Machine Translation Phoneme Recognition
Code Code Available 15 A context-aware knowledge transferring strategy for CTC-based ASR Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Differentiable Weighted Finite-State Transducers Oct 2, 2020 Handwriting Recognition speech-recognition
Code Code Available 15 GPU-accelerated Guided Source Separation for Meeting Transcription Dec 10, 2022 blind source separation CPU
Code Code Available 15 Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC Sep 19, 2024 Disentanglement speech-recognition
Code Code Available 15 A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks Oct 7, 2016 Anomaly Detection Automatic Speech Recognition
Code Code Available 15 Discriminative Multi-modality Speech Recognition May 12, 2020 Audio-Visual Speech Recognition Lipreading
Code Code Available 15 Attention-based Contextual Language Model Adaptation for Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Distilling a Pretrained Language Model to a Multilingual ASR Model Jun 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Disfluency Detection from Untranscribed Speech Nov 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Distilling the Knowledge of BERT for Sequence-to-Sequence ASR Aug 9, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 GPU-Accelerated Viterbi Exact Lattice Decoder for Batched Online and Offline Speech Recognition Oct 22, 2019 CPU Decoder
Code Code Available 15 DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays Nov 3, 2020 Diversity Noise Estimation
Code Code Available 15 HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions Sep 18, 2022 object-detection Object Detection
Code Code Available 15 Multi-task self-supervised learning for Robust Speech Recognition Jan 25, 2020 Robust Speech Recognition Self-Supervised Learning
Code Code Available 15 Do VSR Models Generalize Beyond LRS3? Nov 23, 2023 Lip Reading speech-recognition
Code Code Available 15 Dompteur: Taming Audio Adversarial Examples Feb 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DOVER: A Method for Combining Diarization Outputs Sep 17, 2019 speech-recognition Speech Recognition
Code Code Available 15 Natural Language Processing Advancements By Deep Learning: A Survey Mar 2, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Adapting Pretrained Transformer to Lattices for Spoken Language Understanding Nov 2, 2020 Natural Language Understanding speech-recognition
Code Code Available 15 Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation Nov 2, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering Mar 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units Jun 14, 2021 Clustering Language Modelling
Code Code Available 15 Improving Transformer-based Speech Recognition Using Unsupervised Pre-training Oct 22, 2019 speech-recognition Speech Recognition
Code Code Available 15 Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling Oct 8, 2020 Speech Recognition text-to-speech
Code Code Available 15 Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM Sep 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition May 28, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15