DOVER: A Method for Combining Diarization Outputs Sep 17, 2019 speech-recognition Speech Recognition
Code Code Available 15 BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications Oct 12, 2021 Action Detection Activity Detection
Code Code Available 15 AVLnet: Learning Audio-Visual Language Representations from Instructional Videos Jun 16, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Dompteur: Taming Audio Adversarial Examples Feb 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Do VSR Models Generalize Beyond LRS3? Nov 23, 2023 Lip Reading speech-recognition
Code Code Available 15 DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering Mar 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning Oct 17, 2024 Representation Learning Self-Supervised Learning
Code Code Available 15 Automatic Speech Recognition Benchmark for Air-Traffic Communications Jun 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition for Speech Assessment of Persian Preschool Children Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model Jun 2, 2023 speech-recognition Speech Recognition
Code Code Available 15 Automatic Lyrics Transcription using Dilated Convolutional Neural Networks with Self-Attention Jul 13, 2020 Automatic Lyrics Transcription speech-recognition
Code Code Available 15 Automatic Disfluency Detection from Untranscribed Speech Nov 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 dMel: Speech Tokenization made Simple Jul 22, 2024 Decoder Language Modeling
Code Code Available 15 A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks Oct 7, 2016 Anomaly Detection Automatic Speech Recognition
Code Code Available 15 Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet Jun 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AutoDiCE: Fully Automated Distributed CNN Inference at the Edge Jul 20, 2022 Code Generation image-classification
Code Code Available 15 DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays Nov 3, 2020 Diversity Noise Estimation
Code Code Available 15 Distilling a Pretrained Language Model to a Multilingual ASR Model Jun 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC Sep 19, 2024 Disentanglement speech-recognition
Code Code Available 15 Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 Discriminative Multi-modality Speech Recognition May 12, 2020 Audio-Visual Speech Recognition Lipreading
Code Code Available 15 Distilling the Knowledge of BERT for Sequence-to-Sequence ASR Aug 9, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text Apr 3, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention model for articulatory features detection Jul 2, 2019 Manner Of Articulation Detection model
Code Code Available 15 DiariST: Streaming Speech Translation with Speaker Diarization Sep 14, 2023 speaker-diarization Speaker Diarization
Code Code Available 15 DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition Aug 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-based Contextual Language Model Adaptation for Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DiaCorrect: Error Correction Back-end For Speaker Diarization Sep 15, 2023 Automatic Speech Recognition Decoder
Code Code Available 15 Differentiable Weighted Finite-State Transducers Oct 2, 2020 Handwriting Recognition speech-recognition
Code Code Available 15 Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts Nov 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency Nov 16, 2020 Compressive Sensing Edge-computing
Code Code Available 15 ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition Dec 3, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization Dec 11, 2020 Diversity Quantization
Code Code Available 15 Deep Audio-Visual Speech Recognition Sep 6, 2018 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-Based Models for Speech Recognition Jun 24, 2015 Machine Translation Phoneme Recognition
Code Code Available 15 Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis May 9, 2022 Deep Learning Semantic Communication
Code Code Available 15 A transfer learning based approach for pronunciation scoring Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English Aug 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline Sep 22, 2020 speech-recognition Speech Recognition
Code Code Available 15 A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing Mar 18, 2022 Representation Learning Speaker Verification
Code Code Available 15 Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition Oct 26, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15