Distilling the Knowledge of BERT for Sequence-to-Sequence ASR Aug 9, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
dMel: Speech Tokenization made Simple Jul 22, 2024 Decoder Language Modeling
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation Nov 2, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning Oct 17, 2024 Representation Learning Self-Supervised Learning
DiariST: Streaming Speech Translation with Speaker Diarization Sep 14, 2023 speaker-diarization Speaker Diarization
DiaCorrect: Error Correction Back-end For Speaker Diarization Sep 15, 2023 Automatic Speech Recognition Decoder
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition Aug 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Differentiable Weighted Finite-State Transducers Oct 2, 2020 Handwriting Recognition speech-recognition
Deep Speech: Scaling up end-to-end speech recognition Dec 17, 2014 Accented Speech Recognition Speech Recognition
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks Oct 7, 2016 Anomaly Detection Automatic Speech Recognition
Deep transfer operator learning for partial differential equations under conditional shift Apr 20, 2022 Domain Adaptation Operator learning
Adapting End-to-End Speech Recognition for Readable Subtitles May 25, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin Dec 8, 2015 Accented Speech Recognition Noisy Speech Recognition
Discriminative Multi-modality Speech Recognition May 12, 2020 Audio-Visual Speech Recognition Lipreading
Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency Nov 16, 2020 Compressive Sensing Edge-computing
Deep Audio-Visual Speech Recognition Sep 6, 2018 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition Dec 3, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition Oct 26, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing Mar 18, 2022 Representation Learning Speaker Verification
DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization Dec 11, 2020 Diversity Quantization
Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis May 9, 2022 Deep Learning Semantic Communication
Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities May 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition May 16, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
DARF: A data-reduced FADE version for simulations of speech recognition thresholds with real hearing aids Jul 10, 2020 Sentence speech-recognition
CoVoST 2 and Massively Multilingual Speech-to-Text Translation Jul 20, 2020 Machine Translation speech-recognition
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese Oct 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Cross Attention Augmented Transducer Networks for Simultaneous Translation Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language Feb 7, 2022 image-classification Image Classification
Deep Sparse Conformer for Speech Recognition Sep 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC Sep 19, 2024 Disentanglement speech-recognition
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech Jun 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition Oct 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI Dec 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Continuous speech separation: dataset and analysis Jan 30, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Compiling ONNX Neural Network Models Using MLIR Aug 19, 2020 speech-recognition Speech Recognition
Comparative layer-wise analysis of self-supervised speech models Nov 8, 2022 speech-recognition Speech Recognition
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition Feb 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context May 7, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
BERTphone: Phonetically-Aware Encoder Representations for Utterance-Level Speaker and Language Recognition Jun 30, 2019 Avg Representation Learning
Common Voice: A Massively-Multilingual Speech Corpus Dec 13, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline Sep 22, 2020 speech-recognition Speech Recognition
Communication-Efficient Learning of Deep Networks from Decentralized Data Feb 17, 2016 Federated Learning Speech Recognition
CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution Jul 4, 2022 Compiler Optimization image-classification
Computer-Generated Music for Tabletop Role-Playing Games Aug 16, 2020 speech-recognition Speech Recognition
Cross-Speaker Encoding Network for Multi-Talker Speech Recognition Jan 8, 2024 Decoder speech-recognition
CTC-synchronous Training for Monotonic Attention Model May 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
D4AM: A General Denoising Framework for Downstream Acoustic Models Nov 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Contrastive Learning-Based Audio to Lyrics Alignment for Multiple Languages Jun 13, 2023 Contrastive Learning speech-recognition
A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition Mar 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)