BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Back Translation for Speech-to-text Translation Without Transcripts May 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BembaSpeech: A Speech Recognition Corpus for the Bemba Language Feb 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Dompteur: Taming Audio Adversarial Examples Feb 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities Feb 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning Oct 17, 2024 Representation Learning Self-Supervised Learning
Code Code Available 15 AVATAR: Unconstrained Audiovisual Speech Recognition Jun 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AVLnet: Learning Audio-Visual Language Representations from Instructional Videos Jun 16, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet Jun 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Distilling the Knowledge of BERT for Sequence-to-Sequence ASR Aug 9, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts Nov 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition Benchmark for Air-Traffic Communications Jun 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks Oct 7, 2016 Anomaly Detection Automatic Speech Recognition
Code Code Available 15 Automatic Speech Recognition for Speech Assessment of Persian Preschool Children Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Lyrics Transcription using Dilated Convolutional Neural Networks with Self-Attention Jul 13, 2020 Automatic Lyrics Transcription speech-recognition
Code Code Available 15 Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AV Taris: Online Audio-Visual Speech Recognition Dec 14, 2020 Action Detection Activity Detection
Code Code Available 15 AutoDiCE: Fully Automated Distributed CNN Inference at the Edge Jul 20, 2022 Code Generation image-classification
Code Code Available 15 Discriminative Multi-modality Speech Recognition May 12, 2020 Audio-Visual Speech Recognition Lipreading
Code Code Available 15 Differentiable Weighted Finite-State Transducers Oct 2, 2020 Handwriting Recognition speech-recognition
Code Code Available 15 Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC Sep 19, 2024 Disentanglement speech-recognition
Code Code Available 15 Distilling a Pretrained Language Model to a Multilingual ASR Model Jun 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DiariST: Streaming Speech Translation with Speaker Diarization Sep 14, 2023 speaker-diarization Speaker Diarization
Code Code Available 15 DiaCorrect: Error Correction Back-end For Speaker Diarization Sep 15, 2023 Automatic Speech Recognition Decoder
Code Code Available 15 DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition Aug 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Deep transfer operator learning for partial differential equations under conditional shift Apr 20, 2022 Domain Adaptation Operator learning
Code Code Available 15 Automatic Disfluency Detection from Untranscribed Speech Nov 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention model for articulatory features detection Jul 2, 2019 Manner Of Articulation Detection model
Code Code Available 15 BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense Evaluation Nov 17, 2024 Action Recognition backdoor defense
Code Code Available 15 Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model Jun 2, 2023 speech-recognition Speech Recognition
Code Code Available 15 Deep Audio-Visual Speech Recognition Sep 6, 2018 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attack on practical speaker verification system using universal adversarial perturbations May 19, 2021 Real-World Adversarial Attack Room Impulse Response (RIR)
Code Code Available 15 Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency Nov 16, 2020 Compressive Sensing Edge-computing
Code Code Available 15 Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition Oct 26, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A transfer learning based approach for pronunciation scoring Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization Dec 11, 2020 Diversity Quantization
Code Code Available 15 Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition Dec 3, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text Apr 3, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing Mar 18, 2022 Representation Learning Speaker Verification
Code Code Available 15 A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline Sep 22, 2020 speech-recognition Speech Recognition
Code Code Available 15