KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition Sep 7, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Multilingual and code-switching ASR challenges for low resource Indian languages Apr 1, 2021 Automatic Speech Recognition (ASR) Sentence
Code Code Available 15 A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Toolbox for Construction and Analysis of Speech Datasets Apr 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Neural Predictor for Black-Box Adversarial Attacks on Speech Recognition Mar 18, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Distilling a Pretrained Language Model to a Multilingual ASR Model Jun 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Self-supervised Pre-training using Accent-Specific Codebooks Jul 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English Aug 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Adaptation of Whisper models to child speech recognition Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Dompteur: Taming Audio Adversarial Examples Feb 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering Mar 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities Feb 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 EnCodecMAE: Leveraging neural codecs for universal audio representation learning Sep 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation Nov 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Incorporating External POS Tagger for Punctuation Restoration Jun 12, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition Feb 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition Aug 31, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients Nov 11, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model Jan 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A transfer learning based approach for pronunciation scoring Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 End-to-end Named Entity Recognition from English Speech May 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Mandarin Speech Recogntion with Block-augmented Transformer Jul 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 End-to-End Speech Recognition and Disfluency Removal Sep 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement Dec 21, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 End-to-End Speech Recognition from Federated Acoustic Models Apr 29, 2021 2k 4k
Code Code Available 15 Representation Learning for Sequence Data with Deep Autoencoding Predictive Components Oct 7, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One Feb 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention Oct 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition Oct 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Espresso: A Fast End-to-end Neural Speech Recognition Toolkit Sep 18, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction Jul 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi Apr 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Extending Whisper with prompt tuning to target-speaker ASR Dec 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 FAST-RIR: Fast neural diffuse room impulse response generator Oct 7, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ASR Error Correction with Constrained Decoding on Operation Prediction Aug 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving RNN Transducer Based ASR with Auxiliary Tasks Nov 5, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Radically Old Way of Computing Spectra: Applications in End-to-End ASR Mar 25, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces May 25, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset May 12, 2024 Action Spotting Automatic Speech Recognition
Code Code Available 15 indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ArTST: Arabic Text and Speech Transformer Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks Oct 7, 2016 Anomaly Detection Automatic Speech Recognition
Code Code Available 15 A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 HypR: A comprehensive study for ASR hypothesis revising with a reference corpus Sep 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15