Radically Old Way of Computing Spectra: Applications in End-to-End ASR Mar 25, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition Benchmark for Air-Traffic Communications Jun 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A context-aware knowledge transferring strategy for CTC-based ASR Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applications Apr 30, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Golos: Russian Dataset for Speech Research Jun 18, 2021 Automatic Speech Recognition (ASR) Language Modeling
Code Code Available 15 Advancing Test-Time Adaptation in Wild Acoustic Test Settings Oct 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Disfluency Detection from Untranscribed Speech Nov 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AVATAR: Unconstrained Audiovisual Speech Recognition Jun 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Accented Speech Recognition With Accent-specific Codebooks Oct 24, 2023 Accented Speech Recognition Automatic Speech Recognition
Code Code Available 15 AV Taris: Online Audio-Visual Speech Recognition Dec 14, 2020 Action Detection Activity Detection
Code Code Available 15 BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BembaSpeech: A Speech Recognition Corpus for the Bemba Language Feb 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications Oct 12, 2021 Action Detection Activity Detection
Code Code Available 15 Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation Oct 24, 2022 Action Detection Activity Detection
Code Code Available 15 Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model Jan 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Mandarin Speech Recogntion with Block-augmented Transformer Jul 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation Nov 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Incorporating External POS Tagger for Punctuation Restoration Jun 12, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 Jul 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Can we use Common Voice to train a Multi-Speaker TTS system? Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations Oct 5, 2022 Automatic Speech Recognition (ASR) Clustering
Code Code Available 15 A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings Aug 11, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 CL-MASR: A Continual Learning Benchmark for Multilingual ASR Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT Oct 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English Aug 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text Apr 3, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition for Speech Assessment of Persian Preschool Children Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition Jul 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Common Voice: A Massively-Multilingual Speech Corpus Dec 13, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One Feb 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition Oct 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Continuous speech separation: dataset and analysis Jan 30, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech Jun 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 CopyNE: Better Contextual ASR by Copying Named Entities May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models Jul 5, 2024 Adversarial Attack Automatic Speech Recognition
Code Code Available 15 Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities May 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Cross Attention Augmented Transducer Networks for Simultaneous Translation Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition May 16, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 CTC-synchronous Training for Monotonic Attention Model May 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi Apr 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ArTST: Arabic Text and Speech Transformer Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15