Earnings-22: A Practical Benchmark for Accents in the Wild Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition Feb 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
EnCodecMAE: Leveraging neural codecs for universal audio representation learning Sep 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
End-to-end Audio-visual Speech Recognition with Conformers Feb 12, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
End-to-end Named Entity Recognition from English Speech May 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Adaptation of Whisper models to child speech recognition Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Enhancing Monotonic Multihead Attention for Streaming ASR May 19, 2020 All Automatic Speech Recognition
Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention Oct 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Adapting End-to-End Speech Recognition for Readable Subtitles May 25, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Espresso: A Fast End-to-end Neural Speech Recognition Toolkit Sep 18, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition Oct 26, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Continuous speech separation: dataset and analysis Jan 30, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
FAST-RIR: Fast neural diffuse room impulse response generator Oct 7, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech Jun 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models Jul 5, 2024 Adversarial Attack Automatic Speech Recognition
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition May 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Golos: Russian Dataset for Speech Research Jun 18, 2021 Automatic Speech Recognition (ASR) Language Modeling
Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview Oct 14, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models Sep 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
HypR: A comprehensive study for ASR hypothesis revising with a reference corpus Sep 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Improved Noisy Student Training for Automatic Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model Jan 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks Oct 7, 2016 Anomaly Detection Automatic Speech Recognition
Improving Self-supervised Pre-training using Accent-Specific Codebooks Jul 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Incorporating External POS Tagger for Punctuation Restoration Jun 12, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI Dec 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context May 7, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
CopyNE: Better Contextual ASR by Copying Named Entities May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Common Voice: A Massively-Multilingual Speech Corpus Dec 13, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Accented Speech Recognition With Accent-specific Codebooks Oct 24, 2023 Accented Speech Recognition Automatic Speech Recognition
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition Feb 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition Jul 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition Oct 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese Oct 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations Oct 5, 2022 Automatic Speech Recognition (ASR) Clustering
A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applications Apr 30, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A context-aware knowledge transferring strategy for CTC-based ASR Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Can Contextual Biasing Remain Effective with Whisper and GPT-2? Jun 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Can we use Common Voice to train a Multi-Speaker TTS system? Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
CL-MASR: A Continual Learning Benchmark for Multilingual ASR Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications Oct 12, 2021 Action Detection Activity Detection
BembaSpeech: A Speech Recognition Corpus for the Bemba Language Feb 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG data Jan 28, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 Jul 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Advancing Test-Time Adaptation in Wild Acoustic Test Settings Oct 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos Jun 16, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
AV Taris: Online Audio-Visual Speech Recognition Dec 14, 2020 Action Detection Activity Detection
A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)