It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition Feb 8, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 Word Error Rate Estimation Without ASR Output: e-WER2 Aug 8, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT Oct 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ASR Error Correction with Constrained Decoding on Operation Prediction Aug 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Learning to Count Words in Fluent Speech enables Online Speech Recognition Jun 8, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Leveraging pre-trained representations to improve access to untranscribed speech from endangered languages Mar 26, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One Feb 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Large-Scale Streaming End-to-End Speech Translation with Neural Transducers Apr 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs Jun 26, 2024 ArzEn Code-switched Translation to ara ArzEn Code-switched Translation to eng
Code Code Available 15 A Comparison of Methods for OOV-word Recognition on a New Public Dataset Jul 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Layer-wise Analysis of a Self-supervised Speech Representation Model Jul 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English Aug 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners May 22, 2022 Attribute Automatic Speech Recognition
Code Code Available 15 Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-based Contextual Language Model Adaptation for Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech Apr 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ArTST: Arabic Text and Speech Transformer Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 An Investigation of End-to-End Models for Robust Speech Recognition Feb 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features Aug 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 LAE: Language-Aware Encoder for Monolingual and Multilingual ASR Jun 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Disfluency Detection from Untranscribed Speech Nov 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition Benchmark for Air-Traffic Communications Jun 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition for Speech Assessment of Persian Preschool Children Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AVATAR: Unconstrained Audiovisual Speech Recognition Jun 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 AVLnet: Learning Audio-Visual Language Representations from Instructional Videos Jun 16, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications Oct 12, 2021 Action Detection Activity Detection
Code Code Available 15 BembaSpeech: A Speech Recognition Corpus for the Bemba Language Feb 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG data Jan 28, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Learning Audio-Visual Dereverberation Jun 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition Feb 24, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 metaCAT: A Metadata-based Task-oriented Chatbot Annotation Tool Dec 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR Nov 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition Sep 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Persian ASR-based SER: Modification of Sharif Emotional Speech Database and Investigation of Persian Text Corpora Nov 18, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Advancing Test-Time Adaptation in Wild Acoustic Test Settings Oct 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation Oct 24, 2022 Action Detection Activity Detection
Code Code Available 15 Single-Channel Multi-Speaker Separation using Deep Clustering Jul 7, 2016 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 MT3: Multi-Task Multitrack Music Transcription Nov 4, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Iterative Pseudo-Labeling for Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 A Comparison of Adaptation Techniques and Recurrent Neural Network Architectures Jul 12, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information Apr 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC Videos Jul 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05