It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition Feb 8, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 1JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT Oct 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CL-MASR: A Continual Learning Benchmark for Multilingual ASR Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables Oct 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CopyNE: Better Contextual ASR by Copying Named Entities May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet Jun 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Automatic Speech Recognition Benchmark for Air-Traffic Communications Jun 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Less Peaky and More Accurate CTC Forced Alignment by Label Priors Apr 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition Feb 24, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Adaptation of Whisper models to child speech recognition Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Lightweight Adapter Tuning for Multilingual Speech Translation Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition Aug 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing Nov 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Mamba for Streaming ASR Combined with Unimodal Aggregation Sep 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula Dec 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Adapting End-to-End Speech Recognition for Readable Subtitles May 25, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Minimum Bayes Risk Training for End-to-End Speaker-Attributed ASR Nov 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Automatic Speech Recognition for Speech Assessment of Persian Preschool Children Mar 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1MM-ALT: A Multimodal Automatic Lyric Transcription System Jul 13, 2022 Action Detection Activity Detection
Code Code Available 1AVATAR: Unconstrained Audiovisual Speech Recognition Jun 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Multi-modal Dense Video Captioning Mar 17, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Natural Language Processing Advancements By Deep Learning: A Survey Mar 2, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Toolbox for Construction and Analysis of Speech Datasets Apr 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 1Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Open Source Automatic Speech Recognition for German Jul 26, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Attention-based Contextual Language Model Adaptation for Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1PriMock57: A Dataset Of Primary Care Mock Consultations Apr 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Punctuation Restoration using Transformer Models for High-and Low-Resource Languages Nov 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text Apr 3, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Automatic Disfluency Detection from Untranscribed Speech Nov 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR Feb 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1AVLnet: Learning Audio-Visual Language Representations from Instructional Videos Jun 16, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English Aug 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization Jun 3, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Skit-S2I: An Indian Accented Speech to Intent dataset Dec 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech Nov 19, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1An Investigation of End-to-End Models for Robust Speech Recognition Feb 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Speaker Recognition in the Wild May 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One Feb 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1