Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition Feb 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Graph Convolutions Enrich the Self-Attention in Transformers! Dec 7, 2023 Clone Detection
Code Code Available 15 Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model Jun 25, 2024 Automatic Lyrics Transcription Automatic Speech Recognition
Code Code Available 15 Towards Improved Room Impulse Response Estimation for Speech Recognition Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Towards Resistant Audio Adversarial Examples Oct 14, 2020 Adversarial Attack speech-recognition
Code Code Available 15 HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions Sep 18, 2022 object-detection Object Detection
Code Code Available 15 Language and Speech Technology for Central Kurdish Varieties Mar 4, 2024 Automatic Speech Recognition Diversity
Code Code Available 15 Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition Jun 18, 2023 Audio-Visual Speech Recognition speech-recognition
Code Code Available 15 Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition Oct 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 HiFi-VC: High Quality ASR-Based Voice Conversion Mar 31, 2022 speech-recognition Speech Recognition
Code Code Available 15 Integrating Lattice-Free MMI into End-to-End Speech Recognition Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation May 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-based Contextual Language Model Adaptation for Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages Aug 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings Aug 11, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation Nov 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Vietnamese Named Entity Recognition from Speech Using Word Capitalization and Punctuation Recovery Models Oct 1, 2020 Language Modeling Language Modelling
Code Code Available 15 Imputer: Sequence Modelling via Imputation and Dynamic Programming Feb 20, 2020 Imputation speech-recognition
Code Code Available 15 Improving Self-supervised Pre-training using Accent-Specific Codebooks Jul 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model Jan 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ASR Error Correction with Constrained Decoding on Operation Prediction Aug 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Mandarin Speech Recogntion with Block-augmented Transformer Jul 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving Transformer-based Speech Recognition Using Unsupervised Pre-training Oct 22, 2019 speech-recognition Speech Recognition
Code Code Available 15 Incorporating External POS Tagger for Punctuation Restoration Jun 12, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus Jul 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improved Open Source Automatic Subtitling for Lecture Videos Sep 1, 2022 Speech Recognition
Code Code Available 15 A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English Aug 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improving RNN Transducer Based ASR with Auxiliary Tasks Nov 5, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improved DeepFake Detection Using Whisper Features Jun 2, 2023 Automatic Speech Recognition DeepFake Detection
Code Code Available 15 A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 A transfer learning based approach for pronunciation scoring Nov 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 An Investigation of End-to-End Models for Robust Speech Recognition Feb 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attack on practical speaker verification system using universal adversarial perturbations May 19, 2021 Real-World Adversarial Attack Room Impulse Response (RIR)
Code Code Available 15 Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition Sep 5, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention-Based Models for Speech Recognition Jun 24, 2015 Machine Translation Phoneme Recognition
Code Code Available 15 A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One Feb 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Attention model for articulatory features detection Jul 2, 2019 Manner Of Articulation Detection model
Code Code Available 15 Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text Apr 3, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition Feb 8, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 AutoDiCE: Fully Automated Distributed CNN Inference at the Edge Jul 20, 2022 Code Generation image-classification
Code Code Available 15 Improved Noisy Student Training for Automatic Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Improved training of end-to-end attention models for speech recognition May 8, 2018 Language Modeling Language Modelling
Code Code Available 15 A Resource for Computational Experiments on Mapudungun Dec 4, 2019 Machine Translation speech-recognition
Code Code Available 15