| Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations | Aug 14, 2023 | Action DetectionActivity Detection | CodeCode Available | 0 |
| Swiss Parliaments Corpus, an Automatically Aligned Swiss German Speech to Standard German Text Corpus | Oct 6, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Multimodal Speech Recognition for Language-Guided Embodied Agents | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition | Sep 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task Learning | Oct 25, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese | Apr 28, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Syllable Subword Tokens for Open Vocabulary Speech Recognition in Malayalam | Jan 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Segmentation-Free Streaming Machine Translation | Sep 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Adapting the adapters for code-switching in multilingual ASR | Oct 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition | Jul 5, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Selective Attention Merging for low resource tasks: A case study of Child ASR | Jan 14, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Preserving spoken content in voice anonymisation with character-level vocoder conditioning | Aug 8, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Pretext Tasks selection for multitask self-supervised speech representation learning | Jul 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding | Dec 16, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition Systems | Apr 1, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Jun 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search Degradation | Sep 13, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| DoCIA: An Online Document-Level Context Incorporation Agent for Speech Translation | Apr 7, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects | Jun 27, 2024 | Automatic Speech RecognitionMachine Translation | CodeCode Available | 0 |
| Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain | Jun 16, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case Study | Jan 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Pre-training on high-resource speech recognition improves low-resource speech-to-text translation | Sep 5, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Discrete Speech Unit Extraction via Independent Component Analysis | Jan 11, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Towards Temporally Explainable Dysarthric Speech Clarity Assessment | May 31, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation | Oct 18, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Improving Voice Separation by Incorporating End-to-end Speech Recognition | Nov 29, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Self-Powered LLM Modality Expansion for Large Speech-Text Models | Oct 4, 2024 | Automatic Speech RecognitionInstruction Following | CodeCode Available | 0 |
| Improving RNN Transducer Modeling for End-to-End Speech Recognition | Sep 26, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| ADIMA: Abuse Detection In Multilingual Audio | Feb 16, 2022 | Abuse DetectionAutomatic Speech Recognition | CodeCode Available | 0 |
| WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation | Nov 18, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based Augmentation | Jun 7, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Advancing African-Accented Speech Recognition: Epistemic Uncertainty-Driven Data Selection for Generalizable ASR Models | Jun 3, 2023 | Accented Speech RecognitionActive Learning | CodeCode Available | 0 |
| Leveraging Self-Supervised Models for Automatic Whispered Speech Recognition | Jul 30, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR | Jan 13, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Improving LSTM-CTC based ASR performance in domains with limited training data | Jul 3, 2017 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Improving CTC-based speech recognition via knowledge transferring from pre-trained language models | Feb 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Targeted Adversarial Examples for Black Box Audio Systems | May 20, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| A Comprehensive Evaluation of Incremental Speech Recognition and Diarization for Conversational AI | Dec 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling | Feb 7, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Towards Unsupervised Speech Recognition Without Pronunciation Models | Jun 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors | Nov 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding | Feb 10, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition | Jan 4, 2024 | AttributeAutomatic Speech Recognition | CodeCode Available | 0 |
| Self-supervised Speech Representations Still Struggle with African American Vernacular English | Aug 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing | Jan 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition | Jan 26, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Learning to adapt: a meta-learning approach for speaker adaptation | Aug 30, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition | Mar 22, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Writer adaptation for offline text recognition: An exploration of neural network-based methods | Jul 11, 2023 | Automatic Speech RecognitionHandwriting Recognition | CodeCode Available | 0 |
| Semantically Corrected Amharic Automatic Speech Recognition | Apr 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |