| Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer | May 15, 2024 | Adversarial Attack, Automatic Speech Recognition | Unverified | 0 |
| Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants | May 14, 2024 | Automatic Speech Recognition, Diversity | Unverified | 0 |
| SpeechVerse: A Large-scale Generalizable Audio Language Model | May 14, 2024 | Automatic Speech Recognition, Benchmarking | Unverified | 0 |
| SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset | May 12, 2024 | Action Spotting, Automatic Speech Recognition | Code Available | 1 |
| Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech | May 10, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models | May 9, 2024 | Adversarial Attack, Automatic Speech Recognition | Code Available | 1 |
| Open Implementation and Study of BEST-RQ for Speech Processing | May 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition | May 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition | May 3, 2024 | Active Learning, Automatic Speech Recognition | Unverified | 0 |
| Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets | May 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Sequence-to-sequence models in peer-to-peer learning: A practical application | May 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Efficient Compression of Multitask Multilingual Speech Models | May 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features | May 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Does Whisper understand Swiss German? An automatic, qualitative, and human evaluation | Apr 30, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Automatic Speech Recognition System-Independent Word Error Rate Estimation | Apr 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF | Apr 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Developing Acoustic Models for Automatic Speech Recognition in Swedish | Apr 25, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices | Apr 24, 2024 | Automatic Speech Recognition, CPU | Unverified | 0 |
| Breaking Walls: Pioneering Automatic Speech Recognition for Central Kurdish: End-to-End Transformer Paradigm | Apr 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance | Apr 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information | Apr 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Less Peaky and More Accurate CTC Forced Alignment by Label Priors | Apr 22, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Semantically Corrected Amharic Automatic Speech Recognition | Apr 20, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Efficient infusion of self-supervised representations in Automatic Speech Recognition | Apr 19, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech | Apr 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |