| LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Jun 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition | Jun 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't | Jun 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Refining Self-Supervised Learnt Speech Representation using Brain Activations | Jun 12, 2024 | Automatic Speech Recognition, Speaker Verification | Unverified | 0 |
| Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion Techniques | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Transformer-based Model for ASR N-Best Rescoring and Rewriting | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR | Jun 12, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Towards Unsupervised Speech Recognition Without Pronunciation Models | Jun 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Tag and correct: high precision post-editing approach to correction of speech recognition errors | Jun 11, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Reading Miscue Detection in Primary School through Automatic Speech Recognition | Jun 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter | Jun 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection | Jun 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ASTRA: Aligning Speech and Text Representations for Asr without Sampling | Jun 10, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations | Jun 9, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR | Jun 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Pitch-Aware RNN-T for Mandarin Chinese Mispronunciation Detection and Diagnosis | Jun 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Flexible Multichannel Speech Enhancement for Noise-Robust Frontend | Jun 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| To Distill or Not to Distill? On the Robustness of Robust Knowledge Distillation | Jun 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Hypernetworks for Personalizing ASR to Atypical Speech | Jun 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition | Jun 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |