| LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data | Dec 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| FastInject: Injecting Unpaired Text Data into CTC-based ASR training | Dec 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Audio-visual fine-tuning of audio-only ASR models | Dec 14, 2023 | Automatic Speech Recognition, Self-Supervised Learning | Unverified | 0 |
| USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models | Dec 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition | Dec 13, 2023 | Automatic Speech Recognition, Data Augmentation | Unverified | 0 |
| Extending Whisper with prompt tuning to target-speaker ASR | Dec 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification | Dec 12, 2023 | Automatic Speech Recognition, Dialect Identification | Unverified | 0 |
| Creating Spoken Dialog Systems in Ultra-Low Resourced Settings | Dec 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning | Dec 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models | Dec 6, 2023 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition | Dec 6, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Bigger is not Always Better: The Effect of Context Size on Speech Pre-Training | Dec 3, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| End-to-End Speech-to-Text Translation: A Survey | Dec 2, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| End-to-end Joint Punctuated and Normalized ASR with a Limited Amount of Punctuated Training Data | Nov 29, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| D4AM: A General Denoising Framework for Downstream Acoustic Models | Nov 28, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors | Nov 27, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR | Nov 24, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild | Nov 21, 2023 | Automatic Speech Recognition, speech-recognition | Code Available | 0 |
| Soft Random Sampling: A Theoretical and Empirical Analysis | Nov 21, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| App for Resume-Based Job Matching with Speech Interviews and Grammar Analysis: A Review | Nov 20, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| How does end-to-end speech recognition training impact speech enhancement artifacts? | Nov 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding | Nov 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition | Nov 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Multi-channel Conversational Speaker Separation via Neural Diarization | Nov 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer | Nov 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |