| Self-Supervised Learning for Multi-Channel Neural Transducer | Aug 6, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion | Aug 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data | Aug 1, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation | Aug 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition | Jul 31, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards interfacing large language models with ASR systems using confidence measures and prompting | Jul 31, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Leveraging Self-Supervised Models for Automatic Whispered Speech Recognition | Jul 30, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses | Jul 26, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Improving Domain-Specific ASR with LLM-Generated Contextual Descriptions | Jul 25, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Scaling A Simple Approach to Zero-Shot Speech Recognition | Jul 25, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |