| AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR | Mar 29, 2023 | Automatic Speech RecognitionDomain Adaptation | —Unverified | 0 |
| Joint unsupervised and supervised learning for context-aware language identification | Mar 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP | Mar 28, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 1 |
| Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis | Mar 27, 2023 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels | Mar 25, 2023 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 2 |
| Enhancing Unsupervised Speech Recognition with Diffusion GANs | Mar 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition | Mar 23, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition | Mar 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Self-supervised Learning with Speech Modulation Dropout | Mar 22, 2023 | Automatic Speech RecognitionSelf-Supervised Learning | —Unverified | 0 |
| Transformers in Speech Processing: A Survey | Mar 21, 2023 | Automatic Speech RecognitionSpeech Enhancement | —Unverified | 0 |