| Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning | Jun 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities | May 29, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation | May 28, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition | May 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients | May 27, 2024 | Automatic Speech Recognition, Federated Learning | Code Available | 0 |
| Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition | May 24, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | May 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition | May 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 |
| Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models | May 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 3 |
| Contextualized Automatic Speech Recognition with Dynamic Vocabulary | May 22, 2024 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |