| CPPF: A contextual and post-processing-free model for automatic speech recognition | Sep 14, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks | Sep 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| PromptASR for contextualized ASR with controllable style | Sep 14, 2023 | Automatic Speech Recognition, speech-recognition | Code Available | 2 |
| Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation | Sep 14, 2023 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| EnCodecMAE: Leveraging neural codecs for universal audio representation learning | Sep 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec | Sep 14, 2023 | Automatic Speech Recognition, speech-recognition | Code Available | 2 |
| Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis | Sep 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Open-vocabulary Keyword-spotting with Adaptive Instance Normalization | Sep 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Can Whisper perform speech-based in-context learning? | Sep 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Improving Robustness of Neural Inverse Text Normalization via Data-Augmentation, Semi-Supervised Learning, and Post-Aligning Method | Sep 12, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |