| Improving Mandarin Speech Recogntion with Block-augmented Transformer | Jul 24, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription | Jul 20, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| MM-ALT: A Multimodal Automatic Lyric Transcription System | Jul 13, 2022 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Distilling a Pretrained Language Model to a Multilingual ASR Model | Jun 25, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement | Jun 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| AVATAR: Unconstrained Audiovisual Speech Recognition | Jun 15, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| LAE: Language-Aware Encoder for Monolingual and Multilingual ASR | Jun 5, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners | May 22, 2022 | AttributeAutomatic Speech Recognition | CodeCode Available | 1 |
| Vietnamese Automatic Speech Recognition using Wav2vec 2.0 | May 8, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment | May 6, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |