| Word Error Rate Estimation Without ASR Output: e-WER2 | Aug 8, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Text-Aware End-to-end Mispronunciation Detection and Diagnosis | Jun 15, 2022 | Contrastive Learning, Phoneme Recognition | Code Available | 1 | 5 |
| Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Assessment | Mar 29, 2022 | Phoneme Recognition, Pseudo Label | Code Available | 1 | 5 |
| WaveNet: A Generative Model for Raw Audio | Sep 12, 2016 | Audio Generation, model | Code Available | 1 | 5 |
| FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning | Jul 1, 2022 | Knowledge Distillation, Phoneme Recognition | Code Available | 1 | 5 |
| Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring | Sep 19, 2023 | Feature Engineering, Phone-level pronunciation scoring | Code Available | 1 | 5 |
| Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes | Jun 7, 2023 | Attribute, Cross-Lingual Transfer | Code Available | 1 | 5 |
| Attention-Based Models for Speech Recognition | Jun 24, 2015 | Machine Translation, Phoneme Recognition | Code Available | 1 | 5 |
| End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks | Dec 7, 2013 | Phoneme Recognition | Code Available | 0 | 5 |
| Benchmarking Generative Latent Variable Models for Speech | Feb 22, 2022 | Benchmarking, Image Generation | Code Available | 0 | 5 |