| FaceXHuBERT: Text-less Speech-driven E(X)pressive 3D Facial Animation Synthesis Using Self-Supervised Speech Representation Learning | Mar 9, 2023 | 3D Face AnimationRepresentation Learning | CodeCode Available | 1 | 5 |
| Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning | Mar 16, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units | Jun 14, 2021 | ClusteringLanguage Modelling | CodeCode Available | 1 | 5 |
| QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning | Aug 31, 2023 | Representation LearningSpeech Representation Learning | CodeCode Available | 1 | 5 |
| CLARA: Multilingual Contrastive Learning for Audio Representation Acquisition | Oct 18, 2023 | Audio ClassificationContrastive Learning | CodeCode Available | 1 | 5 |
| data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Nov 2, 2022 | Automatic Speech Recognition (ASR)Language Modeling | CodeCode Available | 1 | 5 |
| A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing | Mar 18, 2022 | Representation LearningSpeaker Verification | CodeCode Available | 1 | 5 |
| DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning | May 17, 2023 | ClusteringLanguage Modeling | CodeCode Available | 1 | 5 |
| EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning | Oct 17, 2024 | Representation LearningSelf-Supervised Learning | CodeCode Available | 1 | 5 |
| DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector Quantization | Dec 11, 2020 | DiversityQuantization | CodeCode Available | 1 | 5 |