| TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation | May 25, 2022 | Representation LearningRhythm | CodeCode Available | 1 | 5 |
| DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning | May 17, 2023 | ClusteringLanguage Modeling | CodeCode Available | 1 | 5 |
| Unsupervised speech representation learning using WaveNet autoencoders | Jan 25, 2019 | Acoustic Unit DiscoveryDecoder | CodeCode Available | 1 | 5 |
| Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding | Feb 27, 2023 | Model CompressionRepresentation Learning | CodeCode Available | 1 | 5 |
| A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing | Mar 18, 2022 | Representation LearningSpeaker Verification | CodeCode Available | 1 | 5 |
| data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup | Nov 2, 2022 | Automatic Speech Recognition (ASR)Language Modeling | CodeCode Available | 1 | 5 |
| Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT | Sep 16, 2024 | Acoustic Unit DiscoveryClustering | CodeCode Available | 1 | 5 |
| Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning | Sep 25, 2023 | Representation LearningSelf-Supervised Learning | CodeCode Available | 1 | 5 |
| LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT | Mar 29, 2022 | AllAutomatic Speech Recognition | CodeCode Available | 1 | 5 |
| Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning | Apr 8, 2022 | Contrastive LearningData Augmentation | CodeCode Available | 0 | 5 |
| A multimodal dynamical variational autoencoder for audiovisual speech representation learning | May 5, 2023 | DenoisingDisentanglement | CodeCode Available | 0 | 5 |
| Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition | Apr 1, 2022 | Phoneme RecognitionRepresentation Learning | CodeCode Available | 0 | 5 |
| Sampling strategies in Siamese Networks for unsupervised speech representation learning | Apr 30, 2018 | Representation LearningSpeech Representation Learning | CodeCode Available | 0 | 5 |
| PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations | Mar 31, 2022 | Domain AdaptationLanguage Modelling | CodeCode Available | 0 | 5 |
| Conditional independence for pretext task selection in Self-supervised speech representation learning | Apr 15, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Pretext Tasks selection for multitask self-supervised speech representation learning | Jul 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task Learning | Mar 13, 2024 | DenoisingKnowledge Distillation | CodeCode Available | 0 | 5 |
| Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders | Oct 25, 2019 | General ClassificationRepresentation Learning | CodeCode Available | 0 | 5 |
| A low latency attention module for streaming self-supervised speech representation learning | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT | Oct 5, 2021 | Multi-Task LearningRepresentation Learning | CodeCode Available | 0 | 5 |
| k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning | Nov 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| mHuBERT-147: A Compact Multilingual HuBERT Model | Jun 10, 2024 | Automatic Speech Recognition (ASR)Diversity | CodeCode Available | 0 | 5 |
| MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning | Oct 17, 2023 | DisentanglementRepresentation Learning | CodeCode Available | 0 | 5 |
| Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE | Oct 25, 2022 | DisentanglementRepresentation Learning | —Unverified | 0 | 0 |
| Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective | Apr 5, 2022 | DisentanglementRepresentation Learning | —Unverified | 0 | 0 |