SOTAVerified

Speech Representation Learning

Papers

Showing 150 of 131 papers

TitleStatusHype
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-SpeechCode6
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-TrainingCode3
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster PredictionCode2
Robust Self-Supervised Audio-Visual Speech RecognitionCode2
TranSpeech: Speech-to-Speech Translation With Bilateral PerturbationCode1
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation LearningCode1
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setupCode1
FaceXHuBERT: Text-less Speech-driven E(X)pressive 3D Facial Animation Synthesis Using Self-Supervised Speech Representation LearningCode1
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation LearningCode1
Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice ConversionCode1
The Efficacy of Self-Supervised Speech Models for Audio RepresentationsCode1
Unsupervised speech representation learning using WaveNet autoencodersCode1
DeCoAR 2.0: Deep Contextualized Acoustic Representations with Vector QuantizationCode1
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech RepresentationCode1
CLARA: Multilingual Contrastive Learning for Audio Representation AcquisitionCode1
MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple TargetsCode1
Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERTCode1
An Unsupervised Autoregressive Model for Speech Representation LearningCode1
Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and UnderstandingCode1
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive LearningCode1
Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation LearningCode1
Speech SIMCLR: Combining Contrastive and Reconstruction Objective for Self-supervised Speech Representation LearningCode1
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and EditingCode1
The Effect of Batch Size on Contrastive Self-Supervised Speech Representation LearningCode1
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled DataCode1
UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-TrainingCode1
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation LearningCode1
Using Radio Archives for Low-Resource Speech Recognition: Towards an Intelligent Virtual Assistant for Illiterate UsersCode1
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at ScaleCode1
Supervised Speech Representation Learning for Parkinson's Disease ClassificationCode1
SLICER: Learning universal audio representations using low-resource self-supervised pre-trainingCode1
Fast Development of ASR in African Languages using Self Supervised Speech Representation LearningCode1
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERTCode1
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden UnitsCode1
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation LearningCode0
A multimodal dynamical variational autoencoder for audiovisual speech representation learningCode0
Deep Neural Convolutive Matrix Factorization for Articulatory Representation DecompositionCode0
Sampling strategies in Siamese Networks for unsupervised speech representation learningCode0
Pretext Tasks selection for multitask self-supervised speech representation learningCode0
Conditional independence for pretext task selection in Self-supervised speech representation learningCode0
An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task LearningCode0
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation LearningCode0
A low latency attention module for streaming self-supervised speech representation learningCode0
mHuBERT-147: A Compact Multilingual HuBERT ModelCode0
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERTCode0
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer EncodersCode0
PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech RepresentationsCode0
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE0
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective0
Disentangled Feature Learning for Real-Time Neural Speech Coding0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.