SOTAVerified

Speech Representation Learning

Papers

Showing 51100 of 131 papers

TitleStatusHype
VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning0
MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple TargetsCode1
Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement0
ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-SpeechCode6
SLICER: Learning universal audio representations using low-resource self-supervised pre-trainingCode1
data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setupCode1
Application of Knowledge Distillation to Multi-task Speech Representation Learning0
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive LearningCode1
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE0
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach0
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning0
Experiments on Turkish ASR with Self-Supervised Speech Representation Learning0
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding0
The Efficacy of Self-Supervised Speech Models for Audio RepresentationsCode1
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE0
Self-supervised models of audio effectively explain human cortical responses to speech0
TranSpeech: Speech-to-Speech Translation With Bilateral PerturbationCode1
Self-Supervised Speech Representation Learning: A Review0
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation0
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation LearningCode0
Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning0
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective0
Deep Neural Convolutive Matrix Factorization for Articulatory Representation DecompositionCode0
PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech RepresentationsCode0
Robust Disentangled Variational Speech Representation Learning for Zero-shot Voice ConversionCode1
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERTCode1
Robust Speaker Recognition with Transformers Using wav2vec 2.00
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation0
XTREME-S: Evaluating Cross-lingual Speech Representations0
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and EditingCode1
Privacy-Preserving Speech Representation Learning using Vector Quantization0
Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks0
A Brief Overview of Unsupervised Neural Speech Representation Learning0
A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition0
A Deep Paradigm for Articulatory Speech Representation Learning via Neural Convolutive Sparse Matrix Factorization0
Robust Self-Supervised Audio-Visual Speech RecognitionCode2
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster PredictionCode2
Robust Speech Representation Learning via Flow-based Embedding Regularization0
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at ScaleCode1
Characterizing the adversarial vulnerability of speech self-supervised learning0
Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction0
Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning0
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks0
UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-TrainingCode1
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERTCode0
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-TrainingCode3
An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning0
Pretext Tasks selection for multitask self-supervised speech representation learningCode0
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden UnitsCode1
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.