SOTAVerified

Speech Representation Learning

Papers

Showing 51100 of 131 papers

TitleStatusHype
Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information0
Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction0
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach0
Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement0
Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation0
INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition0
JOOCI: a Framework for Learning Comprehensive Speech Representations0
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation0
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling0
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks0
An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning0
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning0
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks0
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception0
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends0
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning0
Disentangled Feature Learning for Real-Time Neural Speech Coding0
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective0
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE0
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?0
Efficiency-oriented approaches for self-supervised speech representation learning0
Label Aware Speech Representation Learning For Language Identification0
Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks0
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning0
Learning Cross-lingual Visual Speech Representations0
Learning Disentangled Speech Representations0
Learning Robust and Multilingual Speech Representations0
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation0
Towards the Next Frontier in Speech Representation Learning Using Disentanglement0
MASR: Multi-label Aware Speech Representation0
Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning0
VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning0
On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation0
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding0
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition0
Privacy-preserving Representation Learning for Speech Understanding0
Privacy-Preserving Speech Representation Learning using Vector Quantization0
Progressive Residual Extraction based Pre-training for Speech Representation Learning0
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition0
Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion0
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective0
A Brief Overview of Unsupervised Neural Speech Representation Learning0
Wav2vec-C: A Self-supervised Model for Speech Representation Learning0
A Comparison of Discrete Latent Variable Models for Speech Representation Learning0
Robust Speaker Recognition with Transformers Using wav2vec 2.00
Robust Speech Representation Learning via Flow-based Embedding Regularization0
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation0
Self-supervised Contrastive Video-Speech Representation Learning for Ultrasound0
Self-supervised models of audio effectively explain human cortical responses to speech0
Self-Supervised Speech Representation Learning: A Review0
Show:102550
← PrevPage 2 of 3Next →

No leaderboard results yet.