Speech Representation Learning

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 131 papers

Title	Date	Tasks	Status
Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information	Dec 7, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction	Oct 28, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach	Oct 25, 2022	Representation LearningSpeaker Recognition	—Unverified
Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement	Nov 12, 2022	Data AugmentationEmotion Recognition	—Unverified
Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation	Jun 17, 2019	ClusteringRepresentation Learning	—Unverified
INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition	May 25, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
JOOCI: a Framework for Learning Comprehensive Speech Representations	Oct 14, 2024	Representation LearningSpeech Representation Learning	—Unverified
k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning	Nov 26, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Label Aware Speech Representation Learning For Language Identification	Jun 7, 2023	Language IdentificationMissing Labels	—Unverified
Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks	Mar 9, 2022	Representation Learningspeech-recognition	—Unverified
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning	Jun 3, 2020	Representation LearningSelf-Supervised Learning	—Unverified
Learning Cross-lingual Visual Speech Representations	Mar 14, 2023	Representation LearningSelf-Supervised Learning	—Unverified
Learning Disentangled Speech Representations	Nov 4, 2023	BenchmarkingDisentanglement	—Unverified
Learning Robust and Multilingual Speech Representations	Jan 29, 2020	Representation Learningspeech-recognition	—Unverified
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation	Mar 24, 2022	Representation LearningSpeech Representation Learning	—Unverified
Towards the Next Frontier in Speech Representation Learning Using Disentanglement	Jul 2, 2024	DisentanglementRepresentation Learning	—Unverified
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling	Jun 17, 2019	Representation LearningSpeech Representation Learning	—Unverified
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks	Oct 14, 2021	Audio ClassificationRepresentation Learning	—Unverified
An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning	Jul 26, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning	Jun 4, 2020	BIG-bench Machine LearningContrastive Learning	—Unverified
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks	Oct 23, 2019	Representation LearningSpeech Representation Learning	—Unverified
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception	Mar 21, 2024	Audio-Visual Speech RecognitionRepresentation Learning	—Unverified
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends	Jan 2, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning	Oct 16, 2022	Audio GenerationRepresentation Learning	—Unverified
Disentangled Feature Learning for Real-Time Neural Speech Coding	Nov 22, 2022	DisentanglementRepresentation Learning	—Unverified
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective	Apr 5, 2022	DisentanglementRepresentation Learning	—Unverified
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE	Oct 25, 2022	DisentanglementRepresentation Learning	—Unverified
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?	May 4, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Efficiency-oriented approaches for self-supervised speech representation learning	Dec 18, 2023	Automatic Speech RecognitionRepresentation Learning	—Unverified
MASR: Multi-label Aware Speech Representation	Jul 20, 2023	Emotion RecognitionLanguage Identification	—Unverified
Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning	Oct 28, 2019	ClusteringPhoneme Recognition	—Unverified
VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning	Nov 21, 2022	Audio-Visual Speech RecognitionLanguage Modelling	—Unverified
On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation	Jul 6, 2023	Keyword SpottingKnowledge Distillation	—Unverified
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding	Oct 11, 2022	Representation LearningSentence	—Unverified
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition	Jun 10, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Privacy-preserving Representation Learning for Speech Understanding	Oct 26, 2023	ClassificationEmotion Recognition	—Unverified
Privacy-Preserving Speech Representation Learning using Vector Quantization	Mar 15, 2022	Privacy PreservingQuantization	—Unverified
Progressive Residual Extraction based Pre-training for Speech Representation Learning	Aug 31, 2024	Emotion RecognitionRepresentation Learning	—Unverified
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition	May 23, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion	Nov 14, 2023	Deep LearningDiversity	—Unverified
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective	Jan 16, 2024	Representation LearningSelf-Supervised Learning	—Unverified
A Brief Overview of Unsupervised Neural Speech Representation Learning	Mar 1, 2022	Representation LearningSpeech Representation Learning	—Unverified
Wav2vec-C: A Self-supervised Model for Speech Representation Learning	Mar 9, 2021	QuantizationRepresentation Learning	—Unverified
A Comparison of Discrete Latent Variable Models for Speech Representation Learning	Oct 24, 2020	Phoneme RecognitionRepresentation Learning	—Unverified
Robust Speaker Recognition with Transformers Using wav2vec 2.0	Mar 28, 2022	Data AugmentationRepresentation Learning	—Unverified
Robust Speech Representation Learning via Flow-based Embedding Regularization	Dec 7, 2021	Deep LearningLanguage Identification	—Unverified
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation	May 17, 2022	Representation LearningRetrieval	—Unverified
Self-supervised Contrastive Video-Speech Representation Learning for Ultrasound	Aug 14, 2020	Contrastive LearningGaze Prediction	—Unverified
Self-supervised models of audio effectively explain human cortical responses to speech	May 27, 2022	Representation LearningSpeech Representation Learning	—Unverified
Self-Supervised Speech Representation Learning: A Review	May 21, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified

Show:10 25 50

← PrevPage 2 of 3Next →

No leaderboard results yet.