Speech Representation Learning

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 76–100 of 131 papers

Title	Date	Tasks	Status
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective	Apr 5, 2022	DisentanglementRepresentation Learning	—Unverified
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE	Oct 25, 2022	DisentanglementRepresentation Learning	—Unverified
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?	May 4, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Efficiency-oriented approaches for self-supervised speech representation learning	Dec 18, 2023	Automatic Speech RecognitionRepresentation Learning	—Unverified
MASR: Multi-label Aware Speech Representation	Jul 20, 2023	Emotion RecognitionLanguage Identification	—Unverified
Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning	Oct 28, 2019	ClusteringPhoneme Recognition	—Unverified
VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning	Nov 21, 2022	Audio-Visual Speech RecognitionLanguage Modelling	—Unverified
On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation	Jul 6, 2023	Keyword SpottingKnowledge Distillation	—Unverified
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding	Oct 11, 2022	Representation LearningSentence	—Unverified
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition	Jun 10, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Privacy-preserving Representation Learning for Speech Understanding	Oct 26, 2023	ClassificationEmotion Recognition	—Unverified
Privacy-Preserving Speech Representation Learning using Vector Quantization	Mar 15, 2022	Privacy PreservingQuantization	—Unverified
Progressive Residual Extraction based Pre-training for Speech Representation Learning	Aug 31, 2024	Emotion RecognitionRepresentation Learning	—Unverified
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition	May 23, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion	Nov 14, 2023	Deep LearningDiversity	—Unverified
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective	Jan 16, 2024	Representation LearningSelf-Supervised Learning	—Unverified
A Brief Overview of Unsupervised Neural Speech Representation Learning	Mar 1, 2022	Representation LearningSpeech Representation Learning	—Unverified
Wav2vec-C: A Self-supervised Model for Speech Representation Learning	Mar 9, 2021	QuantizationRepresentation Learning	—Unverified
A Comparison of Discrete Latent Variable Models for Speech Representation Learning	Oct 24, 2020	Phoneme RecognitionRepresentation Learning	—Unverified
Robust Speaker Recognition with Transformers Using wav2vec 2.0	Mar 28, 2022	Data AugmentationRepresentation Learning	—Unverified
Robust Speech Representation Learning via Flow-based Embedding Regularization	Dec 7, 2021	Deep LearningLanguage Identification	—Unverified
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation	May 17, 2022	Representation LearningRetrieval	—Unverified
Self-supervised Contrastive Video-Speech Representation Learning for Ultrasound	Aug 14, 2020	Contrastive LearningGaze Prediction	—Unverified
Self-supervised models of audio effectively explain human cortical responses to speech	May 27, 2022	Representation LearningSpeech Representation Learning	—Unverified
Self-Supervised Speech Representation Learning: A Review	May 21, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified

Show:10 25 50

← PrevPage 4 of 6Next →

No leaderboard results yet.