Speech Representation Learning

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–125 of 131 papers

Title	Date	Tasks	Status
Self-supervised speech representation learning for keyword-spotting with light-weight transformers	Mar 7, 2023	Keyword SpottingRepresentation Learning	—Unverified
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization	Jan 26, 2024	DecoderDomain Adaptation	—Unverified
Similarity Analysis of Self-Supervised Speech Representations	Oct 22, 2020	Representation LearningSpeech Representation Learning	—Unverified
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System	Jun 5, 2023	Multi-Task LearningRepresentation Learning	—Unverified
Universal Semantic Disentangled Privacy-preserving Speech Representation Learning	May 19, 2025	DecoderPrivacy Preserving	—Unverified
Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio	Oct 17, 2023	Representation LearningSelf-Supervised Learning	—Unverified
Speech representation learning: Learning bidirectional encoders with single-view, multi-view, and multi-task methods	Jul 25, 2023	MULTI-VIEW LEARNINGRepresentation Learning	—Unverified
Speech Representation Learning Revisited: The Necessity of Separate Learnable Parameters and Robust Data Augmentation	Aug 20, 2024	Data AugmentationRepresentation Learning	—Unverified
Adversarially learning disentangled speech representations for robust multi-factor voice conversion	Jan 30, 2021	Representation LearningRhythm	—Unverified
An empirical study on speech restoration guided by self supervised speech representation	May 30, 2023	Representation LearningSpeech Representation Learning	—Unverified
A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition	Jan 22, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning	Oct 18, 2021	Multi-Task LearningRepresentation Learning	—Unverified
Application of Knowledge Distillation to Multi-task Speech Representation Learning	Oct 29, 2022	Keyword SpottingKnowledge Distillation	—Unverified
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models	Sep 21, 2024	DeepFake DetectionFace Swapping	—Unverified
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems	Feb 17, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning	Apr 8, 2022	Representation LearningSelf-Supervised Learning	—Unverified
Characterizing the adversarial vulnerability of speech self-supervised learning	Nov 8, 2021	Adversarial RobustnessBenchmarking	—Unverified
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation	Mar 2, 2025	DecoderRepresentation Learning	—Unverified
A low latency attention module for streaming self-supervised speech representation learning	Feb 27, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Conditional independence for pretext task selection in Self-supervised speech representation learning	Apr 15, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
mHuBERT-147: A Compact Multilingual HuBERT Model	Jun 10, 2024	Automatic Speech Recognition (ASR)Diversity	CodeCode Available
Mockingjay: Unsupervised Speech Representation Learning with Deep Bidirectional Transformer Encoders	Oct 25, 2019	General ClassificationRepresentation Learning	CodeCode Available
DistilHuBERT: Speech Representation Learning by Layer-wise Distillation of Hidden-unit BERT	Oct 5, 2021	Multi-Task LearningRepresentation Learning	CodeCode Available
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning	Apr 8, 2022	Contrastive LearningData Augmentation	CodeCode Available
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning	Oct 17, 2023	DisentanglementRepresentation Learning	CodeCode Available

Show:10 25 50

← PrevPage 5 of 6Next →

No leaderboard results yet.