Speech Representation Learning

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–125 of 131 papers

Title	Date	Tasks	Status
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models	Sep 21, 2024	DeepFake DetectionFace Swapping	—Unverified
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems	Feb 17, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning	Apr 8, 2022	Representation LearningSelf-Supervised Learning	—Unverified
Characterizing the adversarial vulnerability of speech self-supervised learning	Nov 8, 2021	Adversarial RobustnessBenchmarking	—Unverified
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation	Mar 2, 2025	DecoderRepresentation Learning	—Unverified
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling	Jun 17, 2019	Representation LearningSpeech Representation Learning	—Unverified
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks	Oct 14, 2021	Audio ClassificationRepresentation Learning	—Unverified
An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning	Jul 26, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning	Jun 4, 2020	BIG-bench Machine LearningContrastive Learning	—Unverified
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks	Oct 23, 2019	Representation LearningSpeech Representation Learning	—Unverified
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception	Mar 21, 2024	Audio-Visual Speech RecognitionRepresentation Learning	—Unverified
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends	Jan 2, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning	Oct 16, 2022	Audio GenerationRepresentation Learning	—Unverified
Disentangled Feature Learning for Real-Time Neural Speech Coding	Nov 22, 2022	DisentanglementRepresentation Learning	—Unverified
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective	Apr 5, 2022	DisentanglementRepresentation Learning	—Unverified
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE	Oct 25, 2022	DisentanglementRepresentation Learning	—Unverified
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?	May 4, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Efficiency-oriented approaches for self-supervised speech representation learning	Dec 18, 2023	Automatic Speech RecognitionRepresentation Learning	—Unverified
DuRep: Dual-Mode Speech Representation Learning via ASR-Aware Distillation	May 26, 2025	Representation LearningSpeech Representation Learning	—Unverified
Efficient Speech Representation Learning with Low-Bit Quantization	Dec 14, 2022	Model CompressionQuantization	—Unverified
Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks	Apr 1, 2021	DecoderRepresentation Learning	—Unverified
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge	Jun 10, 2024	Representation LearningSelf-Supervised Learning	—Unverified
A Deep Paradigm for Articulatory Speech Representation Learning via Neural Convolutive Sparse Matrix Factorization	Jan 16, 2022	Phoneme RecognitionRepresentation Learning	—Unverified
Evaluating Self-Supervised Speech Representations for Indigenous American Languages	Oct 5, 2023	Representation LearningSpeech Representation Learning	—Unverified
Exploring wav2vec 2.0 on speaker verification and language identification	Dec 11, 2020	Language IdentificationMulti-Task Learning	—Unverified

Show:10 25 50

← PrevPage 5 of 6Next →

No leaderboard results yet.