Speech Representation Learning

Papers

Title	Date	Tasks	Status
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems	Feb 17, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning	Apr 8, 2022	Representation Learning, Self-Supervised Learning	Unverified
Characterizing the adversarial vulnerability of speech self-supervised learning	Nov 8, 2021	Adversarial Robustness, Benchmarking	Unverified
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation	Mar 2, 2025	Decoder, Representation Learning	Unverified
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling	Jun 17, 2019	Representation Learning, Speech Representation Learning	Unverified
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks	Oct 14, 2021	Audio Classification, Representation Learning	Unverified
An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning	Jul 26, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning	Jun 4, 2020	BIG-bench Machine Learning, Contrastive Learning	Unverified
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks	Oct 23, 2019	Representation Learning, Speech Representation Learning	Unverified
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception	Mar 21, 2024	Audio-Visual Speech Recognition, Representation Learning	Unverified
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends	Jan 2, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning	Oct 16, 2022	Audio Generation, Representation Learning	Unverified
Disentangled Feature Learning for Real-Time Neural Speech Coding	Nov 22, 2022	Disentanglement, Representation Learning	Unverified
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective	Apr 5, 2022	Disentanglement, Representation Learning	Unverified
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE	Oct 25, 2022	Disentanglement, Representation Learning	Unverified
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?	May 4, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Efficiency-oriented approaches for self-supervised speech representation learning	Dec 18, 2023	Automatic Speech Recognition, Representation Learning	Unverified
DuRep: Dual-Mode Speech Representation Learning via ASR-Aware Distillation	May 26, 2025	Representation Learning, Speech Representation Learning	Unverified
Efficient Speech Representation Learning with Low-Bit Quantization	Dec 14, 2022	Model Compression, Quantization	Unverified
Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks	Apr 1, 2021	Decoder, Representation Learning	Unverified
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge	Jun 10, 2024	Representation Learning, Self-Supervised Learning	Unverified
A Deep Paradigm for Articulatory Speech Representation Learning via Neural Convolutive Sparse Matrix Factorization	Jan 16, 2022	Phoneme Recognition, Representation Learning	Unverified
Evaluating Self-Supervised Speech Representations for Indigenous American Languages	Oct 5, 2023	Representation Learning, Speech Representation Learning	Unverified
Exploring wav2vec 2.0 on speaker verification and language identification	Dec 11, 2020	Language Identification, Multi-Task Learning	Unverified
XTREME-S: Evaluating Cross-lingual Speech Representations	Mar 21, 2022	Representation Learning, Retrieval	Unverified
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE	Jun 6, 2022	Representation Learning, Speech Representation Learning	Unverified
Towards Learning Fine-Grained Disentangled Representations from Speech	Aug 8, 2018	Representation Learning, Speech Representation Learning	Unverified
Flowchase: a Mobile Application for Pronunciation Training	Jul 5, 2023	Representation Learning, Speech Representation Learning	Unverified
General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework	Feb 3, 2021	Classification, Emotion Classification	Unverified
Input-independent Attention Weights Are Expressive Enough: A Study of Attention in Self-supervised Audio Transformers	Jun 9, 2020	General Classification, Representation Learning	Unverified
Towards Robust Speech Representation Learning for Thousands of Languages	Jun 30, 2024	Representation Learning, Self-Supervised Learning	Unverified
