Speech Representation Learning

Papers


Showing 51–100 of 131 papers

Title	Date	Tasks	Status
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective	Jan 16, 2024	Representation Learning, Self-Supervised Learning	Unverified
Efficiency-oriented approaches for self-supervised speech representation learning	Dec 18, 2023	Automatic Speech Recognition, Representation Learning	Unverified
Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion	Nov 14, 2023	Deep Learning, Diversity	Unverified
Learning Disentangled Speech Representations	Nov 4, 2023	Benchmarking, Disentanglement	Unverified
Privacy-preserving Representation Learning for Speech Understanding	Oct 26, 2023	Classification, Emotion Recognition	Unverified
Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio	Oct 17, 2023	Representation Learning, Self-Supervised Learning	Unverified
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning	Oct 17, 2023	Disentanglement, Representation Learning	Code Available
Evaluating Self-Supervised Speech Representations for Indigenous American Languages	Oct 5, 2023	Representation Learning, Speech Representation Learning	Unverified
Speech representation learning: Learning bidirectional encoders with single-view, multi-view, and multi-task methods	Jul 25, 2023	MULTI-VIEW LEARNING, Representation Learning	Unverified
MASR: Multi-label Aware Speech Representation	Jul 20, 2023	Emotion Recognition, Language Identification	Unverified
On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation	Jul 6, 2023	Keyword Spotting, Knowledge Distillation	Unverified
Flowchase: a Mobile Application for Pronunciation Training	Jul 5, 2023	Representation Learning, Speech Representation Learning	Unverified
Label Aware Speech Representation Learning For Language Identification	Jun 7, 2023	Language Identification, Missing Labels	Unverified
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System	Jun 5, 2023	Multi-Task Learning, Representation Learning	Unverified
An empirical study on speech restoration guided by self supervised speech representation	May 30, 2023	Representation Learning, Speech Representation Learning	Unverified
INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition	May 25, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition	May 23, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
A multimodal dynamical variational autoencoder for audiovisual speech representation learning	May 5, 2023	Denoising, Disentanglement	Code Available
Learning Cross-lingual Visual Speech Representations	Mar 14, 2023	Representation Learning, Self-Supervised Learning	Unverified
Self-supervised speech representation learning for keyword-spotting with light-weight transformers	Mar 7, 2023	Keyword Spotting, Representation Learning	Unverified
A low latency attention module for streaming self-supervised speech representation learning	Feb 27, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available
Efficient Speech Representation Learning with Low-Bit Quantization	Dec 14, 2022	Model Compression, Quantization	Unverified
Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information	Dec 7, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Disentangled Feature Learning for Real-Time Neural Speech Coding	Nov 22, 2022	Disentanglement, Representation Learning	Unverified
VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning	Nov 21, 2022	Audio-Visual Speech Recognition, Language Modelling	Unverified
Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement	Nov 12, 2022	Data Augmentation, Emotion Recognition	Unverified
Application of Knowledge Distillation to Multi-task Speech Representation Learning	Oct 29, 2022	Keyword Spotting, Knowledge Distillation	Unverified
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE	Oct 25, 2022	Disentanglement, Representation Learning	Unverified
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach	Oct 25, 2022	Representation Learning, Speaker Recognition	Unverified
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning	Oct 16, 2022	Audio Generation, Representation Learning	Unverified
Experiments on Turkish ASR with Self-Supervised Speech Representation Learning	Oct 13, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding	Oct 11, 2022	Representation Learning, Sentence	Unverified
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE	Jun 6, 2022	Representation Learning, Speech Representation Learning	Unverified
Self-supervised models of audio effectively explain human cortical responses to speech	May 27, 2022	Representation Learning, Speech Representation Learning	Unverified
Self-Supervised Speech Representation Learning: A Review	May 21, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation	May 17, 2022	Representation Learning, Retrieval	Unverified
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning	Apr 8, 2022	Contrastive Learning, Data Augmentation	Code Available
Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning	Apr 8, 2022	Representation Learning, Self-Supervised Learning	Unverified
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective	Apr 5, 2022	Disentanglement, Representation Learning	Unverified
Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition	Apr 1, 2022	Phoneme Recognition, Representation Learning	Code Available
PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations	Mar 31, 2022	Domain Adaptation, Language Modelling	Code Available
Robust Speaker Recognition with Transformers Using wav2vec 2.0	Mar 28, 2022	Data Augmentation, Representation Learning	Unverified
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation	Mar 24, 2022	Representation Learning, Speech Representation Learning	Unverified
XTREME-S: Evaluating Cross-lingual Speech Representations	Mar 21, 2022	Representation Learning, Retrieval	Unverified
Privacy-Preserving Speech Representation Learning using Vector Quantization	Mar 15, 2022	Privacy Preserving, Quantization	Unverified
Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks	Mar 9, 2022	Representation Learning, speech-recognition	Unverified
A Brief Overview of Unsupervised Neural Speech Representation Learning	Mar 1, 2022	Representation Learning, Speech Representation Learning	Unverified
A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition	Jan 22, 2022	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
A Deep Paradigm for Articulatory Speech Representation Learning via Neural Convolutive Sparse Matrix Factorization	Jan 16, 2022	Phoneme Recognition, Representation Learning	Unverified
Robust Speech Representation Learning via Flow-based Embedding Regularization	Dec 7, 2021	Deep Learning, Language Identification	Unverified

