Speech Representation Learning

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 26–50 of 131 papers

Title	Date	Tasks	Status	Hype
CLARA: Multilingual Contrastive Learning for Audio Representation Acquisition	Oct 18, 2023	Audio ClassificationContrastive Learning	CodeCode Available	1
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning	Oct 17, 2023	DisentanglementRepresentation Learning	CodeCode Available	0
Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio	Oct 17, 2023	Representation LearningSelf-Supervised Learning	—Unverified	0
Evaluating Self-Supervised Speech Representations for Indigenous American Languages	Oct 5, 2023	Representation LearningSpeech Representation Learning	—Unverified	0
Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning	Sep 25, 2023	Representation LearningSelf-Supervised Learning	CodeCode Available	1
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation Learning	Aug 31, 2023	Representation LearningSpeech Representation Learning	CodeCode Available	1
Speech representation learning: Learning bidirectional encoders with single-view, multi-view, and multi-task methods	Jul 25, 2023	MULTI-VIEW LEARNINGRepresentation Learning	—Unverified	0
MASR: Multi-label Aware Speech Representation	Jul 20, 2023	Emotion RecognitionLanguage Identification	—Unverified	0
On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation	Jul 6, 2023	Keyword SpottingKnowledge Distillation	—Unverified	0
Flowchase: a Mobile Application for Pronunciation Training	Jul 5, 2023	Representation LearningSpeech Representation Learning	—Unverified	0
Label Aware Speech Representation Learning For Language Identification	Jun 7, 2023	Language IdentificationMissing Labels	—Unverified	0
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System	Jun 5, 2023	Multi-Task LearningRepresentation Learning	—Unverified	0
An empirical study on speech restoration guided by self supervised speech representation	May 30, 2023	Representation LearningSpeech Representation Learning	—Unverified	0
INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition	May 25, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition	May 23, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning	May 17, 2023	ClusteringLanguage Modeling	CodeCode Available	1
A multimodal dynamical variational autoencoder for audiovisual speech representation learning	May 5, 2023	DenoisingDisentanglement	CodeCode Available	0
Learning Cross-lingual Visual Speech Representations	Mar 14, 2023	Representation LearningSelf-Supervised Learning	—Unverified	0
FaceXHuBERT: Text-less Speech-driven E(X)pressive 3D Facial Animation Synthesis Using Self-Supervised Speech Representation Learning	Mar 9, 2023	3D Face AnimationRepresentation Learning	CodeCode Available	1
Self-supervised speech representation learning for keyword-spotting with light-weight transformers	Mar 7, 2023	Keyword SpottingRepresentation Learning	—Unverified	0
Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding	Feb 27, 2023	Model CompressionRepresentation Learning	CodeCode Available	1
A low latency attention module for streaming self-supervised speech representation learning	Feb 27, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	0
Efficient Speech Representation Learning with Low-Bit Quantization	Dec 14, 2022	Model CompressionQuantization	—Unverified	0
Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information	Dec 7, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Disentangled Feature Learning for Real-Time Neural Speech Coding	Nov 22, 2022	DisentanglementRepresentation Learning	—Unverified	0

Show:10 25 50

← PrevPage 2 of 6Next →

No leaderboard results yet.