SOTAVerified

Speech Representation Learning

Papers

Showing 150 of 131 papers

TitleStatusHype
HYFuse: Aligning Heterogeneous Speech Pre-Trained Representations in Hyperbolic Space for Speech Emotion Recognition0
DuRep: Dual-Mode Speech Representation Learning via ASR-Aware Distillation0
Universal Semantic Disentangled Privacy-preserving Speech Representation Learning0
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation0
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech RepresentationCode1
k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation LearningCode0
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation LearningCode1
JOOCI: a Framework for Learning Comprehensive Speech Representations0
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models0
Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERTCode1
Progressive Residual Extraction based Pre-training for Speech Representation Learning0
Speech Representation Learning Revisited: The Necessity of Separate Learnable Parameters and Robust Data Augmentation0
Towards the Next Frontier in Speech Representation Learning Using Disentanglement0
Towards Robust Speech Representation Learning for Thousands of Languages0
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge0
mHuBERT-147: A Compact Multilingual HuBERT ModelCode0
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception0
An Efficient End-to-End Approach to Noise Invariant Speech Features via Multi-Task LearningCode0
The Effect of Batch Size on Contrastive Self-Supervised Speech Representation LearningCode1
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization0
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective0
Efficiency-oriented approaches for self-supervised speech representation learning0
Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion0
Learning Disentangled Speech Representations0
Privacy-preserving Representation Learning for Speech Understanding0
CLARA: Multilingual Contrastive Learning for Audio Representation AcquisitionCode1
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation LearningCode0
Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio0
Evaluating Self-Supervised Speech Representations for Indigenous American Languages0
Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation LearningCode1
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation LearningCode1
Speech representation learning: Learning bidirectional encoders with single-view, multi-view, and multi-task methods0
MASR: Multi-label Aware Speech Representation0
On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation0
Flowchase: a Mobile Application for Pronunciation Training0
Label Aware Speech Representation Learning For Language Identification0
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System0
An empirical study on speech restoration guided by self supervised speech representation0
INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition0
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition0
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation LearningCode1
A multimodal dynamical variational autoencoder for audiovisual speech representation learningCode0
Learning Cross-lingual Visual Speech Representations0
FaceXHuBERT: Text-less Speech-driven E(X)pressive 3D Facial Animation Synthesis Using Self-Supervised Speech Representation LearningCode1
Self-supervised speech representation learning for keyword-spotting with light-weight transformers0
Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and UnderstandingCode1
A low latency attention module for streaming self-supervised speech representation learningCode0
Efficient Speech Representation Learning with Low-Bit Quantization0
Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information0
Disentangled Feature Learning for Real-Time Neural Speech Coding0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.