SOTAVerified

Speech Representation Learning

Papers

Showing 101125 of 131 papers

TitleStatusHype
Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models0
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems0
Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning0
Characterizing the adversarial vulnerability of speech self-supervised learning0
UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation0
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling0
Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks0
An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning0
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning0
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks0
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception0
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends0
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning0
Disentangled Feature Learning for Real-Time Neural Speech Coding0
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective0
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE0
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?0
Efficiency-oriented approaches for self-supervised speech representation learning0
DuRep: Dual-Mode Speech Representation Learning via ASR-Aware Distillation0
Efficient Speech Representation Learning with Low-Bit Quantization0
Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks0
Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge0
A Deep Paradigm for Articulatory Speech Representation Learning via Neural Convolutive Sparse Matrix Factorization0
Evaluating Self-Supervised Speech Representations for Indigenous American Languages0
Exploring wav2vec 2.0 on speaker verification and language identification0
Show:102550
← PrevPage 5 of 6Next →

No leaderboard results yet.