SOTAVerified

Speech Representation Learning

Papers

Showing 51-100 of 131 papers

Title | Status | Hype
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective | - | 0
Efficiency-oriented approaches for self-supervised speech representation learning | - | 0
Reimagining Speech: A Scoping Review of Deep Learning-Powered Voice Conversion | - | 0
Learning Disentangled Speech Representations | - | 0
Privacy-preserving Representation Learning for Speech Understanding | - | 0
Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio | - | 0
MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning | Code | 0
Evaluating Self-Supervised Speech Representations for Indigenous American Languages | - | 0
Speech representation learning: Learning bidirectional encoders with single-view, multi-view, and multi-task methods | - | 0
MASR: Multi-label Aware Speech Representation | - | 0
On-Device Constrained Self-Supervised Speech Representation Learning for Keyword Spotting via Knowledge Distillation | - | 0
Flowchase: a Mobile Application for Pronunciation Training | - | 0
Label Aware Speech Representation Learning For Language Identification | - | 0
Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System | - | 0
An empirical study on speech restoration guided by self supervised speech representation | - | 0
INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition | - | 0
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition | - | 0
A multimodal dynamical variational autoencoder for audiovisual speech representation learning | Code | 0
Learning Cross-lingual Visual Speech Representations | - | 0
Self-supervised speech representation learning for keyword-spotting with light-weight transformers | - | 0
A low latency attention module for streaming self-supervised speech representation learning | Code | 0
Efficient Speech Representation Learning with Low-Bit Quantization | - | 0
Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information | - | 0
Disentangled Feature Learning for Real-Time Neural Speech Coding | - | 0
VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning | - | 0
Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement | - | 0
Application of Knowledge Distillation to Multi-task Speech Representation Learning | - | 0
Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE | - | 0
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach | - | 0
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning | - | 0
Experiments on Turkish ASR with Self-Supervised Speech Representation Learning | - | 0
On the Use of Semantically-Aligned Speech Representations for Spoken Language Understanding | - | 0
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE | - | 0
Self-supervised models of audio effectively explain human cortical responses to speech | - | 0
Self-Supervised Speech Representation Learning: A Review | - | 0
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation | - | 0
Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning | Code | 0
Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning | - | 0
Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective | - | 0
Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition | Code | 0
PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations | Code | 0
Robust Speaker Recognition with Transformers Using wav2vec 2.0 | - | 0
Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation | - | 0
XTREME-S: Evaluating Cross-lingual Speech Representations | - | 0
Privacy-Preserving Speech Representation Learning using Vector Quantization | - | 0
Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks | - | 0
A Brief Overview of Unsupervised Neural Speech Representation Learning | - | 0
A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition | - | 0
A Deep Paradigm for Articulatory Speech Representation Learning via Neural Convolutive Sparse Matrix Factorization | - | 0
Robust Speech Representation Learning via Flow-based Embedding Regularization | - | 0

No leaderboard results yet.