| Self-supervised speech representation learning for keyword-spotting with light-weight transformers | Mar 7, 2023 | Keyword Spotting, Representation Learning | Unverified | 0 |
| UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization | Jan 26, 2024 | Decoder, Domain Adaptation | Unverified | 0 |
| Similarity Analysis of Self-Supervised Speech Representations | Oct 22, 2020 | Representation Learning, Speech Representation Learning | Unverified | 0 |
| Simultaneous or Sequential Training? How Speech Representations Cooperate in a Multi-Task Self-Supervised Learning System | Jun 5, 2023 | Multi-Task Learning, Representation Learning | Unverified | 0 |
| Universal Semantic Disentangled Privacy-preserving Speech Representation Learning | May 19, 2025 | Decoder, Privacy Preserving | Unverified | 0 |
| Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio | Oct 17, 2023 | Representation Learning, Self-Supervised Learning | Unverified | 0 |
| Speech representation learning: Learning bidirectional encoders with single-view, multi-view, and multi-task methods | Jul 25, 2023 | MULTI-VIEW LEARNING, Representation Learning | Unverified | 0 |
| Speech Representation Learning Revisited: The Necessity of Separate Learnable Parameters and Robust Data Augmentation | Aug 20, 2024 | Data Augmentation, Representation Learning | Unverified | 0 |
| Adversarially learning disentangled speech representations for robust multi-factor voice conversion | Jan 30, 2021 | Representation Learning, Rhythm | Unverified | 0 |
| An empirical study on speech restoration guided by self supervised speech representation | May 30, 2023 | Representation Learning, Speech Representation Learning | Unverified | 0 |
| A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition | Jan 22, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning | Oct 18, 2021 | Multi-Task Learning, Representation Learning | Unverified | 0 |
| Application of Knowledge Distillation to Multi-task Speech Representation Learning | Oct 29, 2022 | Keyword Spotting, Knowledge Distillation | Unverified | 0 |
| Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models | Sep 21, 2024 | DeepFake Detection, Face Swapping | Unverified | 0 |
| ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems | Feb 17, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning | Apr 8, 2022 | Representation Learning, Self-Supervised Learning | Unverified | 0 |
| Characterizing the adversarial vulnerability of speech self-supervised learning | Nov 8, 2021 | Adversarial Robustness, Benchmarking | Unverified | 0 |
| k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning | Nov 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning | Apr 8, 2022 | Contrastive Learning, Data Augmentation | Code Available | 0 |
| MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning | Oct 17, 2023 | Disentanglement, Representation Learning | Code Available | 0 |
| Sampling strategies in Siamese Networks for unsupervised speech representation learning | Apr 30, 2018 | Representation Learning, Speech Representation Learning | Code Available | 0 |
| Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition | Apr 1, 2022 | Phoneme Recognition, Representation Learning | Code Available | 0 |
| PADA: Pruning Assisted Domain Adaptation for Self-Supervised Speech Representations | Mar 31, 2022 | Domain Adaptation, Language Modelling | Code Available | 0 |
| A multimodal dynamical variational autoencoder for audiovisual speech representation learning | May 5, 2023 | Denoising, Disentanglement | Code Available | 0 |
| Pretext Tasks selection for multitask self-supervised speech representation learning | Jul 1, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |