| Improved Self-Supervised Multilingual Speech Representation Learning Combined with Auxiliary Language Information | Dec 7, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction | Oct 28, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach | Oct 25, 2022 | Representation LearningSpeaker Recognition | —Unverified | 0 |
| Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement | Nov 12, 2022 | Data AugmentationEmotion Recognition | —Unverified | 0 |
| Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation | Jun 17, 2019 | ClusteringRepresentation Learning | —Unverified | 0 |
| INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition | May 25, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| JOOCI: a Framework for Learning Comprehensive Speech Representations | Oct 14, 2024 | Representation LearningSpeech Representation Learning | —Unverified | 0 |
| UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation | Mar 2, 2025 | DecoderRepresentation Learning | —Unverified | 0 |
| Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling | Jun 17, 2019 | Representation LearningSpeech Representation Learning | —Unverified | 0 |
| Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks | Oct 14, 2021 | Audio ClassificationRepresentation Learning | —Unverified | 0 |
| An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning | Jul 26, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning | Jun 4, 2020 | BIG-bench Machine LearningContrastive Learning | —Unverified | 0 |
| Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks | Oct 23, 2019 | Representation LearningSpeech Representation Learning | —Unverified | 0 |
| XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception | Mar 21, 2024 | Audio-Visual Speech RecognitionRepresentation Learning | —Unverified | 0 |
| Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends | Jan 2, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning | Oct 16, 2022 | Audio GenerationRepresentation Learning | —Unverified | 0 |
| Disentangled Feature Learning for Real-Time Neural Speech Coding | Nov 22, 2022 | DisentanglementRepresentation Learning | —Unverified | 0 |
| Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective | Apr 5, 2022 | DisentanglementRepresentation Learning | —Unverified | 0 |
| Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE | Oct 25, 2022 | DisentanglementRepresentation Learning | —Unverified | 0 |
| Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition? | May 4, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficiency-oriented approaches for self-supervised speech representation learning | Dec 18, 2023 | Automatic Speech RecognitionRepresentation Learning | —Unverified | 0 |
| Label Aware Speech Representation Learning For Language Identification | Jun 7, 2023 | Language IdentificationMissing Labels | —Unverified | 0 |
| Language Adaptive Cross-lingual Speech Representation Learning with Sparse Sharing Sub-networks | Mar 9, 2022 | Representation Learningspeech-recognition | —Unverified | 0 |
| A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning | Jun 3, 2020 | Representation LearningSelf-Supervised Learning | —Unverified | 0 |
| Learning Cross-lingual Visual Speech Representations | Mar 14, 2023 | Representation LearningSelf-Supervised Learning | —Unverified | 0 |