| ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems | Feb 17, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning | Apr 8, 2022 | Representation LearningSelf-Supervised Learning | —Unverified | 0 | 0 |
| Characterizing the adversarial vulnerability of speech self-supervised learning | Nov 8, 2021 | Adversarial RobustnessBenchmarking | —Unverified | 0 | 0 |
| UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation | Mar 2, 2025 | DecoderRepresentation Learning | —Unverified | 0 | 0 |
| Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling | Jun 17, 2019 | Representation LearningSpeech Representation Learning | —Unverified | 0 | 0 |
| Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks | Oct 14, 2021 | Audio ClassificationRepresentation Learning | —Unverified | 0 | 0 |
| An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning | Jul 26, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning | Jun 4, 2020 | BIG-bench Machine LearningContrastive Learning | —Unverified | 0 | 0 |
| Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks | Oct 23, 2019 | Representation LearningSpeech Representation Learning | —Unverified | 0 | 0 |
| XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception | Mar 21, 2024 | Audio-Visual Speech RecognitionRepresentation Learning | —Unverified | 0 | 0 |
| Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends | Jan 2, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning | Oct 16, 2022 | Audio GenerationRepresentation Learning | —Unverified | 0 | 0 |
| Disentangled Feature Learning for Real-Time Neural Speech Coding | Nov 22, 2022 | DisentanglementRepresentation Learning | —Unverified | 0 | 0 |
| Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective | Apr 5, 2022 | DisentanglementRepresentation Learning | —Unverified | 0 | 0 |
| Disentangled Speech Representation Learning for One-Shot Cross-lingual Voice Conversion Using β-VAE | Oct 25, 2022 | DisentanglementRepresentation Learning | —Unverified | 0 | 0 |
| Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition? | May 4, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Efficiency-oriented approaches for self-supervised speech representation learning | Dec 18, 2023 | Automatic Speech RecognitionRepresentation Learning | —Unverified | 0 | 0 |
| DuRep: Dual-Mode Speech Representation Learning via ASR-Aware Distillation | May 26, 2025 | Representation LearningSpeech Representation Learning | —Unverified | 0 | 0 |
| Efficient Speech Representation Learning with Low-Bit Quantization | Dec 14, 2022 | Model CompressionQuantization | —Unverified | 0 | 0 |
| Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks | Apr 1, 2021 | DecoderRepresentation Learning | —Unverified | 0 | 0 |
| Emotion-Aware Speech Self-Supervised Representation Learning with Intensity Knowledge | Jun 10, 2024 | Representation LearningSelf-Supervised Learning | —Unverified | 0 | 0 |
| A Deep Paradigm for Articulatory Speech Representation Learning via Neural Convolutive Sparse Matrix Factorization | Jan 16, 2022 | Phoneme RecognitionRepresentation Learning | —Unverified | 0 | 0 |
| Evaluating Self-Supervised Speech Representations for Indigenous American Languages | Oct 5, 2023 | Representation LearningSpeech Representation Learning | —Unverified | 0 | 0 |
| Exploring wav2vec 2.0 on speaker verification and language identification | Dec 11, 2020 | Language IdentificationMulti-Task Learning | —Unverified | 0 | 0 |
| XTREME-S: Evaluating Cross-lingual Speech Representations | Mar 21, 2022 | Representation LearningRetrieval | —Unverified | 0 | 0 |
| Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE | Jun 6, 2022 | Representation LearningSpeech Representation Learning | —Unverified | 0 | 0 |
| Towards Learning Fine-Grained Disentangled Representations from Speech | Aug 8, 2018 | Representation LearningSpeech Representation Learning | —Unverified | 0 | 0 |
| Flowchase: a Mobile Application for Pronunciation Training | Jul 5, 2023 | Representation LearningSpeech Representation Learning | —Unverified | 0 | 0 |
| General-Purpose Speech Representation Learning through a Self-Supervised Multi-Granularity Framework | Feb 3, 2021 | ClassificationEmotion Classification | —Unverified | 0 | 0 |
| Input-independent Attention Weights Are Expressive Enough: A Study of Attention in Self-supervised Audio Transformers | Jun 9, 2020 | General ClassificationRepresentation Learning | —Unverified | 0 | 0 |
| Towards Robust Speech Representation Learning for Thousands of Languages | Jun 30, 2024 | Representation LearningSelf-Supervised Learning | —Unverified | 0 | 0 |