| A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition | Jan 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speech Representation Learning Through Self-supervised Pretraining And Multi-task Finetuning | Oct 18, 2021 | Multi-Task LearningRepresentation Learning | —Unverified | 0 |
| Application of Knowledge Distillation to Multi-task Speech Representation Learning | Oct 29, 2022 | Keyword SpottingKnowledge Distillation | —Unverified | 0 |
| Are Music Foundation Models Better at Singing Voice Deepfake Detection? Far-Better Fuse them with Speech Foundation Models | Sep 21, 2024 | DeepFake DetectionFace Swapping | —Unverified | 0 |
| ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems | Feb 17, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning | Apr 8, 2022 | Representation LearningSelf-Supervised Learning | —Unverified | 0 |
| Characterizing the adversarial vulnerability of speech self-supervised learning | Nov 8, 2021 | Adversarial RobustnessBenchmarking | —Unverified | 0 |
| UniWav: Towards Unified Pre-training for Speech Representation Learning and Generation | Mar 2, 2025 | DecoderRepresentation Learning | —Unverified | 0 |
| A low latency attention module for streaming self-supervised speech representation learning | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Conditional independence for pretext task selection in Self-supervised speech representation learning | Apr 15, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |