| Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data | Jun 29, 2023 | Machine TranslationProsody Prediction | —Unverified | 0 | 0 |
| Learning Source Disentanglement in Neural Audio Codec | Sep 17, 2024 | Audio CompressionAudio Generation | —Unverified | 0 | 0 |
| Noise Morphing for Audio Time Stretching | Dec 22, 2023 | Resynthesis | —Unverified | 0 | 0 |
| On the Parameter Estimation of Sinusoidal Models for Speech and Audio Signals | Jan 2, 2024 | parameter estimationResynthesis | —Unverified | 0 | 0 |
| A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation | Oct 29, 2024 | Resynthesis | —Unverified | 0 | 0 |
| ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement | Dec 21, 2022 | Audio-Visual Speech RecognitionResynthesis | —Unverified | 0 | 0 |
| ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration | Jan 1, 2023 | Audio-Visual Speech RecognitionResynthesis | —Unverified | 0 | 0 |
| Runtime Tunable Tsetlin Machines for Edge Inference on eFPGAs | Feb 10, 2025 | Model CompressionResynthesis | —Unverified | 0 | 0 |
| Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge | Oct 27, 2022 | Acoustic Unit DiscoveryLanguage Modeling | —Unverified | 0 | 0 |