| textless-lib: a Library for Textless Spoken Language Processing | Feb 15, 2022 | Resynthesis | CodeCode Available | 2 |
| AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder | Jan 9, 2025 | Pitch ClassificationPitch control | CodeCode Available | 1 |
| EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models | Dec 21, 2023 | ResynthesisSpeech-to-Speech Translation | CodeCode Available | 1 |
| Speaker-Independent Acoustic-to-Articulatory Speech Inversion | Feb 14, 2023 | Resynthesis | CodeCode Available | 1 |
| Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling | Jan 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Differentiable Time-Frequency Scattering on GPU | Apr 18, 2022 | Audio GenerationCPU | CodeCode Available | 1 |
| Speech Resynthesis from Discrete Disentangled Self-Supervised Representations | Apr 1, 2021 | DisentanglementRepresentation Learning | CodeCode Available | 1 |
| Generative Spoken Language Modeling from Raw Audio | Feb 1, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Dynamical Variational Autoencoders: A Comprehensive Review | Aug 28, 2020 | 3D Human DynamicsResynthesis | CodeCode Available | 1 |
| Spoken Language Modeling with Duration-Penalized Self-Supervised Units | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |