| textless-lib: a Library for Textless Spoken Language Processing | Feb 15, 2022 | Resynthesis | CodeCode Available | 2 |
| AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder | Jan 9, 2025 | Pitch ClassificationPitch control | CodeCode Available | 1 |
| EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models | Dec 21, 2023 | ResynthesisSpeech-to-Speech Translation | CodeCode Available | 1 |
| Speaker-Independent Acoustic-to-Articulatory Speech Inversion | Feb 14, 2023 | Resynthesis | CodeCode Available | 1 |
| Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling | Jan 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Differentiable Time-Frequency Scattering on GPU | Apr 18, 2022 | Audio GenerationCPU | CodeCode Available | 1 |
| Speech Resynthesis from Discrete Disentangled Self-Supervised Representations | Apr 1, 2021 | DisentanglementRepresentation Learning | CodeCode Available | 1 |
| Generative Spoken Language Modeling from Raw Audio | Feb 1, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Dynamical Variational Autoencoders: A Comprehensive Review | Aug 28, 2020 | 3D Human DynamicsResynthesis | CodeCode Available | 1 |
| Spoken Language Modeling with Duration-Penalized Self-Supervised Units | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dynamic Tsetlin Machine Accelerators for On-Chip Training at the Edge using FPGAs | Apr 28, 2025 | Resynthesis | —Unverified | 0 |
| Runtime Tunable Tsetlin Machines for Edge Inference on eFPGAs | Feb 10, 2025 | Model CompressionResynthesis | —Unverified | 0 |
| FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks | Feb 6, 2025 | ResynthesisVoice Conversion | —Unverified | 0 |
| DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models | Oct 31, 2024 | DecoderResynthesis | —Unverified | 0 |
| A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation | Oct 29, 2024 | Resynthesis | —Unverified | 0 |
| Learning Source Disentanglement in Neural Audio Codec | Sep 17, 2024 | Audio CompressionAudio Generation | —Unverified | 0 |
| Automatic Voice Identification after Speech Resynthesis using PPG | Aug 5, 2024 | ResynthesisSpeaker Verification | —Unverified | 0 |
| Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation | Jul 8, 2024 | Automatic Speech RecognitionEmotion Recognition | —Unverified | 0 |
| On the Parameter Estimation of Sinusoidal Models for Speech and Audio Signals | Jan 2, 2024 | parameter estimationResynthesis | —Unverified | 0 |
| Noise Morphing for Audio Time Stretching | Dec 22, 2023 | Resynthesis | —Unverified | 0 |
| AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement | Sep 14, 2023 | ResynthesisSpeech Enhancement | —Unverified | 0 |
| Evaluation of the Speech Resynthesis Capabilities of the VoicePrivacy Challenge Baseline B1 | Aug 22, 2023 | ResynthesisSpeaker anonymization | —Unverified | 0 |
| EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis | Aug 10, 2023 | ResynthesisSpeech Synthesis | —Unverified | 0 |
| Weakly-supervised Contrastive Learning for Unsupervised Object Discovery | Jul 7, 2023 | Contrastive LearningImage Reconstruction | CodeCode Available | 0 |