| Spoken Language Modeling with Duration-Penalized Self-Supervised Units | May 29, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Dynamic Tsetlin Machine Accelerators for On-Chip Training at the Edge using FPGAs | Apr 28, 2025 | Resynthesis | Unverified | 0 |
| Runtime Tunable Tsetlin Machines for Edge Inference on eFPGAs | Feb 10, 2025 | Model Compression, Resynthesis | Unverified | 0 |
| FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks | Feb 6, 2025 | Resynthesis, Voice Conversion | Unverified | 0 |
| AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder | Jan 9, 2025 | Pitch Classification, Pitch control | Code Available | 1 |
| DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models | Oct 31, 2024 | Decoder, Resynthesis | Unverified | 0 |
| A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation | Oct 29, 2024 | Resynthesis | Unverified | 0 |
| Learning Source Disentanglement in Neural Audio Codec | Sep 17, 2024 | Audio Compression, Audio Generation | Unverified | 0 |
| Automatic Voice Identification after Speech Resynthesis using PPG | Aug 5, 2024 | Resynthesis, Speaker Verification | Unverified | 0 |