| textless-lib: a Library for Textless Spoken Language Processing | Feb 15, 2022 | Resynthesis | CodeCode Available | 2 |
| AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder | Jan 9, 2025 | Pitch ClassificationPitch control | CodeCode Available | 1 |
| EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models | Dec 21, 2023 | ResynthesisSpeech-to-Speech Translation | CodeCode Available | 1 |
| Speaker-Independent Acoustic-to-Articulatory Speech Inversion | Feb 14, 2023 | Resynthesis | CodeCode Available | 1 |
| Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling | Jan 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Differentiable Time-Frequency Scattering on GPU | Apr 18, 2022 | Audio GenerationCPU | CodeCode Available | 1 |
| Speech Resynthesis from Discrete Disentangled Self-Supervised Representations | Apr 1, 2021 | DisentanglementRepresentation Learning | CodeCode Available | 1 |
| Generative Spoken Language Modeling from Raw Audio | Feb 1, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Dynamical Variational Autoencoders: A Comprehensive Review | Aug 28, 2020 | 3D Human DynamicsResynthesis | CodeCode Available | 1 |
| Spoken Language Modeling with Duration-Penalized Self-Supervised Units | May 29, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dynamic Tsetlin Machine Accelerators for On-Chip Training at the Edge using FPGAs | Apr 28, 2025 | Resynthesis | —Unverified | 0 |
| Runtime Tunable Tsetlin Machines for Edge Inference on eFPGAs | Feb 10, 2025 | Model CompressionResynthesis | —Unverified | 0 |
| FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks | Feb 6, 2025 | ResynthesisVoice Conversion | —Unverified | 0 |
| DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models | Oct 31, 2024 | DecoderResynthesis | —Unverified | 0 |
| A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation | Oct 29, 2024 | Resynthesis | —Unverified | 0 |
| Learning Source Disentanglement in Neural Audio Codec | Sep 17, 2024 | Audio CompressionAudio Generation | —Unverified | 0 |
| Automatic Voice Identification after Speech Resynthesis using PPG | Aug 5, 2024 | ResynthesisSpeaker Verification | —Unverified | 0 |
| Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation | Jul 8, 2024 | Automatic Speech RecognitionEmotion Recognition | —Unverified | 0 |
| On the Parameter Estimation of Sinusoidal Models for Speech and Audio Signals | Jan 2, 2024 | parameter estimationResynthesis | —Unverified | 0 |
| Noise Morphing for Audio Time Stretching | Dec 22, 2023 | Resynthesis | —Unverified | 0 |
| AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement | Sep 14, 2023 | ResynthesisSpeech Enhancement | —Unverified | 0 |
| Evaluation of the Speech Resynthesis Capabilities of the VoicePrivacy Challenge Baseline B1 | Aug 22, 2023 | ResynthesisSpeaker anonymization | —Unverified | 0 |
| EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis | Aug 10, 2023 | ResynthesisSpeech Synthesis | —Unverified | 0 |
| Weakly-supervised Contrastive Learning for Unsupervised Object Discovery | Jul 7, 2023 | Contrastive LearningImage Reconstruction | CodeCode Available | 0 |
| Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data | Jun 29, 2023 | Machine TranslationProsody Prediction | —Unverified | 0 |
| In-the-wild Speech Emotion Conversion Using Disentangled Self-Supervised Representations and Neural Vocoder-based Resynthesis | Jun 2, 2023 | Resynthesis | —Unverified | 0 |
| How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics | Jun 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Implementation of a framework for deploying AI inference engines in FPGAs | May 30, 2023 | QuantizationResynthesis | —Unverified | 0 |
| ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration | Jan 1, 2023 | Audio-Visual Speech RecognitionResynthesis | —Unverified | 0 |
| ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement | Dec 21, 2022 | Audio-Visual Speech RecognitionResynthesis | —Unverified | 0 |
| Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge | Oct 27, 2022 | Acoustic Unit DiscoveryLanguage Modeling | —Unverified | 0 |
| An Initial study on Birdsong Re-synthesis Using Neural Vocoders | Sep 21, 2022 | ResynthesisSpeech Synthesis | —Unverified | 0 |
| DDX7: Differentiable FM Synthesis of Musical Instrument Sounds | Aug 12, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Experiments on Anomaly Detection in Autonomous Driving by Forward-Backward Style Transfers | Jul 13, 2022 | Anomaly DetectionAutonomous Driving | —Unverified | 0 |
| DrumGAN VST: A Plugin for Drum Sound Analysis/Synthesis With Autoencoding Generative Adversarial Networks | Jun 29, 2022 | Generative Adversarial NetworkResynthesis | —Unverified | 0 |
| A Perceptual Measure for Evaluating the Resynthesis of Automatic Music Transcriptions | Feb 24, 2022 | Music TranscriptionResynthesis | CodeCode Available | 0 |
| Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models | Oct 13, 2021 | ResynthesisSpeaker anonymization | —Unverified | 0 |
| Spectral Processing of COVID-19 Time-Series Data | Aug 13, 2020 | ResynthesisTime Series | CodeCode Available | 0 |
| Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement | Nov 14, 2019 | ResynthesisSpeech Enhancement | —Unverified | 0 |
| Parametric Resynthesis with neural vocoders | Jun 16, 2019 | Resynthesis | CodeCode Available | 0 |
| GazeCorrection:Self-Guided Eye Manipulation in the wild using Self-Supervised Generative Adversarial Networks | Jun 3, 2019 | Resynthesis | CodeCode Available | 0 |
| Coordinate-Based Texture Inpainting for Pose-Guided Human Image Generation | Jun 1, 2019 | Image GenerationResynthesis | —Unverified | 0 |
| Detecting the Unexpected via Image Resynthesis | Apr 16, 2019 | ResynthesisSemantic Segmentation | CodeCode Available | 0 |
| Speech denoising by parametric resynthesis | Apr 2, 2019 | DenoisingResynthesis | —Unverified | 0 |
| On Adversarial Mixup Resynthesis | Mar 7, 2019 | Resynthesis | CodeCode Available | 0 |
| Coordinate-based Texture Inpainting for Pose-Guided Image Generation | Nov 28, 2018 | Image GenerationPose-Guided Image Generation | CodeCode Available | 0 |
| Unifying Probabilistic Models for Time-Frequency Analysis | Nov 6, 2018 | Audio Signal ProcessingGaussian Processes | CodeCode Available | 0 |
| Sounderfeit: Cloning a Physical Model using a Conditional Adversarial Autoencoder | Jun 25, 2018 | parameter estimationResynthesis | —Unverified | 0 |
| Sounderfeit: Cloning a Physical Model with Conditional Adversarial Autoencoders | Feb 22, 2018 | parameter estimationResynthesis | —Unverified | 0 |