SOTAVerified

Resynthesis

Papers

Showing 1-25 of 51 papers

Title | Status | Hype
Spoken Language Modeling with Duration-Penalized Self-Supervised Units | Code | 0
Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information | | 0
Dynamic Tsetlin Machine Accelerators for On-Chip Training at the Edge using FPGAs | | 0
Runtime Tunable Tsetlin Machines for Edge Inference on eFPGAs | | 0
FocalCodec: Low-Bitrate Speech Coding via Focal Modulation Networks | | 0
AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder | Code | 1
DC-Spin: A Speaker-invariant Speech Tokenizer for Spoken Language Models | | 0
A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation | | 0
Learning Source Disentanglement in Neural Audio Codec | | 0
Automatic Voice Identification after Speech Resynthesis using PPG | | 0
Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation | | 0
On the Parameter Estimation of Sinusoidal Models for Speech and Audio Signals | | 0
Noise Morphing for Audio Time Stretching | | 0
EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models | Code | 1
AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement | | 0
Evaluation of the Speech Resynthesis Capabilities of the VoicePrivacy Challenge Baseline B1 | | 0
EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis | | 0
Weakly-supervised Contrastive Learning for Unsupervised Object Discovery | Code | 0
Learning Multilingual Expressive Speech Representation for Prosody Prediction without Parallel Data | | 0
In-the-wild Speech Emotion Conversion Using Disentangled Self-Supervised Representations and Neural Vocoder-based Resynthesis | | 0
How Generative Spoken Language Modeling Encodes Noisy Speech: Investigation from Phonetics to Syntactics | | 0
Implementation of a framework for deploying AI inference engines in FPGAs | | 0
Speaker-Independent Acoustic-to-Articulatory Speech Inversion | Code | 1
Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling | Code | 1
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration | | 0

No leaderboard results yet.