SOTAVerified

Voice Conversion

Source: Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet

Papers

Showing 251–275 of 520 papers

Title | Status | Hype
Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder | | 0
Speaker-independent raw waveform model for glottal excitation | | 0
Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN | | 0
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition | | 0
Speech Enhancement-assisted Voice Conversion in Noisy Environments | | 0
Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives | | 0
Speech Synthesis along Perceptual Voice Quality Dimensions | | 0
ZSVC: Zero-shot Style Voice Conversion with Disentangled Latent Diffusion Models and Adversarial Training | | 0
Automatic Voice Identification after Speech Resynthesis using PPG | | 0
RT-VC: Real-Time Zero-Shot Voice Conversion with Speech Articulatory Coding | | 0
AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge | | 0
ACE-VC: Adaptive and Controllable Voice Conversion using Explicitly Disentangled Self-supervised Speech Representations | | 0
A Comparative Analysis Of Latent Regressor Losses For Singing Voice Conversion | | 0
A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023 | | 0
AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion | | 0
Adaptive Speech Duration Modification using a Deep-Generative Framework | | 0
AdaptVC: High Quality Voice Conversion with Adaptive Learning | | 0
A Deep-Bayesian Framework for Adaptive Speech Duration Modification | | 0
Adversarially learning disentangled speech representations for robust multi-factor voice conversion | | 0
Adversarially Trained Autoencoders for Parallel-Data-Free Voice Conversion | | 0
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion | | 0
Adversarial speech for voice privacy protection from Personalized Speech generation | | 0
Adversarial Transformation of Spoofing Attacks for Voice Biometrics | | 0
AE-Flow: AutoEncoder Normalizing Flow | | 0
A Hierarchical Speaker Representation Framework for One-shot Singing Voice Conversion | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | VQ-CPC | Speaker Similarity | 3.8 | | Unverified
2 | VQ-VAE | Speaker Similarity | 3.49 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | kNN-VC (prematched HiFiGAN) | Character Error Rate (CER) | 2.96 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DISSC | Total Length Error (TLE) | 0.83 | | Unverified