SOTAVerified

Voice Conversion


Source: Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet

Papers

Showing 201-250 of 520 papers

| Title | Status | Hype |
|---|---|---|
| GPU-Friendly Local Regression for Voice Conversion | | 0 |
| Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech | | 0 |
| SongBsAb: A Dual Prevention Approach against Singing Voice Conversion based Illegal Song Covers | | 0 |
| Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment | | 0 |
| Hierarchical Sequence to Sequence Voice Conversion with Limited Data | | 0 |
| Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion | | 0 |
| EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion | | 0 |
| Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer | | 0 |
| High Fidelity Speech Regeneration with Application to Speech Enhancement | | 0 |
| High-quality nonparallel voice conversion based on cycle-consistent adversarial network | | 0 |
| Comparison of Speech Representations for the MOS Prediction System | | 0 |
| A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion | | 0 |
| Adversarial speech for voice privacy protection from Personalized Speech generation | | 0 |
| Identifying Source Speakers for Voice Conversion based Spoofing Attacks on Speaker Verification Systems | | 0 |
| Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion | | 0 |
| Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder | | 0 |
| Improve few-shot voice cloning using multi-modal learning | | 0 |
| Improving child speech recognition with augmented child-like speech | | 0 |
| Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features | | 0 |
| Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models | | 0 |
| Improving Voice Conversion for Dissimilar Speakers Using Perceptual Losses | | 0 |
| Improving Voice Quality in Speech Anonymization With Just Perception-Informed Losses | | 0 |
| Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion | | 0 |
| Exploring synthetic data for cross-speaker style transfer in style representation based TTS | | 0 |
| Exploring data augmentation in bias mitigation against non-native-accented speech | | 0 |
| Individuality-Preserving Voice Conversion for Articulation Disorders Using Locality-Constrained NMF | | 0 |
| Invertible Voice Conversion | | 0 |
| Investigating Inter- and Intra-speaker Voice Conversion using Audiobooks | | 0 |
| D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack | | 0 |
| Investigating self-supervised features for expressive, multilingual voice conversion | | 0 |
| A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023 | | 0 |
| Evaluation of Speaker Anonymization on Emotional Speech | | 0 |
| Investigation of using disentangled and interpretable representations with language conditioning for cross-lingual voice conversion | | 0 |
| IQDUBBING: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion | | 0 |
| Evaluating Voice Conversion-based Privacy Protection against Informed Attackers | | 0 |
| Iteratively Improving Speech Recognition and Voice Conversion | | 0 |
| Combining Automatic Speaker Verification and Prosody Analysis for Synthetic Speech Detection | | 0 |
| Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet | | 0 |
| Eta-WavLM: Efficient Speaker Identity Removal in Self-Supervised Speech Representations Using a Simple Linear Equation | | 0 |
| Latent linguistic embedding for cross-lingual text-to-speech and voice conversion | | 0 |
| LCM-SVC: Latent Diffusion Model Based Singing Voice Conversion with Inference Acceleration via Latent Consistency Distillation | | 0 |
| LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance | | 0 |
| Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion | | 0 |
| Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion | | 0 |
| Learning Paralinguistic Features from Audiobooks through Style Voice Conversion | | 0 |
| Learning Singing From Speech | | 0 |
| Application of ASV for Voice Identification after VC and Duration Predictor Improvement in TTS Models | | 0 |
| AASIST3: KAN-Enhanced AASIST Speech Deepfake Detection using SSL Features and Additional Regularization for the ASVspoof 2024 Challenge | | 0 |
| Error Reduction Network for DBLSTM-based Voice Conversion | | 0 |
| ClsVC: Learning Speech Representations with two different classification tasks. | | 0 |
Page 5 of 11

Benchmark Results

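| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VQ-CPC | Speaker Similarity | 3.8 | | Unverified |
| 2 | VQ-VAE | Speaker Similarity | 3.49 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | kNN-VC (prematched HiFiGAN) | Character Error Rate (CER) | 2.96 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DISSC | Total Length Error (TLE) | 0.83 | | Unverified |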