Voice Conversion

I remember all the summer days Drinking wine in the sunshine I hope it never leaves And I remember all the summer nights Staring at you in the moonlight I hope you never leave 'cause baby You're so good to me You have all that all that I ever need It's easy to love you So easy to love you Ooh you know it's true The best part of being with you To know you're with me It's not so hard to say It's easy to love you I remember all those winter days frozen In the cold tryin' to get you home Should I be moving in, we can be together then Remember spending all those winter nights Stayin' inside by the warm fire Yeah you gotta know that I can never let you go You and I have the rest of our lives to say It's easy to love you So easy to love you Ooh you know it's true The best part of being with you To know you're with me It's not so hard to say It's easy to love you Can anybody else see it? Mm, can anybody else see what I do? Can anybody else feel it? Oh, can anybody else feel the way I do? But now I'm with you Hard to forget all the moments when We'd be sitting there hoping it would never end 'Cause this is meant to be So baby, will you marry me? It's easy to love you So easy to love you Ooh, you know it's true The best part of being with you To know you are with me It's not so hard to say It's easy to love you You and me will be together I know our love will last forever You and me will be together I know our love will last forever You know it's true The best part of being with you You're easy to love

Source: Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–520 of 520 papers

Title	Date	Tasks	Status
Mel-spectrogram augmentation for sequence to sequence voice conversion	Jan 6, 2020	Voice Conversion	CodeCode Available
Deep Residual Neural Networks for Audio Spoofing Detection	Jun 30, 2019	Speaker VerificationSpeech Synthesis	CodeCode Available
MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms	Oct 8, 2019	Generative Adversarial NetworkMusic Style Transfer	CodeCode Available
AdaGAN: Adaptive GAN for Many-to-Many Non-Parallel Voice Conversion	Sep 25, 2019	Generative Adversarial NetworkStyle Transfer	CodeCode Available
Vocoder-free End-to-End Voice Conversion with Transformer Network	Feb 5, 2020	speech-recognitionSpeech Recognition	CodeCode Available
Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis	Apr 1, 2022	Speech SynthesisVoice Conversion	CodeCode Available
Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion	May 25, 2023	Audio Deepfake DetectionDeepFake Detection	CodeCode Available
Unsupervised End-to-End Learning of Discrete Linguistic Units for Voice Conversion	May 28, 2019	DecoderVoice Conversion	CodeCode Available
Hear Your Face: Face-based voice conversion with F0 estimation	Aug 19, 2024	Voice Conversion	CodeCode Available
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS	Oct 6, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Parallel-Data-Free Voice Conversion Using Cycle-Consistent Adversarial Networks	Nov 30, 2017	Voice Conversion	CodeCode Available
Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised Learning	Jun 5, 2020	Self-Supervised LearningSpeaker Verification	CodeCode Available
Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech	Jun 2, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks	Oct 4, 2021	DecoderVoice Conversion	CodeCode Available
Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition	May 19, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Generative Adversarial Networks for Unpaired Voice Transformation on Impaired Speech	Oct 30, 2018	Speech RecognitionVoice Conversion	CodeCode Available
NVC-Net: End-to-End Adversarial Voice Conversion	Jun 2, 2021	GPUSpeech Synthesis	CodeCode Available
Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals	Jan 19, 2018	Speech SynthesisVoice Conversion	CodeCode Available
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworks	Apr 1, 2019	Feature Engineeringtext-to-speech	CodeCode Available
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation	Jul 18, 2021	Data AugmentationEmotion Recognition	CodeCode Available

Show:10 25 50

← PrevPage 11 of 11Next →

All datasets ZeroSpeech 2019 English LibriSpeech test-clean VCTK

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	VQ-CPC	Speaker Similarity	3.8	—	Unverified
2	VQ-VAE	Speaker Similarity	3.49	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	kNN-VC (prematched HiFiGAN)	Character Error Rate (CER)	2.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DISSC	Total Length Error (TLE)	0.83	—	Unverified