Voice Conversion

I remember all the summer days Drinking wine in the sunshine I hope it never leaves And I remember all the summer nights Staring at you in the moonlight I hope you never leave 'cause baby You're so good to me You have all that all that I ever need It's easy to love you So easy to love you Ooh you know it's true The best part of being with you To know you're with me It's not so hard to say It's easy to love you I remember all those winter days frozen In the cold tryin' to get you home Should I be moving in, we can be together then Remember spending all those winter nights Stayin' inside by the warm fire Yeah you gotta know that I can never let you go You and I have the rest of our lives to say It's easy to love you So easy to love you Ooh you know it's true The best part of being with you To know you're with me It's not so hard to say It's easy to love you Can anybody else see it? Mm, can anybody else see what I do? Can anybody else feel it? Oh, can anybody else feel the way I do? But now I'm with you Hard to forget all the moments when We'd be sitting there hoping it would never end 'Cause this is meant to be So baby, will you marry me? It's easy to love you So easy to love you Ooh, you know it's true The best part of being with you To know you are with me It's not so hard to say It's easy to love you You and me will be together I know our love will last forever You and me will be together I know our love will last forever You know it's true The best part of being with you You're easy to love

Source: Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–400 of 520 papers

Title	Date	Tasks	Status
Cross-speaker style transfer for text-to-speech using data augmentation	Feb 10, 2022	Data AugmentationStyle Transfer	—Unverified
Invertible Voice Conversion	Jan 26, 2022	Voice Conversion	—Unverified
The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition	Jan 13, 2022	Generative Adversarial NetworkPhoneme Recognition	—Unverified
Emotion Intensity and its Control for Emotional Voice Conversion	Jan 10, 2022	Emotion ClassificationVoice Conversion	—Unverified
A Practical Guide to Logical Access Voice Presentation Attack Detection	Jan 10, 2022	Artifact DetectionSpeaker Verification	CodeCode Available
Adversarial Transformation of Spoofing Attacks for Voice Biometrics	Jan 4, 2022	Speaker VerificationVoice Conversion	—Unverified
IQDUBBING: Prosody modeling based on discrete self-supervised speech representation for expressive voice conversion	Jan 2, 2022	QuantizationVoice Conversion	—Unverified
The exploitation of Multiple Feature Extraction Techniques for Speaker Identification in Emotional States under Disguised Voices	Dec 15, 2021	Speaker IdentificationVoice Conversion	—Unverified
Training Robust Zero-Shot Voice Conversion Models with Self-supervised Features	Dec 8, 2021	DecoderSelf-Supervised Learning	—Unverified
Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion	Dec 6, 2021	DecoderVoice Conversion	—Unverified
VoiceMixer: Adversarial Voice Style Mixup	Dec 1, 2021	DisentanglementRepresentation Learning	—Unverified
One-shot Voice Conversion For Style Transfer Based On Speaker Adaptation	Nov 24, 2021	Style TransferVoice Conversion	—Unverified
AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion	Nov 12, 2021	Voice Conversion	—Unverified
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines	Nov 6, 2021	DisentanglementSpeaker Verification	CodeCode Available
Voice Conversion Can Improve ASR in Very Low-Resource Settings	Nov 4, 2021	Data Augmentationspeech-recognition	—Unverified
Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning	Oct 27, 2021	DisentanglementRepresentation Learning	—Unverified
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion	Oct 20, 2021	DisentanglementVoice Conversion	—Unverified
Speech Enhancement-assisted Voice Conversion in Noisy Environments	Oct 19, 2021	Speech EnhancementVoice Conversion	—Unverified
CycleFlow: Purify Information Factors by Cycle Loss	Oct 18, 2021	Voice Conversion	—Unverified
Towards Identity Preserving Normal to Dysarthric Voice Conversion	Oct 15, 2021	Data AugmentationDecision Making	—Unverified
Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models	Oct 13, 2021	ResynthesisSpeaker anonymization	—Unverified
DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding	Oct 13, 2021	Speech SynthesisVoice Conversion	—Unverified
Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding	Oct 10, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks	Oct 4, 2021	DecoderVoice Conversion	CodeCode Available
Incorporating speaker embedding and post-filter network for improving speaker similarity of personalized speech synthesis system	Oct 1, 2021	Speaker VerificationSpeech Synthesis	—Unverified
Adaptive Speech Duration Modification using a Deep-Generative Framework	Sep 29, 2021	DecoderDynamic Time Warping	—Unverified
ClsVC: Learning Speech Representations with two different classification tasks.	Sep 29, 2021	ClassificationVocal Bursts Valence Prediction	—Unverified
Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion	Sep 8, 2021	Dynamic Time WarpingSpeech Enhancement	—Unverified
Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection	Sep 1, 2021	Speaker VerificationSpeech Synthesis	—Unverified
RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform	Aug 12, 2021	Speaker VerificationSynthetic Speech Detection	—Unverified
StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition	Aug 10, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Beyond Voice Identity Conversion: Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations	Jul 26, 2021	Voice Conversion	—Unverified
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model	Jul 20, 2021	Voice ConversionVoice Similarity	CodeCode Available
On Prosody Modeling for ASR+TTS based Voice Conversion	Jul 20, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation	Jul 18, 2021	Data AugmentationEmotion Recognition	CodeCode Available
Many-to-Many Voice Conversion based Feature Disentanglement using Variational Autoencoder	Jul 11, 2021	DisentanglementVoice Conversion	—Unverified
A Deep-Bayesian Framework for Adaptive Speech Duration Modification	Jul 11, 2021	DecoderDynamic Time Warping	—Unverified
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer	Jul 8, 2021	Emotion RecognitionSpeech Emotion Recognition	—Unverified
An Objective Evaluation Framework for Pathological Speech Synthesis	Jul 1, 2021	Speech SynthesisVoice Conversion	—Unverified
Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments	Jun 16, 2021	DecoderVoice Conversion	CodeCode Available
Enriching Source Style Transfer in Recognition-Synthesis based Non-Parallel Voice Conversion	Jun 16, 2021	Style TransferVoice Conversion	—Unverified
Pathological voice adaptation with autoencoder-based voice conversion	Jun 15, 2021	Speech SynthesisVoice Conversion	—Unverified
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion	Jun 2, 2021	Voice Conversion	—Unverified
NVC-Net: End-to-End Adversarial Voice Conversion	Jun 2, 2021	GPUSpeech Synthesis	CodeCode Available
Learning Paralinguistic Features from Audiobooks through Style Voice Conversion	Jun 1, 2021	Emotion RecognitionStyle Detection	—Unverified
StarGAN-ZSVC: Towards Zero-Shot Voice Conversion in Low-Resource Contexts	May 31, 2021	Voice Conversion	—Unverified
DiffSVC: A Diffusion Probabilistic Model for Singing Voice Conversion	May 28, 2021	DenoisingVoice Conversion	—Unverified
Voice Conversion Based Speaker Normalization for Acoustic Unit Discovery	May 4, 2021	Acoustic Unit DiscoveryVoice Conversion	—Unverified
An Adaptive Learning based Generative Adversarial Network for One-To-One Voice Conversion	Apr 25, 2021	Generative Adversarial NetworkSpeech Synthesis	—Unverified
Towards end-to-end F0 voice conversion based on Dual-GAN with convolutional wavelet kernels	Apr 15, 2021	Voice Conversion	—Unverified

Show:10 25 50

← PrevPage 8 of 11Next →

All datasets ZeroSpeech 2019 English LibriSpeech test-clean VCTK

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	VQ-CPC	Speaker Similarity	3.8	—	Unverified
2	VQ-VAE	Speaker Similarity	3.49	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	kNN-VC (prematched HiFiGAN)	Character Error Rate (CER)	2.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DISSC	Total Length Error (TLE)	0.83	—	Unverified