Voice Conversion

I remember all the summer days Drinking wine in the sunshine I hope it never leaves And I remember all the summer nights Staring at you in the moonlight I hope you never leave 'cause baby You're so good to me You have all that all that I ever need It's easy to love you So easy to love you Ooh you know it's true The best part of being with you To know you're with me It's not so hard to say It's easy to love you I remember all those winter days frozen In the cold tryin' to get you home Should I be moving in, we can be together then Remember spending all those winter nights Stayin' inside by the warm fire Yeah you gotta know that I can never let you go You and I have the rest of our lives to say It's easy to love you So easy to love you Ooh you know it's true The best part of being with you To know you're with me It's not so hard to say It's easy to love you Can anybody else see it? Mm, can anybody else see what I do? Can anybody else feel it? Oh, can anybody else feel the way I do? But now I'm with you Hard to forget all the moments when We'd be sitting there hoping it would never end 'Cause this is meant to be So baby, will you marry me? It's easy to love you So easy to love you Ooh, you know it's true The best part of being with you To know you are with me It's not so hard to say It's easy to love you You and me will be together I know our love will last forever You and me will be together I know our love will last forever You know it's true The best part of being with you You're easy to love

Source: Joint training framework for text-to-speech and voice conversion using multi-source Tacotron and WaveNet

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–350 of 520 papers

Title	Date	Tasks	Status	Hype
VoiceMixer: Adversarial Voice Style Mixup	Dec 1, 2021	DisentanglementRepresentation Learning	—Unverified	0
CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer	Nov 30, 2021	Voice Conversion	CodeCode Available	1
One-shot Voice Conversion For Style Transfer Based On Speaker Adaptation	Nov 24, 2021	Style TransferVoice Conversion	—Unverified	0
AC-VC: Non-parallel Low Latency Phonetic Posteriorgrams Based Voice Conversion	Nov 12, 2021	Voice Conversion	—Unverified	0
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines	Nov 6, 2021	DisentanglementSpeaker Verification	CodeCode Available	0
Voice Conversion Can Improve ASR in Very Low-Resource Settings	Nov 4, 2021	Data Augmentationspeech-recognition	—Unverified	0
A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion	Nov 3, 2021	Representation LearningVoice Conversion	CodeCode Available	1
Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations	Oct 27, 2021	Voice Conversion	CodeCode Available	1
Zero-shot Voice Conversion via Self-supervised Prosody Representation Learning	Oct 27, 2021	DisentanglementRepresentation Learning	—Unverified	0
Controllable and Interpretable Singing Voice Decomposition via Assem-VC	Oct 25, 2021	Voice Conversion	CodeCode Available	1
Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion	Oct 20, 2021	DisentanglementVoice Conversion	—Unverified	0
Speech Enhancement-assisted Voice Conversion in Noisy Environments	Oct 19, 2021	Speech EnhancementVoice Conversion	—Unverified	0
CycleFlow: Purify Information Factors by Cycle Loss	Oct 18, 2021	Voice Conversion	—Unverified	0
FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection	Oct 18, 2021	Speech SynthesisSynthetic Speech Detection	CodeCode Available	1
LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech	Oct 18, 2021	Voice Conversion	CodeCode Available	1
Towards Identity Preserving Normal to Dysarthric Voice Conversion	Oct 15, 2021	Data AugmentationDecision Making	—Unverified	0
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing	Oct 14, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	1
Toward Degradation-Robust Voice Conversion	Oct 14, 2021	DenoisingSpeech Enhancement	CodeCode Available	1
Exploring the Importance of F0 Trajectories for Speaker Anonymization using X-vectors and Neural Waveform Models	Oct 13, 2021	ResynthesisSpeaker anonymization	—Unverified	0
DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding	Oct 13, 2021	Speech SynthesisVoice Conversion	—Unverified	0
S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations	Oct 12, 2021	BenchmarkingVoice Conversion	CodeCode Available	1
Towards High-fidelity Singing Voice Conversion with Acoustic Reference and Contrastive Predictive Coding	Oct 10, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
MediumVC: Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features	Oct 6, 2021	Voice Conversion	CodeCode Available	1
Decoupling Speaker-Independent Emotions for Voice Conversion Via Source-Filter Networks	Oct 4, 2021	DecoderVoice Conversion	CodeCode Available	0
Incorporating speaker embedding and post-filter network for improving speaker similarity of personalized speech synthesis system	Oct 1, 2021	Speaker VerificationSpeech Synthesis	—Unverified	0
ClsVC: Learning Speech Representations with two different classification tasks.	Sep 29, 2021	ClassificationVocal Bursts Valence Prediction	—Unverified	0
Adaptive Speech Duration Modification using a Deep-Generative Framework	Sep 29, 2021	DecoderDynamic Time Warping	—Unverified	0
Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme	Sep 28, 2021	Speech SynthesisVoice Conversion	CodeCode Available	1
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration	Sep 12, 2021	Decodertext-to-speech	CodeCode Available	1
Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion	Sep 8, 2021	Dynamic Time WarpingSpeech Enhancement	—Unverified	0
Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection	Sep 1, 2021	Speaker VerificationSpeech Synthesis	—Unverified	0
RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform	Aug 12, 2021	Speaker VerificationSynthetic Speech Detection	—Unverified	0
StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition	Aug 10, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Beyond Voice Identity Conversion: Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations	Jul 26, 2021	Voice Conversion	—Unverified	0
UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021	Jul 26, 2021	Audio CompressionFace Swapping	CodeCode Available	1
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion	Jul 21, 2021	Generative Adversarial Networktext-to-speech	CodeCode Available	1
SVSNet: An End-to-end Speaker Voice Similarity Assessment Model	Jul 20, 2021	Voice ConversionVoice Similarity	CodeCode Available	0
On Prosody Modeling for ASR+TTS based Voice Conversion	Jul 20, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation	Jul 18, 2021	Data AugmentationEmotion Recognition	CodeCode Available	0
Many-to-Many Voice Conversion based Feature Disentanglement using Variational Autoencoder	Jul 11, 2021	DisentanglementVoice Conversion	—Unverified	0
A Deep-Bayesian Framework for Adaptive Speech Duration Modification	Jul 11, 2021	DecoderDynamic Time Warping	—Unverified	0
Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer	Jul 8, 2021	Emotion RecognitionSpeech Emotion Recognition	—Unverified	0
An Objective Evaluation Framework for Pathological Speech Synthesis	Jul 1, 2021	Speech SynthesisVoice Conversion	—Unverified	0
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-shot Voice Conversion	Jun 18, 2021	DisentanglementQuantization	CodeCode Available	1
Voicy: Zero-Shot Non-Parallel Voice Conversion in Noisy Reverberant Environments	Jun 16, 2021	DecoderVoice Conversion	CodeCode Available	0
Enriching Source Style Transfer in Recognition-Synthesis based Non-Parallel Voice Conversion	Jun 16, 2021	Style TransferVoice Conversion	—Unverified	0
Pathological voice adaptation with autoencoder-based voice conversion	Jun 15, 2021	Speech SynthesisVoice Conversion	—Unverified	0
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion	Jun 2, 2021	Voice Conversion	—Unverified	0
NVC-Net: End-to-End Adversarial Voice Conversion	Jun 2, 2021	GPUSpeech Synthesis	CodeCode Available	0
Learning Paralinguistic Features from Audiobooks through Style Voice Conversion	Jun 1, 2021	Emotion RecognitionStyle Detection	—Unverified	0

Show:10 25 50

← PrevPage 7 of 11Next →

All datasets ZeroSpeech 2019 English LibriSpeech test-clean VCTK

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	VQ-CPC	Speaker Similarity	3.8	—	Unverified
2	VQ-VAE	Speaker Similarity	3.49	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	kNN-VC (prematched HiFiGAN)	Character Error Rate (CER)	2.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DISSC	Total Length Error (TLE)	0.83	—	Unverified