A Corpus of Neutral Voice Speech in Brazilian Portuguese May 21, 2021 Speech Synthesis text-to-speech
— Unverified 0Speaker disentanglement in video-to-speech conversion May 20, 2021 Disentanglement Speech Synthesis
Code Code Available 0Learning Robust Latent Representations for Controllable Speech Synthesis May 10, 2021 Speech Synthesis text-to-speech
— Unverified 0Towards a practical lip-to-speech conversion system using deep neural networks and mobile application frontend Apr 29, 2021 Lip to Speech Synthesis Speech Synthesis
— Unverified 0End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks Apr 27, 2021 Lip Reading Speech Synthesis
— Unverified 0Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis Apr 26, 2021 Language Modeling Language Modelling
Code Code Available 0An Adaptive Learning based Generative Adversarial Network for One-To-One Voice Conversion Apr 25, 2021 Generative Adversarial Network Speech Synthesis
— Unverified 0Review of end-to-end speech synthesis technology based on deep learning Apr 20, 2021 Speech Synthesis
— Unverified 0Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis Apr 14, 2021 Dependency Parsing Representation Learning
— Unverified 0Half-Truth: A Partially Fake Audio Detection Dataset Apr 8, 2021 Speech Synthesis
Code Code Available 0Towards Multi-Scale Style Control for Expressive Speech Synthesis Apr 8, 2021 Expressive Speech Synthesis Speech Synthesis
— Unverified 0Flavored Tacotron: Conditional Learning for Prosodic-linguistic Features Apr 8, 2021 Decoder Speech Synthesis
— Unverified 0Diff-TTS: A Denoising Diffusion Model for Text-to-Speech Apr 3, 2021 Denoising GPU
— Unverified 0Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability Apr 3, 2021 Emotion Recognition reinforcement-learning
— Unverified 0Continual Speaker Adaptation for Text-to-Speech Synthesis Mar 26, 2021 Continual Learning Diversity
— Unverified 0SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German Mar 21, 2021 Speech Synthesis
— Unverified 0STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech Mar 17, 2021 Speech Synthesis Style Transfer
— Unverified 0Handling Background Noise in Neural Speech Generation Feb 23, 2021 Denoising Speech Synthesis
— Unverified 0Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input Feb 19, 2021 Language Modeling Language Modelling
— Unverified 0AudioVisual Speech Synthesis: A brief literature review Feb 18, 2021 Speech Synthesis text-to-speech
— Unverified 0VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention Feb 12, 2021 Speech Synthesis text-to-speech
— Unverified 0ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech Feb 11, 2021 Speaker Verification Speech Synthesis
— Unverified 0Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning Feb 10, 2021 Speech Synthesis text-to-speech
— Unverified 0Generacion de voces artificiales infantiles en castellano con acento costarricense Feb 2, 2021 Speech Synthesis
— Unverified 0SPEAK WITH YOUR HANDS Using Continuous Hand Gestures to control Articulatory Speech Synthesizer Feb 2, 2021 Speech Synthesis
— Unverified 0Universal Neural Vocoding with Parallel WaveNet Feb 1, 2021 Speech Synthesis
— Unverified 0Triple M: A Practical Text-to-speech Synthesis System With Multi-guidance Attention And Multi-band Multi-time LPCNet Jan 30, 2021 CPU Sentence
— Unverified 0Expressive Neural Voice Cloning Jan 30, 2021 Speech Synthesis Style Transfer
— Unverified 0Generating coherent spontaneous speech and gesture from text Jan 14, 2021 Gesture Generation Motion Generation
— Unverified 0Whispered and Lombard Neural Speech Synthesis Jan 13, 2021 Speaker Verification Speech Synthesis
— Unverified 0Multi-view Temporal Alignment for Non-parallel Articulatory-to-Acoustic Speech Synthesis Dec 30, 2020 Dynamic Time Warping MULTI-VIEW LEARNING
Code Code Available 0Speech Synthesis as Augmentation for Low-Resource ASR Dec 23, 2020 Data Augmentation speech-recognition
— Unverified 0Parallel WaveNet conditioned on VAE latent vectors Dec 17, 2020 Sentence Speech Synthesis
— Unverified 0Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis Dec 14, 2020 Cultural Vocal Bursts Intensity Prediction Speech Synthesis
— Unverified 0DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis Dec 9, 2020 Speaker Recognition Speech Synthesis
Code Code Available 0Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation Dec 8, 2020 Attribute Disentanglement
Code Code Available 0Using previous acoustic context to improve Text-to-Speech synthesis Dec 7, 2020 Decoder Speech Synthesis
— Unverified 0GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis Dec 3, 2020 Decoder Graph Embedding
— Unverified 0Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System Dec 1, 2020 Articles Emotional Speech Synthesis
— Unverified 0基於深度學習之中文文字轉台語語音合成系統初步探討 (A Preliminary Study on Deep Learning-based Chinese Text to Taiwanese Speech Synthesis System) Dec 1, 2020 Speech Synthesis
— Unverified 0German-Arabic Speech-to-Speech Translation for Psychiatric Diagnosis Dec 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos Nov 19, 2020 speech-recognition Speech Recognition
— Unverified 0Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis Nov 12, 2020 Speech Synthesis text-to-speech
— Unverified 0Using GANs to Synthesise Minimum Training Data for Deepfake Generation Nov 10, 2020 Face Swapping Image Generation
— Unverified 0Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis Nov 10, 2020 Speech Synthesis
— Unverified 0Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS Nov 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement Nov 8, 2020 Disentanglement Speech Synthesis
— Unverified 0Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis Nov 6, 2020 Decoder Sentence
— Unverified 0Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework Nov 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech Nov 4, 2020 Graph Attention Representation Learning
— Unverified 0