Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability Apr 3, 2021 Emotion Recognition reinforcement-learning [Unverified 0]
Diff-TTS: A Denoising Diffusion Model for Text-to-Speech Apr 3, 2021 Denoising GPU [Unverified 0]
Assem-VC: Realistic Voice Conversion by Assembling Modern Speech Synthesis Techniques Apr 2, 2021 Decoder Rhythm [Code Available 1]
Continual Speaker Adaptation for Text-to-Speech Synthesis Mar 26, 2021 Continual Learning Diversity [Unverified 0]
SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German Mar 21, 2021 Speech Synthesis [Unverified 0]
STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech Mar 17, 2021 Speech Synthesis Style Transfer [Unverified 0]
Multilingual Byte2Speech Models for Scalable Low-resource Speech Synthesis Mar 5, 2021 Speech Synthesis [Code Available 1]
Handling Background Noise in Neural Speech Generation Feb 23, 2021 Denoising Speech Synthesis [Unverified 0]
Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input Feb 19, 2021 Language Modeling Language Modelling [Unverified 0]
AudioVisual Speech Synthesis: A brief literature review Feb 18, 2021 Speech Synthesis text-to-speech [Unverified 0]
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention Feb 12, 2021 Speech Synthesis text-to-speech [Unverified 0]
ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech Feb 11, 2021 Speaker Verification Speech Synthesis [Unverified 0]
Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning Feb 10, 2021 Speech Synthesis text-to-speech [Unverified 0]
CDPAM: Contrastive learning for perceptual audio similarity Feb 9, 2021 Contrastive Learning Speech Enhancement [Code Available 1]
Generacion de voces artificiales infantiles en castellano con acento costarricense Feb 2, 2021 Speech Synthesis [Unverified 0]
SPEAK WITH YOUR HANDS Using Continuous Hand Gestures to control Articulatory Speech Synthesizer Feb 2, 2021 Speech Synthesis [Unverified 0]
Universal Neural Vocoding with Parallel WaveNet Feb 1, 2021 Speech Synthesis [Unverified 0]
Expressive Neural Voice Cloning Jan 30, 2021 Speech Synthesis Style Transfer [Unverified 0]
Triple M: A Practical Text-to-speech Synthesis System With Multi-guidance Attention And Multi-band Multi-time LPCNet Jan 30, 2021 CPU Sentence [Unverified 0]
Generating coherent spontaneous speech and gesture from text Jan 14, 2021 Gesture Generation Motion Generation [Unverified 0]
Whispered and Lombard Neural Speech Synthesis Jan 13, 2021 Speaker Verification Speech Synthesis [Unverified 0]
EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting Dec 31, 2020 Keyword Spotting Keyword Spotting CSS [Code Available 1]
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units Dec 31, 2020 Image Captioning Speech Synthesis [Code Available 1]
Multi-view Temporal Alignment for Non-parallel Articulatory-to-Acoustic Speech Synthesis Dec 30, 2020 Dynamic Time Warping MULTI-VIEW LEARNING [Code Available 0]
Speech Synthesis as Augmentation for Low-Resource ASR Dec 23, 2020 Data Augmentation speech-recognition [Unverified 0]
Parallel WaveNet conditioned on VAE latent vectors Dec 17, 2020 Sentence Speech Synthesis [Unverified 0]
Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis Dec 14, 2020 Cultural Vocal Bursts Intensity Prediction Speech Synthesis [Unverified 0]
DeepTalk: Vocal Style Encoding for Speaker Recognition and Speech Synthesis Dec 9, 2020 Speaker Recognition Speech Synthesis [Code Available 0]
Adversarial Disentanglement of Speaker Representation for Attribute-Driven Privacy Preservation Dec 8, 2020 Attribute Disentanglement [Code Available 0]
Using previous acoustic context to improve Text-to-Speech synthesis Dec 7, 2020 Decoder Speech Synthesis [Unverified 0]
GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis Dec 3, 2020 Decoder Graph Embedding [Unverified 0]
German-Arabic Speech-to-Speech Translation for Psychiatric Diagnosis Dec 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR) [Unverified 0]
基於深度學習之中文文字轉台語語音合成系統初步探討 (A Preliminary Study on Deep Learning-based Chinese Text to Taiwanese Speech Synthesis System) Dec 1, 2020 Speech Synthesis [Unverified 0]
Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System Dec 1, 2020 Articles Emotional Speech Synthesis [Unverified 0]
Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on Knowledge Graph Entities Dec 1, 2020 Chinese Word Segmentation Speech Synthesis [Code Available 1]
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis Nov 24, 2020 Generative Adversarial Network Speech Synthesis [Code Available 1]
TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos Nov 19, 2020 speech-recognition Speech Recognition [Unverified 0]
Hierarchical Prosody Modeling for Non-Autoregressive Speech Synthesis Nov 12, 2020 Speech Synthesis text-to-speech [Unverified 0]
Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis Nov 10, 2020 Speech Synthesis [Unverified 0]
Using GANs to Synthesise Minimum Training Data for Deepfake Generation Nov 10, 2020 Face Swapping Image Generation [Unverified 0]
Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS Nov 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR) [Unverified 0]
Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement Nov 8, 2020 Disentanglement Speech Synthesis [Unverified 0]
Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis Nov 6, 2020 Decoder Sentence [Unverified 0]
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis Nov 6, 2020 Decoder Speech Synthesis [Code Available 1]
Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on Knowledge Graph Entities Nov 5, 2020 Chinese Word Segmentation Speech Synthesis [Code Available 1]
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework Nov 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR) [Unverified 0]
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech Nov 4, 2020 Graph Attention Representation Learning [Unverified 0]
Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time Nov 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR) [Unverified 0]
Training Wake Word Detection with Synthesized Speech Data on Confusion Words Nov 3, 2020 Data Augmentation Keyword Spotting [Unverified 0]
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech Nov 2, 2020 Knowledge Distillation Speech Synthesis [Unverified 0]