BAD: An Assistant tool for making verses in Basque Apr 1, 2012 Speech Synthesis Text-To-Speech Synthesis
— Unverified 0Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS Oct 9, 2024 Diversity Speech Synthesis
— Unverified 0BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model Jul 4, 2022 Language Modeling Language Modelling
— Unverified 0Boosting Large Language Model for Speech Synthesis: An Empirical Study Dec 30, 2023 Language Modeling Language Modelling
— Unverified 0ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis Mar 20, 2022 Speaker Verification Speech Synthesis
Code Code Available 0Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis Oct 9, 2021 Lifelong learning Speech Synthesis
Code Code Available 0MIA-Prognosis: A Deep Learning Framework to Predict Therapy Response Oct 8, 2020 Deep Learning Prognosis
Code Code Available 0Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems May 21, 2019 parameter estimation Speech Synthesis
Code Code Available 0Direct speech-to-speech translation with a sequence-to-sequence model Apr 12, 2019 Speech Synthesis Speech-to-Speech Translation
Code Code Available 0Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers Sep 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Non-Autoregressive Neural Text-to-Speech May 21, 2019 text-to-speech Text to Speech
Code Code Available 0Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting Feb 19, 2024 Language Modeling Language Modelling
Code Code Available 0Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input Jul 5, 2021 Speech Synthesis text-to-speech
Code Code Available 0Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale Jun 23, 2023 In-Context Learning Speech Synthesis
Code Code Available 0The Emotional Voices Database: Towards Controlling the Emotion Dimension in Voice Generation Systems Jun 25, 2018 Speech Emotion Recognition Speech Synthesis
Code Code Available 0Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis Jun 12, 2018 Speaker Verification Speech Synthesis
Code Code Available 0Tools and resources for Romanian text-to-speech and speech-to-text applications Feb 15, 2018 speech-recognition Speech Recognition
Code Code Available 0Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis Apr 26, 2021 Language Modeling Language Modelling
Code Code Available 0Spoofing Speaker Verification Systems with Deep Multi-speaker Text-to-speech Synthesis Oct 29, 2019 Speaker Verification Speech Synthesis
Code Code Available 0Multimodal Latent Language Modeling with Next-Token Diffusion Dec 11, 2024 Image Generation Language Modeling
Code Code Available 0Attentive Multi-Layer Perceptron for Non-autoregressive Generation Oct 14, 2023 Machine Translation Speech Synthesis
Code Code Available 0Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors Oct 25, 2023 en-US domain classification en-US Intent Classification
Code Code Available 0Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis Feb 28, 2020 Speech Synthesis text-to-speech
Code Code Available 0Independent and automatic evaluation of acoustic-to-articulatory inversion models Nov 15, 2019 speech-recognition Speech Recognition
Code Code Available 0Systematic Inequalities in Language Technology Performance across the World's Languages Oct 13, 2021 Dependency Parsing Machine Translation
Code Code Available 0Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish May 31, 2022 Machine Translation Speech Synthesis
Code Code Available 0PriorGrad: Improving Conditional Denoising Diffusion Models with Data-Dependent Adaptive Prior Jun 11, 2021 Audio Generation Denoising
Code Code Available 0Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language Oct 29, 2018 Speech Synthesis text-to-speech
Code Code Available 0Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging Jul 12, 2021 Prediction Speech Synthesis
Code Code Available 0Systematic Inequalities in Language Technology Performance across the World’s Languages May 1, 2022 Dependency Parsing Machine Translation
Code Code Available 0MelNet: A Generative Model for Audio in the Frequency Domain Jun 4, 2019 Audio Generation Music Generation
Code Code Available 0Meta Learning Text-to-Speech Synthesis in over 7000 Languages Jun 10, 2024 Meta-Learning Speech Synthesis
Code Code Available 0