Using NLG for speech synthesis of mathematical sentences Oct 1, 2019 Sentence Speech Synthesis
Using previous acoustic context to improve Text-to-Speech synthesis Dec 7, 2020 Decoder Speech Synthesis
Using VAEs and Normalizing Flows for One-shot Text-To-Speech Synthesis of Expressive Speech Nov 28, 2019 Disentanglement Expressive Speech Synthesis
Utterance-level Sequential Modeling For Deep Gaussian Process Based Speech Synthesis Using Simple Recurrent Unit Apr 22, 2020 Speech Synthesis
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE Jun 6, 2022 Representation Learning Speech Representation Learning
UzbekTagger: The rule-based POS tagger for Uzbek language Jan 30, 2023 Language Modeling Language Modelling
VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages May 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers Jun 8, 2024 Speech Synthesis text-to-speech
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment Jun 12, 2024 Quantization Speech Synthesis
VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation Mar 14, 2023 Disentanglement Speech Synthesis
VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention Feb 12, 2021 Speech Synthesis text-to-speech
Variations prosodiques en synthèse par sélection d'unités: l'exemple des phrases interrogatives (Prosodic variations in unit-based speech synthesis: the example of interrogative sentences) [in French] Jun 1, 2012 Speech Synthesis Text-To-Speech Synthesis
VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion Feb 18, 2022 Quantization Speech Synthesis
vec2wav 2.0: Advancing Voice Conversion via Discrete Token Vocoders Sep 3, 2024 Speech Synthesis Voice Conversion
Versatile Speech Databases for High Quality Synthesis for Basque May 1, 2012 Emotional Speech Synthesis Speech Synthesis
Vers une annotation automatique de corpus audio pour la synthèse de parole (Towards Fully Automatic Annotation of Audio Books for Text-To-Speech (TTS) Synthesis) [in French] Jun 1, 2012 Speech Synthesis text-to-speech
VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing Nov 30, 2022 Machine Translation Sentence
Video-to-Video Translation for Visual Speech Synthesis May 28, 2019 Image-to-Image Translation Speech Synthesis
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection Jun 15, 2022 feature selection Speech Synthesis
Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis Nov 26, 2024 Decoder multimodal generation
Visual-Aware Text-to-Speech Jun 21, 2023 Rhythm Speech Synthesis
VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over Oct 7, 2021 Speech Synthesis text-to-speech
VNet: A GAN-based Multi-Tier Discriminator Network for Speech Synthesis Vocoders Aug 13, 2024 Speech Synthesis
Vocoder-Based Speech Synthesis from Silent Videos Apr 6, 2020 Multi-Task Learning Speech Synthesis
Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning Feb 10, 2021 Speech Synthesis text-to-speech
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer Sep 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Voice Conversion for Whispered Speech Synthesis Dec 11, 2019 Speech Synthesis Voice Conversion
Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities Apr 10, 2017 speech-recognition Speech Recognition
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models Apr 3, 2025 Speech Synthesis
VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis Dec 26, 2024 Audio Generation Speech Synthesis
Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module Feb 16, 2022 Speech Synthesis text-to-speech
VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis Mar 1, 2024 Speech Synthesis
VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka Sep 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks Sep 14, 2023 Decoder Language Modeling
VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space Nov 22, 2024 Audio Synthesis Decoder
VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature Apr 2, 2022 Speech Synthesis text-to-speech
Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder Jul 31, 2018 Generative Adversarial Network Speech Synthesis
WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation Apr 5, 2019 Speech Synthesis
WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks Sep 25, 2018 Speech Synthesis Voice Conversion
Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks Oct 30, 2018 Image Generation Speech Synthesis
Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis Mar 24, 2023 Generative Adversarial Network Speech Synthesis
WavThruVec: Latent speech representation as intermediate features for neural speech synthesis Mar 31, 2022 Speech Synthesis text-to-speech
Weakly-supervised text-to-speech alignment confidence measure Dec 1, 2016 speech-recognition Speech Recognition
WebWOZ: A Platform for Designing and Conducting Web-based Wizard of Oz Experiments Aug 1, 2013 Machine Translation Speech Recognition
We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings Jul 5, 2024 Speaker Recognition Speech Synthesis
What happens to diffusion model likelihood when your model is conditional? Sep 10, 2024 domain classification model
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS Sep 4, 2020 Decoder Sentence
Which Prosodic Features Matter Most for Pragmatics? Aug 23, 2024 Speech Synthesis
Which Synthetic Voice Should I Choose for an Evocative Task? Sep 1, 2015 Speech Synthesis Text-To-Speech Synthesis