Versatile Speech Databases for High Quality Synthesis for Basque May 1, 2012 Emotional Speech Synthesis Speech Synthesis
— Unverified 00 Vers une annotation automatique de corpus audio pour la synth\`ese de parole (Towards Fully Automatic Annotation of Audio Books for Text-To-Speech (TTS) Synthesis) [in French] Jun 1, 2012 Speech Synthesis text-to-speech
— Unverified 00 VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing Nov 30, 2022 Machine Translation Sentence
— Unverified 00 Video-to-Video Translation for Visual Speech Synthesis May 28, 2019 Image-to-Image Translation Speech Synthesis
— Unverified 00 Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection Jun 15, 2022 feature selection Speech Synthesis
— Unverified 00 Visatronic: A Multimodal Decoder-Only Model for Speech Synthesis Nov 26, 2024 Decoder multimodal generation
— Unverified 00 Visual-Aware Text-to-Speech Jun 21, 2023 Rhythm Speech Synthesis
— Unverified 00 VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over Oct 7, 2021 Speech Synthesis text-to-speech
— Unverified 00 VNet: A GAN-based Multi-Tier Discriminator Network for Speech Synthesis Vocoders Aug 13, 2024 Speech Synthesis
— Unverified 00 Vocoder-Based Speech Synthesis from Silent Videos Apr 6, 2020 Multi-Task Learning Speech Synthesis
— Unverified 00 Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning Feb 10, 2021 Speech Synthesis text-to-speech
— Unverified 00 Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer Sep 3, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Voice Conversion for Whispered Speech Synthesis Dec 11, 2019 Speech Synthesis Voice Conversion
— Unverified 00 Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities Apr 10, 2017 speech-recognition Speech Recognition
— Unverified 00 VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models Apr 3, 2025 Speech Synthesis
— Unverified 00 VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis Dec 26, 2024 Audio Generation Speech Synthesis
— Unverified 00 Voice Filter: Few-shot text-to-speech speaker adaptation using voice conversion as a post-processing module Feb 16, 2022 Speech Synthesis text-to-speech
— Unverified 00 VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis Mar 1, 2024 Speech Synthesis
— Unverified 00 VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka Sep 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks Sep 14, 2023 Decoder Language Modeling
— Unverified 00 VQalAttent: a Transparent Speech Generation Pipeline based on Transformer-learned VQ-VAE Latent Space Nov 22, 2024 Audio Synthesis Decoder
— Unverified 00 VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature Apr 2, 2022 Speech Synthesis text-to-speech
— Unverified 00 Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder Jul 31, 2018 Generative Adversarial Network Speech Synthesis
— Unverified 00 WaveCycleGAN2: Time-domain Neural Post-filter for Speech Waveform Generation Apr 5, 2019 Speech Synthesis
— Unverified 00 WaveCycleGAN: Synthetic-to-natural speech waveform conversion using cycle-consistent adversarial networks Sep 25, 2018 Speech Synthesis Voice Conversion
— Unverified 00 Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks Oct 30, 2018 Image Generation Speech Synthesis
— Unverified 00 Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis Mar 24, 2023 Generative Adversarial Network Speech Synthesis
— Unverified 00 WavThruVec: Latent speech representation as intermediate features for neural speech synthesis Mar 31, 2022 Speech Synthesis text-to-speech
— Unverified 00 Weakly-supervised text-to-speech alignment confidence measure Dec 1, 2016 speech-recognition Speech Recognition
— Unverified 00 WebWOZ: A Platform for Designing and Conducting Web-based Wizard of Oz Experiments Aug 1, 2013 Machine Translation Speech Recognition
— Unverified 00 We Need Variations in Speech Generation: Sub-center Modelling for Speaker Embeddings Jul 5, 2024 Speaker Recognition Speech Synthesis
— Unverified 00 What happens to diffusion model likelihood when your model is conditional? Sep 10, 2024 domain classification model
— Unverified 00 What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS Sep 4, 2020 Decoder Sentence
— Unverified 00 Which Prosodic Features Matter Most for Pragmatics? Aug 23, 2024 Speech Synthesis
— Unverified 00 Which Synthetic Voice Should I Choose for an Evocative Task? Sep 1, 2015 Speech Synthesis Text-To-Speech Synthesis
— Unverified 00 Whispered and Lombard Neural Speech Synthesis Jan 13, 2021 Speaker Verification Speech Synthesis
— Unverified 00 Whither the Priors for (Vocal) Interactivity? Mar 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 WinkTalk: a demonstration of a multimodal speech synthesis platform linking facial expressions to expressive synthetic voices Jun 1, 2012 Speech Synthesis
— Unverified 00 WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis Jun 20, 2022 CPU Speech Synthesis
— Unverified 00 Word-Level Style Control for Expressive, Non-attentive Speech Synthesis Nov 19, 2021 Expressive Speech Synthesis Speech Synthesis
— Unverified 00 應用文脈分析於中英夾雜語音合成系統(Linguistic Analysis for English/Mandarin Speech Synthesis System) Oct 1, 2019 Speech Synthesis
— Unverified 00 You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation May 14, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Zero-Shot Long-Form Voice Cloning with Dynamic Convolution Attention Jan 25, 2022 Form Speech Synthesis
— Unverified 00 Zero-Shot Mono-to-Binaural Speech Synthesis Dec 11, 2024 Audio Synthesis Denoising
— Unverified 00 Zero-shot personalized lip-to-speech synthesis with face image based voice control May 9, 2023 Lip to Speech Synthesis Representation Learning
— Unverified 00 Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling May 26, 2025 Sentence Speech Synthesis
— Unverified 00 Zero-shot text-to-speech synthesis conditioned using self-supervised speech representation model Apr 24, 2023 Rhythm Self-Supervised Learning
— Unverified 00 ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models May 23, 2023 Speech Synthesis text-to-speech
— Unverified 00 整合語者嵌入向量與後置濾波器於提升個人化合成語音之語者相似度 (Incorporating Speaker Embedding and Post-Filter Network for Improving Speaker Similarity of Personalized Speech Synthesis System) Dec 1, 2021 Speech Synthesis
— Unverified 00