Exploration strategies for articulatory synthesis of complex syllable onsets Apr 20, 2022 Speech Synthesis
Code Code Available 0Pushing the Performance of Synthetic Speech Detection with Kolmogorov-Arnold Networks and Self-Supervised Learning Models Jun 17, 2025 Kolmogorov-Arnold Networks Self-Supervised Learning
Code Code Available 0Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language Oct 29, 2018 Speech Synthesis text-to-speech
Code Code Available 0Speaker disentanglement in video-to-speech conversion May 20, 2021 Disentanglement Speech Synthesis
Code Code Available 0Towards a Real-time Measure of the Perception of Anthropomorphism in Human-robot Interaction Jan 24, 2022 Speech Synthesis
Code Code Available 0Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish May 31, 2022 Machine Translation Speech Synthesis
Code Code Available 0Systematic Inequalities in Language Technology Performance across the World's Languages Oct 13, 2021 Dependency Parsing Machine Translation
Code Code Available 0Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis Apr 26, 2021 Language Modeling Language Modelling
Code Code Available 0Systematic Inequalities in Language Technology Performance across the World’s Languages May 1, 2022 Dependency Parsing Machine Translation
Code Code Available 0Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting Oct 8, 2023 Prediction Speech Synthesis
Code Code Available 0Parallel WaveNet: Fast High-Fidelity Speech Synthesis Nov 28, 2017 Speech Synthesis Vocal Bursts Intensity Prediction
Code Code Available 0ECAPA-TDNN for Multi-speaker Text-to-speech Synthesis Mar 20, 2022 Speaker Verification Speech Synthesis
Code Code Available 0RawNet: Fast End-to-End Neural Vocoder Apr 10, 2019 Speech Synthesis
Code Code Available 0Read the Room: Adapting a Robot's Voice to Ambient and Social Contexts May 10, 2022 Speech Synthesis Voice Conversion
Code Code Available 0Intra- and Inter-modal Context Interaction Modeling for Conversational Speech Synthesis Dec 25, 2024 Contrastive Learning Speech Synthesis
Code Code Available 0One-Class Learning with Adaptive Centroid Shift for Audio Deepfake Detection Jun 24, 2024 Audio Deepfake Detection DeepFake Detection
Code Code Available 0Evaluating context-invariance in unsupervised speech representations Oct 27, 2022 Language Modelling speech-recognition
Code Code Available 0DurIAN: Duration Informed Attention Network For Multimodal Synthesis Sep 4, 2019 CPU Speech Synthesis
Code Code Available 0Take the Hint: Improving Arabic Diacritization with Partially-Diacritized Text Jun 6, 2023 Speech Synthesis
Code Code Available 0Recurrent Quantum Neural Networks Jun 25, 2020 Benchmarking BIG-bench Machine Learning
Code Code Available 0Towards Decoding Brain Activity During Passive Listening of Speech Feb 26, 2024 Brain Computer Interface Speech Synthesis
Code Code Available 0Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals Jan 19, 2018 Speech Synthesis Voice Conversion
Code Code Available 0DiVISe: Direct Visual-Input Speech Synthesis Preserving Speaker Characteristics And Intelligibility Mar 7, 2025 Speech Synthesis
Code Code Available 0Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis Apr 1, 2022 Speech Synthesis Voice Conversion
Code Code Available 0OmniDRCA: Parallel Speech-Text Foundation Model via Dual-Resolution Speech Representations and Contrastive Alignment Jun 11, 2025 cross-modal alignment Question Answering
Code Code Available 0Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech Synthesis Oct 14, 2022 Speech Synthesis Voice Cloning
Code Code Available 0WaveGlow: A Flow-based Generative Network for Speech Synthesis Oct 31, 2018 Audio Synthesis GPU
Code Code Available 0DialogueAgents: A Hybrid Agent-Based Speech Synthesis Framework for Multi-Party Dialogue Apr 20, 2025 Diversity Speech Synthesis
Code Code Available 0Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck Aug 19, 2019 Image Generation Speech Synthesis
Code Code Available 0NVC-Net: End-to-End Adversarial Voice Conversion Jun 2, 2021 GPU Speech Synthesis
Code Code Available 0Integrated Speech and Gesture Synthesis Aug 25, 2021 Speech Synthesis text-to-speech
Code Code Available 0Distilling the Knowledge from Conditional Normalizing Flows Jun 24, 2021 Image Super-Resolution Speech Synthesis
Code Code Available 0SpeechDialogueFactory: Generating High-Quality Speech Dialogue Data to Accelerate Your Speech-LLM Development Mar 31, 2025 Speech Synthesis Voice Cloning
Code Code Available 0Neural Voice Cloning with a Few Samples Feb 14, 2018 Speech Synthesis Voice Cloning
Code Code Available 0Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis Oct 9, 2021 Lifelong learning Speech Synthesis
Code Code Available 0Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting Feb 19, 2024 Language Modeling Language Modelling
Code Code Available 0A Variational Prosody Model for the decomposition and synthesis of speech prosody Jun 22, 2018 Speech Synthesis
Code Code Available 0Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning Oct 20, 2017 GPU Speech Synthesis
Code Code Available 0Neural Autoregressive Flows Apr 3, 2018 Density Estimation Speech Synthesis
Code Code Available 0Multi-view Temporal Alignment for Non-parallel Articulatory-to-Acoustic Speech Synthesis Dec 30, 2020 Dynamic Time Warping MULTI-VIEW LEARNING
Code Code Available 0ConvNeXt Based Neural Network for Audio Anti-Spoofing Sep 14, 2022 image-classification Image Classification
Code Code Available 0Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems May 21, 2019 parameter estimation Speech Synthesis
Code Code Available 0Independent and automatic evaluation of acoustic-to-articulatory inversion models Nov 15, 2019 speech-recognition Speech Recognition
Code Code Available 0Robust and fine-grained prosody control of end-to-end speech synthesis Nov 6, 2018 Expressive Speech Synthesis Speech Synthesis
Code Code Available 0DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021 Oct 25, 2021 Speech Synthesis text-to-speech
Code Code Available 0Audio Codec Augmentation for Robust Collaborative Watermarking of Speech Synthesis Sep 20, 2024 Face Swapping Speech Synthesis
Code Code Available 0VIFS: An End-to-End Variational Inference for Foley Sound Synthesis Jun 8, 2023 Speech Synthesis text-to-speech
Code Code Available 0Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input Jul 5, 2021 Speech Synthesis text-to-speech
Code Code Available 0Attentive Multi-Layer Perceptron for Non-autoregressive Generation Oct 14, 2023 Machine Translation Speech Synthesis
Code Code Available 0Disentangling Speech and Non-Speech Components for Building Robust Acoustic Models from Found Data Sep 25, 2019 speech-recognition Speech Recognition
Code Code Available 0