Audio Codec Augmentation for Robust Collaborative Watermarking of Speech Synthesis Sep 20, 2024 Face Swapping Speech Synthesis
Code Code Available 05 GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogram Apr 8, 2019 Speech Synthesis text-to-speech
Code Code Available 05 High Fidelity Speech Synthesis with Adversarial Networks Sep 25, 2019 Generative Adversarial Network Speech Synthesis
Code Code Available 05 Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting Feb 19, 2024 Language Modeling Language Modelling
Code Code Available 05 Attentive Multi-Layer Perceptron for Non-autoregressive Generation Oct 14, 2023 Machine Translation Speech Synthesis
Code Code Available 05 ConvNeXt Based Neural Network for Audio Anti-Spoofing Sep 14, 2022 image-classification Image Classification
Code Code Available 05 FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis Jul 8, 2022 Lip to Speech Synthesis Speech Synthesis
Code Code Available 05 PromptTTS: Controllable Text-to-Speech with Text Descriptions Nov 22, 2022 Decoder Speech Synthesis
Code Code Available 05 fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit Sep 14, 2021 Speech Synthesis text-to-speech
Code Code Available 05 Adapter-Based Extension of Multi-Speaker Text-to-Speech Model for New Speakers Nov 1, 2022 parameter-efficient fine-tuning Speech Synthesis
Code Code Available 05 Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging Jul 12, 2021 Prediction Speech Synthesis
Code Code Available 05 fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit Nov 1, 2021 Speech Synthesis text-to-speech
Code Code Available 05 Evaluating context-invariance in unsupervised speech representations Oct 27, 2022 Language Modelling speech-recognition
Code Code Available 05 Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals Jan 19, 2018 Speech Synthesis Voice Conversion
Code Code Available 05 Empirical Study Incorporating Linguistic Knowledge on Filled Pauses for Personalized Spontaneous Speech Synthesis Oct 14, 2022 Speech Synthesis Voice Cloning
Code Code Available 05 Exploration strategies for articulatory synthesis of complex syllable onsets Apr 20, 2022 Speech Synthesis
Code Code Available 05 Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation Mar 31, 2024 Language Modeling Language Modelling
Code Code Available 05 Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq May 25, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Continuous Speech Synthesis using per-token Latent Diffusion Oct 21, 2024 Image Generation Quantization
— Unverified 00 Continuous Expressive Speaking Styles Synthesis based on CVSM and MR-HMM Dec 1, 2016 Expressive Speech Synthesis Speech Recognition
— Unverified 00 Semi-supervised learning for continuous emotional intensity controllable speech synthesis with disentangled representations Nov 11, 2022 Emotional Speech Synthesis Speech Synthesis
— Unverified 00 Continuous Autoregressive Modeling with Stochastic Monotonic Alignment for Speech Synthesis Feb 3, 2025 Quantization Speech Synthesis
— Unverified 00 A Machine of Few Words -- Interactive Speaker Recognition with Reinforcement Learning Aug 7, 2020 Decision Making reinforcement-learning
— Unverified 00 Continual Speaker Adaptation for Text-to-Speech Synthesis Mar 26, 2021 Continual Learning Diversity
— Unverified 00 Contextual Expressive Text-to-Speech Nov 26, 2022 Speech Synthesis text-to-speech
— Unverified 00 A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond Feb 17, 2025 Contrastive Learning EEG
— Unverified 00 Constructive Interaction for Talking about Interesting Topics May 1, 2012 Management Speech Recognition
— Unverified 00 Construction of English-French Multimodal Affective Conversational Corpus from TV Dramas May 1, 2018 Emotion Recognition Speech Recognition
— Unverified 00 A Survey of Voice Translation Methodologies - Acoustic Dialect Decoder Oct 13, 2016 Decoder Sentence
— Unverified 00 Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input Feb 19, 2021 Language Modeling Language Modelling
— Unverified 00 Conditioning Sequence-to-sequence Networks with Learned Activations Sep 29, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Conditional Spoken Digit Generation with StyleGAN Sep 15, 2020 Image Generation Speech Synthesis
— Unverified 00 A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis Aug 3, 2022 Speech Synthesis text-to-speech
— Unverified 00 A Streamwise GAN Vocoder for Wideband Speech Coding at Very Low Bit Rate Aug 9, 2021 Speech Synthesis
— Unverified 00 Aligning phonemes using finte-state methods May 1, 2017 Speech Synthesis Spelling Correction
— Unverified 00 CONCSS: Contrastive-based Context Comprehension for Dialogue-appropriate Prosody in Conversational Speech Synthesis Dec 16, 2023 Contrastive Learning Self-Supervised Learning
— Unverified 00 Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need Jul 2, 2022 All Speech Synthesis
— Unverified 00 AS-Speech: Adaptive Style For Speech Synthesis Sep 9, 2024 Rhythm Speech Synthesis
— Unverified 00 Computer-Aided Quality Assurance of an Icelandic Pronunciation Dictionary May 1, 2014 speech-recognition Speech Recognition
— Unverified 00 Complete reconstruction of the tongue contour through acoustic to articulatory inversion using real-time MRI data Nov 4, 2024 Speech Synthesis
— Unverified 00 Assessing Evaluation Metrics for Speech-to-Speech Translation Oct 26, 2021 Machine Translation Open-Ended Question Answering
— Unverified 00 Aligning Opinions: Cross-Lingual Opinion Mining with Dependencies Jul 1, 2015 Coreference Resolution Named Entity Recognition (NER)
— Unverified 00 Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History Jun 16, 2022 Self-Supervised Learning Sentence
— Unverified 00 Accelerating Codec-based Speech Synthesis with Multi-Token Prediction and Speculative Decoding Oct 17, 2024 Speech Synthesis
— Unverified 00 Comparing performance of different set-covering strategies for linguistic content optimization in speech corpora May 1, 2012 Descriptive Speech Recognition
— Unverified 00 Comparing normalizing flows and diffusion models for prosody and acoustic modelling in text-to-speech Jul 31, 2023 Acoustic Modelling Speech Synthesis
— Unverified 00 Compact Neural TTS Voices for Accessibility Jan 28, 2025 Speech Synthesis text-to-speech
— Unverified 00 ASR-based Features for Emotion Recognition: A Transfer Learning Approach May 23, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation Apr 29, 2025 In-Context Learning Speech Synthesis
— Unverified 00 Combining Manual and Automatic Prosodic Annotation for Expressive Speech Synthesis May 1, 2016 Expressive Speech Synthesis Speech Synthesis
— Unverified 00