Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? Jun 11, 2024 Contrastive Learning Speech Synthesis
— Unverified 00 Extending Multilingual Speech Synthesis to 100+ Languages without Transcribed Data Feb 29, 2024 Representation Learning Speech Synthesis
— Unverified 00 Enhancing Multilingual Speech Generation and Recognition Abilities in LLMs with Constructed Code-switched Data Sep 17, 2024 Speech Synthesis
— Unverified 00 Enhancing audio quality for expressive Neural Text-to-Speech Aug 13, 2021 Acoustic Modelling Speech Synthesis
— Unverified 00 A Preliminary Study on Mandarin-Hakka neural machine translation using small-sized data Nov 1, 2022 Machine Translation Speech Synthesis
— Unverified 00 Face-StyleSpeech: Enhancing Zero-shot Speech Synthesis from Face Images with Improved Face-to-Speech Mapping Sep 25, 2023 Speech Synthesis text-to-speech
— Unverified 00 Facial Expression-Enhanced TTS: Combining Face Representation and Emotion Intensity for Adaptive Speech Sep 24, 2024 Emotional Speech Synthesis Speech Synthesis
— Unverified 00 FADEL: Uncertainty-aware Fake Audio Detection with Evidential Deep Learning Apr 22, 2025 Deep Learning Speaker Verification
— Unverified 00 FA-GAN: Artifacts-free and Phase-aware High-fidelity GAN-based Vocoder Jul 5, 2024 Generative Adversarial Network Speech Synthesis
— Unverified 00 A Flow-Based Neural Network for Time Domain Speech Enhancement Jun 16, 2021 Density Estimation Speech Enhancement
— Unverified 00 fairseq Sˆ2: A Scalable and Integrable Speech Synthesis Toolkit Nov 1, 2021 Speech Synthesis text-to-speech
— Unverified 00 Fast and Accurate Decision Trees for Natural Language Processing Tasks Sep 1, 2017 Attribute BIG-bench Machine Learning
— Unverified 00 A comparison of Vietnamese Statistical Parametric Speech Synthesis Systems May 26, 2020 GPU Speech Synthesis
— Unverified 00 Fast and small footprint Hybrid HMM-HiFiGAN based system for speech synthesis in Indian languages Feb 13, 2023 Speech Synthesis text-to-speech
— Unverified 00 Fast Bootstrapping of Grapheme to Phoneme System for Under-resourced Languages - Application to the Iban Language Oct 1, 2013 Speech Recognition Speech Synthesis
— Unverified 00 Fast, Compact, and High Quality LSTM-RNN Based Statistical Parametric Speech Synthesizers for Mobile Devices Jun 20, 2016 Quantization Speech Synthesis
— Unverified 00 A Bengali Speech Synthesizer on Android OS Jul 1, 2012 Speech Synthesis
— Unverified 00 Energy-Based Models For Speech Synthesis Oct 19, 2023 Speech Synthesis
— Unverified 00 Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end? Sep 12, 2023 Self-Supervised Learning Speech Synthesis
— Unverified 00 End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks Apr 27, 2021 Lip Reading Speech Synthesis
— Unverified 00 End-to-End Text-to-Speech using Latent Duration based on VQ-VAE Oct 19, 2020 Speech Synthesis text-to-speech
— Unverified 00 Fast Spectrogram Inversion using Multi-head Convolutional Neural Networks Aug 20, 2018 speech-recognition Speech Recognition
— Unverified 00 A Preliminary Study on Deep Learning-based Chinese Text to Taiwanese Speech Synthesis System Sep 1, 2020 Speech Synthesis
— Unverified 00 CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center May 23, 2023 Speech Synthesis
— Unverified 00 End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator Oct 31, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 End-to-End Emotional Speech Synthesis Using Style Tokens and Semi-Supervised Training Jun 26, 2019 Emotional Speech Synthesis Emotion Recognition
— Unverified 00 Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes Nov 22, 2018 All speech-recognition
— Unverified 00 A Practical Guide to Logical Access Voice Presentation Attack Detection Jan 10, 2022 Artifact Detection Speaker Verification
— Unverified 00 Fine-grained Noise Control for Multispeaker Speech Synthesis Apr 11, 2022 Expressive Speech Synthesis Speech Synthesis
— Unverified 00 AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis Aug 16, 2023 Attribute Speech Synthesis
— Unverified 00 Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement Nov 8, 2020 Disentanglement Speech Synthesis
— Unverified 00 Fitting New Speakers Based on a Short Untranscribed Sample Feb 20, 2018 Speech Synthesis text-to-speech
— Unverified 00 End-to-End Binaural Speech Synthesis Jul 8, 2022 Decoder Speech Synthesis
— Unverified 00 Flavored Tacotron: Conditional Learning for Prosodic-linguistic Features Apr 8, 2021 Decoder Speech Synthesis
— Unverified 00 FleSpeech: Flexibly Controllable Speech Generation with Various Prompts Jan 8, 2025 Speech Synthesis
— Unverified 00 ELF: Encoding Speaker-Specific Latent Speech Feature for Speech Synthesis Nov 20, 2023 Speech Synthesis
— Unverified 00 EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech Mar 13, 2024 GPU Speech Synthesis
— Unverified 00 FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis Jun 30, 2024 CPU Decoder
— Unverified 00 BU-TTS: An Open-Source, Bilingual Welsh-English, Text-to-Speech Corpus Jun 1, 2022 Speech Synthesis text-to-speech
— Unverified 00 Applying Syntaxx2013Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis Mar 29, 2022 Speech Synthesis text-to-speech
— Unverified 00 Empowering Global Voices: A Data-Efficient, Phoneme-Tone Adaptive Approach to High-Fidelity Speech Synthesis Apr 10, 2025 Speech Synthesis text-to-speech
— Unverified 00 Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis Jul 18, 2018 Acoustic Modelling Decoder
— Unverified 00 FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model Mar 6, 2023 Language Modeling Language Modelling
— Unverified 00 From `Solved Problems' to New Challenges: A Report on LDC Activities May 1, 2018 Dialogue Management Language Identification
— Unverified 00 Building Text-To-Speech Voices in the Cloud May 1, 2012 Speech Recognition Speech Synthesis
— Unverified 00 From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars Jun 16, 2025 GPU Speech Synthesis
— Unverified 00 From Speaker Identification to Affective Analysis: A Multi-Step System for Analyzing Children's Stories Apr 1, 2014 Age Estimation Speaker Identification
— Unverified 00 Empirical Analysis of Oral and Nasal Vowels of Konkani May 17, 2023 Speech Synthesis
— Unverified 00 Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement Aug 27, 2021 Audio Signal Processing Speech Enhancement
— Unverified 00 Emphasized Accent Phrase Prediction from Text for Advertisement Text-To-Speech Synthesis Dec 1, 2014 Speech Synthesis text-to-speech
— Unverified 00