AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms Nov 9, 2018 GPU Image Captioning
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models Sep 22, 2022 Speech Synthesis text-to-speech
Exploring Transfer Learning for Urdu Speech Synthesis Jun 1, 2022 Speech Synthesis text-to-speech
CASSANDRA: A multipurpose configurable voice-enabled human-computer-interface Apr 1, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Fast Bootstrapping of Grapheme to Phoneme System for Under-resourced Languages - Application to the Iban Language Oct 1, 2013 Speech Recognition Speech Synthesis
Environment Aware Text-to-Speech Synthesis Oct 8, 2021 Attribute Disentanglement
Building Text-to-Speech Systems for Resource Poor Languages May 1, 2012 Clustering Speech Synthesis
Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback Jun 2, 2024 Speech Synthesis text-to-speech
Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis May 1, 2012 Audio-Visual Speech Recognition Speech Recognition
An In-depth Analysis of the Effect of Text Normalization in Social Media May 1, 2015 Dependency Parsing named-entity-recognition
Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement Nov 8, 2020 Disentanglement Speech Synthesis
Flavored Tacotron: Conditional Learning for Prosodic-linguistic Features Apr 8, 2021 Decoder Speech Synthesis
Adaptive Parser-Centric Text Normalization Aug 1, 2013 Machine Translation Speech Recognition
FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis Jun 30, 2024 CPU Decoder
FMSD-TTS: Few-shot Multi-Speaker Multi-Dialect Text-to-Speech Synthesis for Ü-Tsang, Amdo and Kham Speech Dataset Generation May 20, 2025 Dataset Generation Speech Synthesis
Full-text Error Correction for Chinese Speech Recognition with Large Language Model Sep 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
LDC Forced Aligner May 1, 2012 Sentence Speech Recognition
Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis Mar 14, 2019 Generative Adversarial Network Speech Synthesis
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE Oct 19, 2020 Speech Synthesis text-to-speech
BUCEADOR, a multi-language search engine for digital libraries May 1, 2012 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Generative Pre-training for Speech with Flow Matching Oct 25, 2023 Speech Enhancement Speech Synthesis
Generative Semantic Communication for Text-to-Speech Synthesis Oct 4, 2024 Quantization Semantic Communication
End-to-End Feedback Loss in Speech Chain Framework via Straight-Through Estimator Oct 31, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Boosting Large Language Model for Speech Synthesis: An Empirical Study Dec 30, 2023 Language Modeling Language Modelling
A Taxonomy of Specific Problem Classes in Text-to-Speech Synthesis: Comparing Commercial and Open Source Performance May 1, 2016 Speech Synthesis text-to-speech
An Investigation of the Relation Between Grapheme Embeddings and Pronunciation for Tacotron-based Systems Oct 21, 2020 Grapheme-to-Phoneme Conversion Relation
Emphasized Accent Phrase Prediction from Text for Advertisement Text-To-Speech Synthesis Dec 1, 2014 Speech Synthesis text-to-speech
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model Jul 4, 2022 Language Modeling Language Modelling
Efficient training strategies for natural sounding speech synthesis and speaker adaptation based on FastPitch Oct 9, 2024 Speech Synthesis text-to-speech
ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech Feb 13, 2025 Adversarial Attack Adversarial Attack Detection
An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis Dec 8, 2023 Benchmarking Quantization
Accent conversion using discrete units with parallel data synthesized from controllable accented TTS Sep 30, 2024 Data Augmentation Speech Synthesis
Efficient Generative Modeling with Residual Vector Quantization-Based Tokens Dec 13, 2024 Conditional Image Generation Image Generation
Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS Oct 9, 2024 Diversity Speech Synthesis
Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment Oct 28, 2019 Hard Attention Speech Synthesis
BAD: An Assistant tool for making verses in Basque Apr 1, 2012 Speech Synthesis Text-To-Speech Synthesis
AS-Speech: Adaptive Style For Speech Synthesis Sep 9, 2024 Rhythm Speech Synthesis
DurIAN-E: Duration Informed Attention Network For Expressive Text-to-Speech Synthesis Sep 22, 2023 Denoising Speech Synthesis
A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models Apr 22, 2025 cross-modal alignment Script Generation
A Challenge Set and Methods for Noun-Verb Ambiguity Oct 1, 2018 Speech Synthesis text-to-speech
Large tagset labeling using Feed Forward Neural Networks. Case study on Romanian Language Aug 1, 2013 Machine Translation Part-Of-Speech Tagging
DurIAN-E 2: Duration Informed Attention Network with Adaptive Variational Autoencoder and Adversarial Learning for Expressive Text-to-Speech Synthesis Oct 17, 2024 Speech Synthesis text-to-speech
Duration Modeling by Multi-Models based on Vowel Production characteristics Dec 1, 2014 Speech Synthesis Text-To-Speech Synthesis
AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis Apr 14, 2025 RAG Retrieval-augmented Generation
Dual Script E2E framework for Multilingual and Code-Switching ASR Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Do Prosody Transfer Models Transfer Prosody? Mar 7, 2023 Speech Synthesis text-to-speech
Autoregressive Speech Synthesis without Vector Quantization Jul 11, 2024 Audio Compression Diversity
A Review of Deep Learning Techniques for Speech Processing Apr 30, 2023 Automatic Speech Recognition Deep Learning
DNN-based Speech Synthesis for Indian Languages from ASCII text Aug 18, 2016 Speech Synthesis text-to-speech
Applying Syntax–Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis Mar 29, 2022 Speech Synthesis text-to-speech