Guided Flows for Generative Modeling and Decision Making Nov 22, 2023 Conditional Image Generation Decision Making
— Unverified 0Guided-TTS:Text-to-Speech with Untranscribed Speech Sep 29, 2021 Speech Synthesis text-to-speech
— Unverified 0Guided-TTS: A Diffusion Model for Text-to-Speech via Classifier Guidance Nov 23, 2021 speech-recognition Speech Recognition
— Unverified 0HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis Oct 6, 2024 Language Modeling Language Modelling
— Unverified 0Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis Sep 17, 2020 Expressive Speech Synthesis Speech Synthesis
— Unverified 0Hierarchical Representation of Prosody for Statistical Speech Synthesis Oct 7, 2015 Speech Synthesis text-to-speech
— Unverified 0High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units Jun 29, 2023 Speech Synthesis text-to-speech
— Unverified 0Hippocratic Abbreviation Expansion Jun 1, 2014 Information Retrieval Machine Translation
— Unverified 0HMM-based Mandarin Singing Voice Synthesis Using Tailored Synthesis Units and Question Sets Dec 1, 2013 Singing Voice Synthesis Speech Synthesis
— Unverified 0UzbekTagger: The rule-based POS tagger for Uzbek language Jan 30, 2023 Language Modeling Language Modelling
— Unverified 0VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages May 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis Jul 4, 2024 Accented Speech Recognition Automatic Speech Recognition
— Unverified 0Improving Audio Codec-based Zero-Shot Text-to-Speech Synthesis with Multi-Modal Context and Large Language Model Jun 6, 2024 Language Modeling Language Modelling
— Unverified 0Improving homograph disambiguation with supervised machine learning May 1, 2018 BIG-bench Machine Learning Speech Synthesis
— Unverified 0Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis Dec 22, 2024 Decoder Disentanglement
— Unverified 0Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time Nov 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Incremental Text-to-Speech Synthesis with Prefix-to-Prefix Framework Nov 7, 2019 Sentence Speech Synthesis
— Unverified 0Individuality-Preserving Spectrum Modification for Articulation Disorders Using Phone Selective Synthesis Sep 1, 2015 Speech Synthesis Text-To-Speech Synthesis
— Unverified 0VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers Jun 8, 2024 Speech Synthesis text-to-speech
— Unverified 0Investigating Inter- and Intra-speaker Voice Conversion using Audiobooks Jun 1, 2022 Speech Synthesis text-to-speech
— Unverified 0Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language Dec 16, 2022 Language Modeling Language Modelling
— Unverified 0Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis May 20, 2020 Speech Synthesis text-to-speech
— Unverified 0ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech Feb 13, 2025 Adversarial Attack Adversarial Attack Detection
— Unverified 0VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment Jun 12, 2024 Quantization Speech Synthesis
— Unverified 0VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention Feb 12, 2021 Speech Synthesis text-to-speech
— Unverified 0Large tagset labeling using Feed Forward Neural Networks. Case study on Romanian Language Aug 1, 2013 Machine Translation Part-Of-Speech Tagging
— Unverified 0AS-Speech: Adaptive Style For Speech Synthesis Sep 9, 2024 Rhythm Speech Synthesis
— Unverified 0LDC Forced Aligner May 1, 2012 Sentence Speech Recognition
— Unverified 0Variations prosodiques en synth\`ese par s\'election d'unit\'es: l'exemple des phrases interrogatives (Prosodic variations in unit-based speech synthesis: the example of interrogative sentences) [in French] Jun 1, 2012 Speech Synthesis Text-To-Speech Synthesis
— Unverified 0Learning Sentiment Lexicons in Spanish May 1, 2012 Opinion Mining Question Answering
— Unverified 0Leveraging supplemental representations for sequential transduction Jun 1, 2012 Speech Synthesis Text-To-Speech Synthesis
— Unverified 0Lightweight End-to-end Text-to-speech Synthesis for low resource on-device applications May 12, 2025 Speech Synthesis text-to-speech
— Unverified 0A Review of Deep Learning Techniques for Speech Processing Apr 30, 2023 Automatic Speech Recognition Deep Learning
— Unverified 0Listening while Speaking: Speech Chain by Deep Learning Jul 16, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Location, Location: Enhancing the Evaluation of Text-to-Speech Synthesis Using the Rapid Prosody Transcription Paradigm Jul 6, 2021 Speech Synthesis text-to-speech
— Unverified 0Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network Sep 22, 2021 Knowledge Distillation Language Modeling
— Unverified 0Low-Resource Text-to-Speech Synthesis Using Noise-Augmented Training of ForwardTacotron Jan 10, 2025 Speech Synthesis text-to-speech
— Unverified 0M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis May 3, 2023 Speech Synthesis text-to-speech
— Unverified 0Machine Speech Chain with One-shot Speaker Adaptation Mar 28, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Vers une annotation automatique de corpus audio pour la synth\`ese de parole (Towards Fully Automatic Annotation of Audio Books for Text-To-Speech (TTS) Synthesis) [in French] Jun 1, 2012 Speech Synthesis text-to-speech
— Unverified 0Applying Syntaxx2013Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis Mar 29, 2022 Speech Synthesis text-to-speech
— Unverified 0Minimally Supervised Number Normalization Jan 1, 2016 speech-recognition Speech Recognition
— Unverified 0Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MM-TTS: Multi-modal Prompt based Style Transfer for Expressive Text-to-Speech Synthesis Dec 17, 2023 Speech Synthesis Style Transfer
— Unverified 0Accent conversion using discrete units with parallel data synthesized from controllable accented TTS Sep 30, 2024 Data Augmentation Speech Synthesis
— Unverified 0Modular Meta-Learning with Shrinkage Sep 12, 2019 Image Classification Meta-Learning
— Unverified 0Applying Automated Machine Translation to Educational Video Courses Jan 9, 2023 Machine Translation Speech Synthesis
— Unverified 0MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting May 19, 2023 Speech Synthesis text-to-speech
— Unverified 0Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning Feb 10, 2021 Speech Synthesis text-to-speech
— Unverified 0Multi-Scale Accent Modeling and Disentangling for Multi-Speaker Multi-Accent Text-to-Speech Synthesis Jun 16, 2024 Disentanglement Speech Synthesis
— Unverified 0