A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis Aug 3, 2022 Speech Synthesis text-to-speech
— Unverified 00 A Survey of Voice Translation Methodologies - Acoustic Dialect Decoder Oct 13, 2016 Decoder Sentence
— Unverified 00 A Survey on Bridging EEG Signals and Generative AI: From Image and Text to Beyond Feb 17, 2025 Contrastive Learning EEG
— Unverified 00 ASVspoof 2019: spoofing countermeasures for the detection of synthesized, converted and replayed speech Feb 11, 2021 Speaker Verification Speech Synthesis
— Unverified 00 ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech Feb 13, 2025 Adversarial Attack Adversarial Attack Detection
— Unverified 00 A Taxonomy of Specific Problem Classes in Text-to-Speech Synthesis: Comparing Commercial and Open Source Performance May 1, 2016 Speech Synthesis text-to-speech
— Unverified 00 A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis Mar 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 A Transfer Learning End-to-End ArabicText-To-Speech (TTS) Deep Architecture Jul 22, 2020 Rhythm Speech Synthesis
— Unverified 00 Attention Forcing for Sequence-to-sequence Model Training Sep 26, 2019 Machine Translation model
— Unverified 00 Attentive activation function for improving end-to-end spoofing countermeasure systems May 3, 2022 Speech Synthesis Voice Conversion
— Unverified 00 AttS2S-VC: Sequence-to-Sequence Voice Conversion with Attention and Context Preservation Mechanisms Nov 9, 2018 GPU Image Captioning
— Unverified 00 Audio Anti-spoofing Using a Simple Attention Module and Joint Optimization Based on Additive Angular Margin Loss and Meta-learning Nov 17, 2022 Binary Classification Meta-Learning
— Unverified 00 A Survey on Audio Diffusion Models: Text To Speech Synthesis and Enhancement in Generative AI Mar 23, 2023 Speech Enhancement Speech Synthesis
— Unverified 00 AudioVisual Speech Synthesis: A brief literature review Feb 18, 2021 Speech Synthesis text-to-speech
— Unverified 00 Audiovisual Speech Synthesis using Tacotron2 Aug 3, 2020 Face Model Sentence
— Unverified 00 Audio-visual video-to-speech synthesis with synthesized input audio Jul 31, 2023 Speech Synthesis
— Unverified 00 Augmented Prompt Selection for Evaluation of Spontaneous Speech Synthesis May 1, 2020 Speech Synthesis
— Unverified 00 Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework Nov 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Augmenting Polish Automatic Speech Recognition System With Synthetic Data Oct 30, 2024 Automatic Speech Recognition speech-recognition
— Unverified 00 A Unified Framework for Collecting Text-to-Speech Synthesis Datasets for 22 Indian Languages Oct 18, 2024 Speech Synthesis text-to-speech
— Unverified 00 A unified front-end framework for English text-to-speech synthesis May 18, 2023 Speech Synthesis Text Normalization
— Unverified 00 A unified lexical processing framework based on the Margin Infused Relaxed Algorithm. A case study on the Romanian Language Sep 1, 2013 Lemmatization Speech Synthesis
— Unverified 00 A unified sequence-to-sequence front-end model for Mandarin text-to-speech synthesis Nov 11, 2019 Polyphone disambiguation Speech Synthesis
— Unverified 00 A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation Jun 18, 2019 Decoder Speech Synthesis
— Unverified 00 A Unified Transformer-based Framework for Duplex Text Normalization Aug 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 AutoLV: Automatic Lecture Video Generator Sep 19, 2022 Speech Synthesis Talking Head Generation
— Unverified 00 Automated detection of pronunciation errors in non-native English speech employing deep learning Sep 13, 2022 Speech Synthesis
— Unverified 00 Automatically Acquiring Fine-Grained Information Status Distinctions in German Jul 1, 2012 Coreference Resolution Speech Synthesis
— Unverified 00 Automatic Arabic Dialect Identification Systems for Written Texts: A Survey Sep 26, 2020 Dialect Identification Machine Translation
— Unverified 00 Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis May 29, 2023 Speech Synthesis text-to-speech
— Unverified 00 Automatic Prosody Prediction for Chinese Speech Synthesis using BLSTM-RNN and Embedding Features Nov 2, 2015 Feature Engineering Prosody Prediction
— Unverified 00 Automatic Syllabification for Manipuri language Dec 1, 2016 Automatic Speech Recognition (ASR) Segmentation
— Unverified 00 Autoregressive Diffusion Transformer for Text-to-Speech Synthesis Jun 8, 2024 Audio Generation Decoder
— Unverified 00 Autoregressive Speech Synthesis with Next-Distribution Prediction Dec 22, 2024 Language Modeling Language Modelling
— Unverified 00 Autoregressive Speech Synthesis without Vector Quantization Jul 11, 2024 Audio Compression Diversity
— Unverified 00 Auto Spell Suggestion for High Quality Speech Synthesis in Hindi Feb 15, 2014 Speech Synthesis text-to-speech
— Unverified 00 AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis Apr 14, 2025 RAG Retrieval-augmented Generation
— Unverified 00 A Variational EM Method for Pole-Zero Modeling of Speech with Mixed Block Sparse and Gaussian Excitation Jun 24, 2017 speech-recognition Speech Recognition
— Unverified 00 AV-Flow: Transforming Text to Audio-Visual Human-like Interactions Feb 18, 2025 Speech Synthesis
— Unverified 00 A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis Oct 6, 2015 Speech Synthesis Vocal Bursts Intensity Prediction
— Unverified 00 BAD: An Assistant tool for making verses in Basque Apr 1, 2012 Speech Synthesis Text-To-Speech Synthesis
— Unverified 00 Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS Oct 9, 2024 Diversity Speech Synthesis
— Unverified 00 Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM Feb 24, 2025 Automatic Speech Recognition Language Modeling
— Unverified 00 Bayesian Subspace HMM for the Zerospeech 2020 Challenge May 19, 2020 Speech Synthesis
— Unverified 00 BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model Jul 4, 2022 Language Modeling Language Modelling
— Unverified 00 Beyond Oversmoothing: Evaluating DDPM and MSE for Scalable Speech Synthesis in ASR Oct 16, 2024 Denoising Speech Synthesis
— Unverified 00 Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design Feb 6, 2023 Drug Discovery Learning Theory
— Unverified 00 BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models May 28, 2025 Speech Synthesis
— Unverified 00 Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks Oct 26, 2022 Image Captioning Language Modeling
— Unverified 00 Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion Jun 9, 2023 Denoising Speech Synthesis
— Unverified 00