Auto Spell Suggestion for High Quality Speech Synthesis in Hindi Feb 15, 2014 Speech Synthesis text-to-speech
— Unverified 0AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis Apr 14, 2025 RAG Retrieval-augmented Generation
— Unverified 0A Variational EM Method for Pole-Zero Modeling of Speech with Mixed Block Sparse and Gaussian Excitation Jun 24, 2017 speech-recognition Speech Recognition
— Unverified 0AV-Flow: Transforming Text to Audio-Visual Human-like Interactions Feb 18, 2025 Speech Synthesis
— Unverified 0A Waveform Representation Framework for High-quality Statistical Parametric Speech Synthesis Oct 6, 2015 Speech Synthesis Vocal Bursts Intensity Prediction
— Unverified 0BAD: An Assistant tool for making verses in Basque Apr 1, 2012 Speech Synthesis Text-To-Speech Synthesis
— Unverified 0Bahasa Harmony: A Comprehensive Dataset for Bahasa Text-to-Speech Synthesis with Discrete Codec Modeling of EnGen-TTS Oct 9, 2024 Diversity Speech Synthesis
— Unverified 0Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM Feb 24, 2025 Automatic Speech Recognition Language Modeling
— Unverified 0Bayesian Subspace HMM for the Zerospeech 2020 Challenge May 19, 2020 Speech Synthesis
— Unverified 0BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model Jul 4, 2022 Language Modeling Language Modelling
— Unverified 0Beyond Oversmoothing: Evaluating DDPM and MSE for Scalable Speech Synthesis in ASR Oct 16, 2024 Denoising Speech Synthesis
— Unverified 0Beyond Statistical Similarity: Rethinking Metrics for Deep Generative Models in Engineering Design Feb 6, 2023 Drug Discovery Learning Theory
— Unverified 0BinauralFlow: A Causal and Streamable Approach for High-Quality Binaural Speech Synthesis with Flow Matching Models May 28, 2025 Speech Synthesis
— Unverified 0Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks Oct 26, 2022 Image Captioning Language Modeling
— Unverified 0Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion Jun 9, 2023 Denoising Speech Synthesis
— Unverified 0Boosting Large Language Model for Speech Synthesis: An Empirical Study Dec 30, 2023 Language Modeling Language Modelling
— Unverified 0BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0 Dec 21, 2023 Speech Synthesis Transfer Learning
— Unverified 0BUCEADOR, a multi-language search engine for digital libraries May 1, 2012 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Building and using language resources and infrastructure to develop e-learning programs for a minority language May 1, 2017 Language Acquisition Speech Synthesis
— Unverified 0Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis May 1, 2012 Audio-Visual Speech Recognition Speech Recognition
— Unverified 0Building A User-Centric and Content-Driven Socialbot May 6, 2020 Articles Management
— Unverified 0Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech May 1, 2018 Automatic Speech Recognition (ASR) Speech Recognition
— Unverified 0Building Open-source Speech Technology for Low-resource Minority Languages with SáMi as an Example – Tools, Methods and Experiments Jun 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Building Synthetic Voices in the META-NET Framework May 1, 2012 Speech Synthesis Voice Conversion
— Unverified 0Building Text-to-Speech Systems for Resource Poor Languages May 1, 2012 Clustering Speech Synthesis
— Unverified 0Building Text-To-Speech Voices in the Cloud May 1, 2012 Speech Recognition Speech Synthesis
— Unverified 0BU-TTS: An Open-Source, Bilingual Welsh-English, Text-to-Speech Corpus Jun 1, 2022 Speech Synthesis text-to-speech
— Unverified 0Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes Nov 22, 2018 All speech-recognition
— Unverified 0CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center May 23, 2023 Speech Synthesis
— Unverified 0Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end? Sep 12, 2023 Self-Supervised Learning Speech Synthesis
— Unverified 0Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? Jun 11, 2024 Contrastive Learning Speech Synthesis
— Unverified 0Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data Mar 2, 2018 Generative Adversarial Network Speech Enhancement
— Unverified 0CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech Jun 3, 2025 Speech Synthesis text-to-speech
— Unverified 0Casa de la Lh\'engua: a set of language resources and natural language processing tools for Mirandese May 1, 2014 POS POS Tagging
— Unverified 0CASSANDRA: A multipurpose configurable voice-enabled human-computer-interface Apr 1, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Chain-of-Thought Training for Open E2E Spoken Dialogue Systems May 31, 2025 Language Modeling Language Modelling
— Unverified 0ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings May 23, 2023 Chatbot Reading Comprehension
— Unverified 0CHATR the Corpus; a 20-year-old archive of Concatenative Speech Synthesis May 1, 2016 Speech Synthesis
— Unverified 0CHiVE: Varying Prosody in Speech Synthesis with a Linguistically Driven Dynamic Hierarchical Conditional Variational Network May 17, 2019 Decoder Sentence
— Unverified 0ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus Feb 28, 2023 Speech Synthesis text-to-speech
— Unverified 0CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram Sep 12, 2023 Denoising Speech Denoising
— Unverified 0Cloning one's voice using very limited data in the wild Oct 7, 2021 Speech Synthesis
— Unverified 0CML-TTS A Multilingual Dataset for Speech Synthesis in Low-Resource Languages Jun 16, 2023 Speech Synthesis text-to-speech
— Unverified 0CoALT: A Software for Comparing Automatic Labelling Tools May 1, 2012 Speech Recognition Speech Synthesis
— Unverified 0CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems Jun 11, 2024 Audio Synthesis Face Swapping
— Unverified 0Code-Mixed Text to Speech Synthesis under Low-Resource Constraints Dec 2, 2023 Speech Synthesis text-to-speech
— Unverified 0Collaborative Watermarking for Adversarial Speech Synthesis Sep 26, 2023 Speaker Verification Speech Synthesis
— Unverified 0Collective Learning Mechanism based Optimal Transport Generative Adversarial Network for Non-parallel Voice Conversion Apr 18, 2025 Generative Adversarial Network Image Generation
— Unverified 0Combining Human Inputters and Language Services to provide Multi-language support system for International Symposiums Dec 1, 2016 Automatic Speech Recognition (ASR) Machine Translation
— Unverified 0Combining Incremental Language Generation and Incremental Speech Synthesis for Adaptive Information Presentation Jul 1, 2012 Speech Synthesis Spoken Dialogue Systems
— Unverified 0