SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 12511300 of 1419 papers

TitleStatusHype
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition0
Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations0
Synthetic Speaking Children -- Why We Need Them and How to Make Them0
Synthetic Speech Detection Based on Temporal Consistency and Distribution of Speaker Features0
Talking Face Generation with Multilingual TTS0
Talrómur: A large Icelandic TTS corpus0
Statistical Context-Dependent Units Boundary Correction for Corpus-based Unit-Selection Text-to-Speech0
Teacher-Student Training for Robust Tacotron-based TTS0
Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages0
Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale0
Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise0
Teochew-Wild: The First In-the-wild Teochew Dataset with Orthographic Annotations0
Text-aware and Context-aware Expressive Audiobook Speech Synthesis0
Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS0
Text-free non-parallel many-to-many voice conversion using normalising flows0
Text Generation with Speech Synthesis for ASR Data Augmentation0
Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis0
Text Is Not All You Need: Multimodal Prompting Helps LLMs Understand Humor0
Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens0
Text-To-Speech Data Augmentation for Low Resource Speech Recognition0
Text-To-Speech for Languages without an Orthography0
Text-to-Speech for Under-Resourced Languages: Phoneme Mapping and Source Language Selection in Transfer Learning0
Text-to-Speech Pipeline for Swiss German -- A comparison0
Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder0
Text-To-Speech Synthesis In The Wild0
Textual Echo Cancellation0
The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives0
The C-ORAL-BRASIL I: Reference Corpus for Spoken Brazilian Portuguese0
The DeepZen Speech Synthesis System for Blizzard Challenge 20230
The Effects of Input Type and Pronunciation Dictionary Usage in Transfer Learning for Low-Resource Text-to-Speech0
The FruitShell French synthesis system at the Blizzard 2023 Challenge0
The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus0
The Impact of Silence on Speech Anti-Spoofing0
The MSXF TTS System for ICASSP 2022 ADD Challenge0
The Nós Project: Opening routes for the Galician language in the field of language technologies0
The NTU-AISG Text-to-speech System for Blizzard Challenge 20200
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance0
The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach0
The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains0
The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge0
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain0
TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality0
T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation0
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion0
Total-Duration-Aware Duration Modeling for Text-to-Speech Systems0
Towards Accurate Lip-to-Speech Synthesis in-the-Wild0
Towards a Japanese Full-duplex Spoken Dialogue System0
Towards a Language Service Infrastructure for Mobile Environments0
Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer0
Towards Flow-Matching-based TTS without Classifier-Free Guidance0
Show:102550
← PrevPage 26 of 29Next →

No leaderboard results yet.