SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 526550 of 1419 papers

TitleStatusHype
Augmenting text for spoken language understanding with Large Language Models0
HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methodsCode1
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions0
Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech0
FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech CodecCode2
Direct Text to Speech Translation System using Acoustic Units0
Multi-Modal Automatic Prosody Annotation with Contrastive Pretraining of SSWPCode1
VoiceFlow: Efficient Text-to-Speech with Rectified Flow MatchingCode2
Cross-Utterance Conditioned VAE for Speech Generation0
Large-Scale Automatic Audiobook Creation0
MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 20230
GRASS: Unified Generation Model for Speech-to-Semantic Tasks0
PromptTTS 2: Describing and Generating Voices with Text Prompt0
A Comparative Analysis of Pretrained Language Models for Text-to-Speech0
The FruitShell French synthesis system at the Blizzard 2023 Challenge0
Learning Speech Representation From Contrastive Token-Acoustic Pretraining0
QS-TTS: Towards Semi-Supervised Text-to-Speech Synthesis via Vector-Quantized Self-Supervised Speech Representation LearningCode1
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language ModelsCode2
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis0
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information0
The DeepZen Speech Synthesis System for Blizzard Challenge 20230
Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech0
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech ModelsCode1
Rep2wav: Noise Robust text-to-speech Using self-supervised representations0
Generalizable Zero-Shot Speaker Adaptive Speech Synthesis with Disentangled Representations0
Show:102550
← PrevPage 22 of 57Next →

No leaderboard results yet.