SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 11011125 of 1419 papers

TitleStatusHype
Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement0
Naturalization of Text by the Insertion of Pauses and Filler WordsCode0
Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis0
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesisCode1
Semi-supervised URL Segmentation with Recurrent Neural NetworksPre-trained on Knowledge Graph EntitiesCode1
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework0
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech0
Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time0
StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive NormalizationCode1
Training Wake Word Detection with Synthesized Speech Data on Confusion Words0
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech0
Learning from Explanations and Demonstrations: A Pilot Study0
IESTAC: English-Italian Parallel Corpus for End-to-End Speech-to-Text Machine TranslationCode1
Effective Deep Learning Models for Automatic Diacritization of Arabic TextCode1
DeviceTTS: A Small-Footprint, Fast, Stable Network for On-Device Text-to-Speech0
Effective Decoder Masking for Transformer Based End-to-End Speech Recognition0
One-class learning towards generalized voice spoofing detectionCode1
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators0
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition0
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis0
The NTU-AISG Text-to-speech System for Blizzard Challenge 20200
NU-GAN: High resolution neural upsampling with GAN0
A Mask-based Model for Mandarin Chinese Polyphone Disambiguation0
Learning Speaker Embedding from Text-to-SpeechCode0
An Investigation of the Relation Between Grapheme Embeddings and Pronunciation for Tacotron-based Systems0
Show:102550
← PrevPage 45 of 57Next →

No leaderboard results yet.