SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 11511200 of 1419 papers

TitleStatusHype
Pretraining Techniques for Sequence-to-Sequence Voice ConversionCode1
Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes0
Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning0
Phonological Features for 0-shot Multilingual Speech SynthesisCode1
One Model, Many Languages: Meta-learning for Multilingual Text-to-SpeechCode1
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability0
A Transfer Learning End-to-End ArabicText-To-Speech (TTS) Deep Architecture0
Normalizing Text using Language Modelling based on Phonetics and String Similarity0
Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework0
FastPitch: Parallel Text-to-speech with Pitch PredictionCode1
FastSpeech 2: Fast and High-Quality End-to-End Text to SpeechCode1
MultiSpeech: Multi-Speaker Text to Speech with TransformerCode1
Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised LearningCode0
End-to-End Adversarial Text-to-SpeechCode1
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment SearchCode1
NAUTILUS: a Versatile Voice Cloning System0
Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario0
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis0
Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech0
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders0
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation0
JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment0
You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation0
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech SynthesisCode1
DiscreTalk: Text-to-Speech as a Machine Translation Problem0
AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN0
Exploring TTS without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020)Code0
Luganda Text-to-Speech MachineCode0
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback ConstraintCode1
Crowdsourcing Latin American Spanish for Low-Resource Text-to-Speech0
Open-source Multi-speaker Speech Corpora for Building Gujarati, Kannada, Malayalam, Marathi, Tamil and Telugu Speech Synthesis Systems0
Burmese Speech Corpus, Finite-State Text Normalization and Pronunciation Grammars with an Application to Text-to-Speech0
IndicSpeech: Text-to-Speech Corpus for Indian Languages0
Corpus Generation for Voice Command in Smart Home and the Effect of Speech Synthesis on End-to-End SLU0
Development and Evaluation of Speech Synthesis Corpora for Latvian0
Neural Text-to-Speech Synthesis for an Under-Resourced Language in a Diglossic Environment: the Case of Gascon Occitan0
Open-Source High Quality Speech Datasets for Basque, Catalan and Galician0
Style Variation as a Vantage Point for Code-Switching0
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech0
A Study of Non-autoregressive Model for Sequence Generation0
ESPnet-ST: All-in-One Speech Translation Toolkit0
Data Processing for Optimizing Naturalness of Vietnamese Text-to-speech System0
Transformer based Grapheme-to-Phoneme ConversionCode1
Scalable Multilingual Frontend for TTS0
Generating Multilingual Voices Using Speaker Space Translation Based on Bilingual Speaker Data0
Improving Readability for Automatic Speech Recognition Transcription0
g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark DatasetCode1
Perception of prosodic variation for speech synthesis using an unsupervised discrete representation of F0Code1
Statistical Context-Dependent Units Boundary Correction for Corpus-based Unit-Selection Text-to-Speech0
AlignTTS: Efficient Feed-Forward Text-to-Speech System without Explicit AlignmentCode0
Show:102550
← PrevPage 24 of 29Next →

No leaderboard results yet.