SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 11511200 of 1419 papers

TitleStatusHype
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework0
Training Wake Word Detection with Synthesized Speech Data on Confusion Words0
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech0
Learning from Explanations and Demonstrations: A Pilot Study0
DeviceTTS: A Small-Footprint, Fast, Stable Network for On-Device Text-to-Speech0
Effective Decoder Masking for Transformer Based End-to-End Speech Recognition0
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators0
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition0
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis0
The NTU-AISG Text-to-speech System for Blizzard Challenge 20200
NU-GAN: High resolution neural upsampling with GAN0
Learning Speaker Embedding from Text-to-SpeechCode0
A Mask-based Model for Mandarin Chinese Polyphone Disambiguation0
An Investigation of the Relation Between Grapheme Embeddings and Pronunciation for Tacotron-based Systems0
Replacing Human Audio with Synthetic Audio for On-device Unspoken Punctuation Prediction0
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE0
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion0
Improving Low Resource Code-switched ASR using Augmented Code-switched TTS0
Latent linguistic embedding for cross-lingual text-to-speech and voice conversion0
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems0
Neural Speech Synthesis for Estonian0
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTSCode0
JSSS: free Japanese speech corpus for summarization and simplificationCode0
Compress Polyphone Pronunciation Prediction Model with Shared Labels0
Automatic Arabic Dialect Identification Systems for Written Texts: A Survey0
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis0
Controllable neural text-to-speech synthesis using intuitive prosodic features0
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS0
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer0
Textual Echo Cancellation0
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages0
Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems0
LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition0
Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes0
Incremental Text to Speech for Neural Sequence-to-Sequence Models using Reinforcement Learning0
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability0
A Transfer Learning End-to-End ArabicText-To-Speech (TTS) Deep Architecture0
Normalizing Text using Language Modelling based on Phonetics and String Similarity0
Generic Indic Text-to-speech Synthesisers with Rapid Adaptation in an End-to-end Framework0
Defense for Black-box Attacks on Anti-spoofing Models by Self-Supervised LearningCode0
NAUTILUS: a Versatile Voice Cloning System0
Cross-lingual Multispeaker Text-to-Speech under Limited-Data Scenario0
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis0
Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech0
Knowledge-and-Data-Driven Amplitude Spectrum Prediction for Hierarchical Neural Vocoders0
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation0
JDI-T: Jointly trained Duration Informed Transformer for Text-To-Speech without Explicit Alignment0
You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation0
AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN0
DiscreTalk: Text-to-Speech as a Machine Translation Problem0
Show:102550
← PrevPage 24 of 29Next →

No leaderboard results yet.