SOTAVerified

Text to Speech

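from gtts import gTTS
import os

def text_to_speech_kurdish(text, output_file="output.mp3"):
    # Convert text to speech in Kurdish (language code "ku" selects Kurdish).
    # gTTS raises ValueError if the installed version does not support "ku";
    # gtts.lang.tts_langs() lists the supported codes.
    tts = gTTS(text=text, lang='ku', slow=False)
    tts.save(output_file)
    os.system(f"start {output_file}")  # Open the audio file (on Windows)

# Example (the text means "Hello, this is my voice in Kurdish."):
text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")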
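The snippet above opens the saved file with `start`, which only works on Windows. A minimal sketch of a cross-platform alternative follows, assuming the usual `open` (macOS) and `xdg-open` (Linux desktop) launchers are installed. The helper names `open_command` and `play` are hypothetical and not part of gTTS.

```python
import subprocess
import sys


def open_command(path, platform=sys.platform):
    """Return the argument list that opens `path` with the default application."""
    if platform.startswith("win"):
        # `start` is a cmd.exe builtin; the empty string fills its window-title slot.
        return ["cmd", "/c", "start", "", path]
    if platform == "darwin":
        return ["open", path]
    # Assumption: a Linux desktop with xdg-utils installed.
    return ["xdg-open", path]


def play(path):
    """Launch the default player for `path` without raising on a non-zero exit."""
    subprocess.run(open_command(path), check=False)
```

Calling `play(output_file)` in place of the `os.system` line would leave the rest of the function unchanged.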

Papers

Showing 1101–1150 of 1419 papers

Title | Status | Hype
Fine-grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement | - | 0
Naturalization of Text by the Insertion of Pauses and Filler Words | Code | 0
Improving Prosody Modelling with Cross-Utterance BERT Embeddings for End-to-end Speech Synthesis | - | 0
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis | Code | 1
Semi-supervised URL Segmentation with Recurrent Neural Networks Pre-trained on Knowledge Graph Entities | Code | 1
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework | - | 0
Prosodic Representation Learning and Contextual Sampling for Neural Text-to-Speech | - | 0
Incremental Machine Speech Chain Towards Enabling Listening while Speaking in Real-time | - | 0
StyleMelGAN: An Efficient High-Fidelity Adversarial Vocoder with Temporal Adaptive Normalization | Code | 1
Training Wake Word Detection with Synthesized Speech Data on Confusion Words | - | 0
Learning to Maximize Speech Quality Directly Using MOS Prediction for Neural Text-to-Speech | - | 0
Learning from Explanations and Demonstrations: A Pilot Study | - | 0
IESTAC: English-Italian Parallel Corpus for End-to-End Speech-to-Text Machine Translation | Code | 1
Effective Deep Learning Models for Automatic Diacritization of Arabic Text | Code | 1
DeviceTTS: A Small-Footprint, Fast, Stable Network for On-Device Text-to-Speech | - | 0
Effective Decoder Masking for Transformer Based End-to-End Speech Recognition | - | 0
One-class learning towards generalized voice spoofing detection | Code | 1
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators | - | 0
Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition | - | 0
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis | - | 0
The NTU-AISG Text-to-speech System for Blizzard Challenge 2020 | - | 0
NU-GAN: High resolution neural upsampling with GAN | - | 0
A Mask-based Model for Mandarin Chinese Polyphone Disambiguation | - | 0
Learning Speaker Embedding from Text-to-Speech | Code | 0
An Investigation of the Relation Between Grapheme Embeddings and Pronunciation for Tacotron-based Systems | - | 0
Replacing Human Audio with Synthetic Audio for On-device Unspoken Punctuation Prediction | - | 0
End-to-End Text-to-Speech using Latent Duration based on VQ-VAE | - | 0
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion | - | 0
Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview | Code | 1
Improving Low Resource Code-switched ASR using Augmented Code-switched TTS | - | 0
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems | - | 0
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling | Code | 1
Latent linguistic embedding for cross-lingual text-to-speech and voice conversion | - | 0
Neural Speech Synthesis for Estonian | - | 0
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS | Code | 0
JSSS: free Japanese speech corpus for summarization and simplification | Code | 0
Compress Polyphone Pronunciation Prediction Model with Shared Labels | - | 0
Automatic Arabic Dialect Identification Systems for Written Texts: A Survey | - | 0
Accent Estimation of Japanese Words from Their Surfaces and Romanizations for Building Large Vocabulary Accent Dictionaries | Code | 1
Hierarchical Multi-Grained Generative Model for Expressive Speech Synthesis | - | 0
Controllable neural text-to-speech synthesis using intuitive prosodic features | - | 0
What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS | - | 0
Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer | - | 0
Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion | Code | 1
Textual Echo Cancellation | - | 0
Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding | Code | 1
Unsupervised Learning For Sequence-to-sequence Text-to-speech For Low-resource Languages | - | 0
Bunched LPCNet : Vocoder for Low-cost Neural Text-To-Speech Systems | - | 0
Speaker Conditional WaveRNN: Towards Universal Neural Vocoder for Unseen Speaker and Recording Conditions | Code | 1
LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition | - | 0
Page 23 of 29

No leaderboard results yet.