Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1376–1400 of 1419 papers

Title	Date	Tasks	Status
Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment	Dec 4, 2020	Meta-Learningtext-to-speech	CodeCode Available
Multi-modal and Multi-scale Spatial Environment Understanding for Immersive Visual Text-to-Speech	Dec 16, 2024	text-to-speechText to Speech	CodeCode Available
Continuous Speech Tokenizer in Text To Speech	Oct 22, 2024	Language ModelingLanguage Modelling	CodeCode Available
MLS: A Large-Scale Multilingual Dataset for Speech Research	Dec 7, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers	Sep 5, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis	Jun 12, 2018	Speaker VerificationSpeech Synthesis	CodeCode Available
High Fidelity Speech Synthesis with Adversarial Networks	Sep 25, 2019	Generative Adversarial NetworkSpeech Synthesis	CodeCode Available
A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture	Jan 6, 2022	Speech-to-Texttext-to-speech	CodeCode Available
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation	Jul 7, 2024	Text to Speech	CodeCode Available
Adaptation of Tacotron2-based Text-To-Speech for Articulatory-to-Acoustic Mapping using Ultrasound Tongue Imaging	Jul 26, 2021	text-to-speechText to Speech	CodeCode Available
VIFS: An End-to-End Variational Inference for Foley Sound Synthesis	Jun 8, 2023	Speech Synthesistext-to-speech	CodeCode Available
Semantic Mask for Transformer based End-to-End Speech Recognition	Dec 6, 2019	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Effective parameter estimation methods for an ExcitNet model in generative text-to-speech systems	May 21, 2019	parameter estimationSpeech Synthesis	CodeCode Available
The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS	Oct 6, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Hierarchical Generative Modeling for Controllable Speech Synthesis	Oct 16, 2018	AttributeSpeech Synthesis	CodeCode Available
MelNet: A Generative Model for Audio in the Frequency Domain	Jun 4, 2019	Audio GenerationMusic Generation	CodeCode Available
Using generative modelling to produce varied intonation for speech synthesis	Jun 10, 2019	SentenceSpeech Synthesis	CodeCode Available
Applying Phonological Features in Multilingual Text-To-Speech	Oct 7, 2021	Language Acquisitiontext-to-speech	CodeCode Available
Massively Multilingual Neural Grapheme-to-Phoneme Conversion	Aug 4, 2017	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible	Jul 30, 2019	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
StyleSpeech: Parameter-efficient Fine Tuning for Pre-trained Controllable Text-to-Speech	Aug 27, 2024	parameter-efficient fine-tuningtext-to-speech	CodeCode Available
Sequence Transduction with Recurrent Neural Networks	Nov 14, 2012	Machine TranslationPhoneme Recognition	CodeCode Available
Audio Super Resolution using Neural Networks	Aug 2, 2017	Audio GenerationAudio Super-Resolution	CodeCode Available
Generating Synthetic Speech from SpokenVocab for Speech Translation	Oct 15, 2022	Data AugmentationMachine Translation	CodeCode Available
Generating Data with Text-to-Speech and Large-Language Models for Conversational Speech Recognition	Aug 17, 2024	Language ModelingLanguage Modelling	CodeCode Available

Show:10 25 50

← PrevPage 56 of 57Next →

No leaderboard results yet.