Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 901–950 of 1419 papers

Title	Date	Tasks	Status
Expressive, Variable, and Controllable Duration Modelling in TTS	Jun 28, 2022	Normalising FlowsSpeech Synthesis	—Unverified
Few-Shot Cross-Lingual TTS Using Transferable Phoneme Embedding	Jun 27, 2022	Few-Shot Learningtext-to-speech	—Unverified
Synthesizing Personalized Non-speech Vocalization from Discrete Speech Representations	Jun 25, 2022	text-to-speechText to Speech	—Unverified
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue	Jun 24, 2022	text-to-speechText to Speech	—Unverified
Exact Prosody Cloning in Zero-Shot Multispeaker Text-to-Speech	Jun 24, 2022	text-to-speechText to Speech	—Unverified
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech	Jun 24, 2022	Rhythmtext-to-speech	—Unverified
A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data	Jun 22, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS	Jun 21, 2022	text-to-speechText to Speech	—Unverified
Towards Optimizing OCR for Accessibility	Jun 21, 2022	Optical Character Recognition (OCR)text-to-speech	—Unverified
NatiQ: An End-to-end Text-to-Speech System for Arabic	Jun 15, 2022	Decodertext-to-speech	—Unverified
A Novel Chinese Dialect TTS Frontend with Non-Autoregressive Neural Machine Translation	Jun 10, 2022	Machine Translationtext-to-speech	—Unverified
Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos	Jun 9, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
FlexLip: A Controllable Text-to-Lip System	Jun 7, 2022	Audio Generationtext-to-speech	—Unverified
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE	Jun 6, 2022	Representation LearningSpeech Representation Learning	—Unverified
Audiobook Dialogues as Training Data for Conversational Style Synthetic Voices	Jun 1, 2022	Sentencetext-to-speech	—Unverified
Error Annotation in Post-Editing Machine Translation: Investigating the Impact of Text-to-Speech Technology	Jun 1, 2022	Machine Translationtext-to-speech	—Unverified
Exploring Transfer Learning for Urdu Speech Synthesis	Jun 1, 2022	Speech Synthesistext-to-speech	—Unverified
Text-to-Speech for Under-Resourced Languages: Phoneme Mapping and Source Language Selection in Transfer Learning	Jun 1, 2022	Cross-Lingual Transfertext-to-speech	—Unverified
The Nós Project: Opening routes for the Galician language in the field of language technologies	Jun 1, 2022	Cultural Vocal Bursts Intensity PredictionMachine Translation	—Unverified
An Open Source Web Reader for Under-Resourced Languages	Jun 1, 2022	text-to-speechText to Speech	CodeCode Available
Reading Assistance through LARA, the Learning And Reading Assistant	Jun 1, 2022	text-to-speechText to Speech	—Unverified
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru forSpeech Recognition	Jun 1, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
BU-TTS: An Open-Source, Bilingual Welsh-English, Text-to-Speech Corpus	Jun 1, 2022	Speech Synthesistext-to-speech	—Unverified
Building Open-source Speech Technology for Low-resource Minority Languages with SáMi as an Example – Tools, Methods and Experiments	Jun 1, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Investigating Inter- and Intra-speaker Voice Conversion using Audiobooks	Jun 1, 2022	Speech Synthesistext-to-speech	—Unverified
Using the LARA Little Prince to compare human and TTS audio quality	Jun 1, 2022	text-to-speechText to Speech	—Unverified
ParlamentParla: A Speech Corpus of Catalan Parliamentary Sessions	Jun 1, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish	May 31, 2022	Machine TranslationSpeech Synthesis	CodeCode Available
Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data	May 30, 2022	text-to-speechText to Speech	—Unverified
Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning	May 29, 2022	ArticlesMachine Translation	—Unverified
QSpeech: Low-Qubit Quantum Speech Application Toolkit	May 26, 2022	text-to-speechText to Speech	CodeCode Available
T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation	May 24, 2022	DecoderMachine Translation	—Unverified
Talking Face Generation with Multilingual TTS	May 13, 2022	Face GenerationTalking Face Generation	—Unverified
ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence	May 9, 2022	Speech Synthesistext-to-speech	—Unverified
Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022	May 1, 2022	DecoderKnowledge Distillation	CodeCode Available
Systematic Inequalities in Language Technology Performance across the World’s Languages	May 1, 2022	Dependency ParsingMachine Translation	CodeCode Available
Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss	Apr 28, 2022	text-to-speechText to Speech	—Unverified
LibriS2S: A German-English Speech-to-Speech Translation Corpus	Apr 22, 2022	Speech-to-Speech TranslationSpeech-to-Text	CodeCode Available
Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation	Apr 21, 2022	Data Augmentationtext-to-speech	—Unverified
Audio Deep Fake Detection System with Neural Stitching for ADD 2022	Apr 19, 2022	text-to-speechText to Speech	—Unverified
Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech	Apr 14, 2022	Language Acquisitiontext-to-speech	—Unverified
Study of Indian English Pronunciation Variabilities relative to Received Pronunciation	Apr 13, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Enhancement of Pitch Controllability using Timbre-Preserving Pitch Augmentation in FastPitch	Apr 12, 2022	Sentencetext-to-speech	—Unverified
Fine-grained Noise Control for Multispeaker Speech Synthesis	Apr 11, 2022	Expressive Speech SynthesisSpeech Synthesis	—Unverified
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance	Apr 11, 2022	Speaker VerificationSpeech Synthesis	—Unverified
Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech	Apr 8, 2022	Diversitytext-to-speech	—Unverified
Karaoker: Alignment-free singing voice synthesis with speech training data	Apr 8, 2022	Singing Voice SynthesisSpeaker Identification	—Unverified
Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis	Apr 7, 2022	QuantizationSpeech Synthesis	—Unverified
Arabic Text-To-Speech (TTS) Data Preparation	Apr 7, 2022	text-to-speechText to Speech	—Unverified
Representation Selective Self-distillation and wav2vec 2.0 Feature Exploration for Spoof-aware Speaker Verification	Apr 6, 2022	AttributeSpeaker Verification	—Unverified

Show:10 25 50

← PrevPage 19 of 29Next →

No leaderboard results yet.