Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 851–900 of 1419 papers

Title	Date	Tasks	Status
Explicit Intensity Control for Accented Text-to-speech	Oct 27, 2022	speech-recognitionSpeech Recognition	—Unverified
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech	Oct 27, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Improving Speech-to-Speech Translation Through Unlabeled Text	Oct 26, 2022	Machine Translationspeech-recognition	—Unverified
Semi-Supervised Learning Based on Reference Model for Low-resource TTS	Oct 25, 2022	Speech Synthesistext-to-speech	—Unverified
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data	Oct 25, 2022	DecoderDisentanglement	—Unverified
Efficiently Trained Low-Resource Mongolian Text-to-Speech System Based On FullConv-TTS	Oct 24, 2022	Data AugmentationGPU	—Unverified
Low-Resource Multilingual and Zero-Shot Multispeaker TTS	Oct 21, 2022	Meta-Learningtext-to-speech	—Unverified
Adaptive re-calibration of channel-wise features for Adversarial Audio Classification	Oct 21, 2022	Audio ClassificationFace Swapping	—Unverified
Generating Synthetic Speech from SpokenVocab for Speech Translation	Oct 15, 2022	Data AugmentationMachine Translation	CodeCode Available
LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge	Oct 14, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy	Oct 13, 2022	Generative Adversarial NetworkSpeaker anonymization	—Unverified
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar	Oct 13, 2022	text-to-speechText to Speech	—Unverified
SQuId: Measuring Speech Naturalness in Many Languages	Oct 12, 2022	Diversitytext-to-speech	—Unverified
Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech	Oct 12, 2022	text-to-speechText to Speech	—Unverified
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era	Oct 6, 2022	Speech Synthesistext-to-speech	—Unverified
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis	Oct 1, 2022	Speech Synthesistext-to-speech	—Unverified
Facial Landmark Predictions with Applications to Metaverse	Sep 29, 2022	Decodertext-to-speech	CodeCode Available
Multi-Task Adversarial Training Algorithm for Multi-Speaker Neural Text-to-Speech	Sep 26, 2022	Generative Adversarial Networktext-to-speech	—Unverified
EPIC TTS Models: Empirical Pruning Investigations Characterizing Text-To-Speech Models	Sep 22, 2022	Speech Synthesistext-to-speech	—Unverified
Controllable Accented Text-to-Speech Synthesis	Sep 22, 2022	Speech Synthesistext-to-speech	—Unverified
Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset	Sep 14, 2022	text-to-speechText to Speech	—Unverified
SANIP: Shopping Assistant and Navigation for the visually impaired	Sep 8, 2022	Objectobject-detection	—Unverified
Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech	Sep 7, 2022	ArticlesSentence	—Unverified
Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers	Sep 5, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model	Sep 2, 2022	text-to-speechText to Speech	—Unverified
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale	Aug 21, 2022	LipreadingLip Reading	—Unverified
Speech Synthesis with Mixed Emotions	Aug 11, 2022	AttributeEmotional Speech Synthesis	—Unverified
A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis	Aug 3, 2022	Speech Synthesistext-to-speech	—Unverified
Low-data? No problem: low-resource, language-agnostic conversational text-to-speech via F0-conditioned data augmentation	Jul 29, 2022	Data Augmentationtext-to-speech	—Unverified
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis	Jul 25, 2022	Data AugmentationSpeech Synthesis	—Unverified
When Is TTS Augmentation Through a Pivot Language Useful?	Jul 20, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate	Jul 13, 2022	Speech Separationtext-to-speech	—Unverified
A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System	Jul 13, 2022	text-to-speechText to Speech	—Unverified
Text-driven Emotional Style Control and Cross-speaker Style Transfer in Neural TTS	Jul 13, 2022	Language ModelingLanguage Modelling	—Unverified
End-to-end speech recognition modeling from de-identified data	Jul 12, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition	Jul 12, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
LIP: Lightweight Intelligent Preprocessor for meaningful text-to-speech	Jul 11, 2022	text-to-speechText to Speech	—Unverified
Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS)	Jul 4, 2022	Speech Synthesistext-to-speech	—Unverified
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model	Jul 4, 2022	Language ModelingLanguage Modelling	—Unverified
Unify and Conquer: How Phonetic Feature Representation Affects Polyglot Text-To-Speech (TTS)	Jul 4, 2022	text-to-speechText to Speech	—Unverified
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need	Jul 2, 2022	AllSpeech Synthesis	—Unverified
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems	Jul 1, 2022	text-to-speechText to Speech	—Unverified
Fast Bilingual Grapheme-To-Phoneme Conversion	Jul 1, 2022	Data AugmentationGrapheme-to-Phoneme Conversion	—Unverified
A Polyphone BERT for Polyphone Disambiguation in Mandarin Chinese	Jul 1, 2022	Polyphone disambiguationtext-to-speech	—Unverified
Automatic Evaluation of Speaker Similarity	Jul 1, 2022	Speaker Verificationtext-to-speech	—Unverified
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder	Jun 30, 2022	Speech Synthesistext-to-speech	—Unverified
R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS	Jun 30, 2022	DecoderGPU	—Unverified
Improving Deliberation by Text-Only and Semi-Supervised Training	Jun 29, 2022	DecoderLanguage Modeling	—Unverified
Simple and Effective Multi-sentence TTS with Expressive and Coherent Prosody	Jun 29, 2022	Language ModelingLanguage Modelling	—Unverified
Comparison of Speech Representations for the MOS Prediction System	Jun 28, 2022	Self-Supervised Learningtext-to-speech	—Unverified

Show:10 25 50

← PrevPage 18 of 29Next →

No leaderboard results yet.