Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1051–1100 of 1419 papers

Title	Date	Tasks	Status
A Unified Transformer-based Framework for Duplex Text Normalization	Aug 23, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Fighting Game Commentator with Pitch and Loudness Adjustment Utilizing Highlight Cues	Aug 18, 2021	text-to-speechText to Speech	—Unverified
GC-TTS: Few-shot Speaker Adaptation with Geometric Constraints	Aug 16, 2021	text-to-speechText to Speech	—Unverified
Enhancing audio quality for expressive Neural Text-to-Speech	Aug 13, 2021	Acoustic ModellingSpeech Synthesis	—Unverified
RW-Resnet: A Novel Speech Anti-Spoofing Model Using Raw Waveform	Aug 12, 2021	Speaker VerificationSynthetic Speech Detection	—Unverified
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person	Aug 9, 2021	Talking Head Generationtext-to-speech	—Unverified
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text	Aug 1, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A Speech-enabled Fixed-phrase Translator for Healthcare Accessibility	Aug 1, 2021	Machine Translationspeech-recognition	—Unverified
A Survey on Audio Synthesis and Audio-Visual Multimodal Processing	Aug 1, 2021	Audio SynthesisMusic Generation	—Unverified
Cross-speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis	Jul 27, 2021	Expressive Speech SynthesisSpeech Synthesis	—Unverified
Adaptation of Tacotron2-based Text-To-Speech for Articulatory-to-Acoustic Mapping using Ultrasound Tongue Imaging	Jul 26, 2021	text-to-speechText to Speech	CodeCode Available
Digital Einstein Experience: Fast Text-to-Speech for Conversational AI	Jul 21, 2021	text-to-speechText to Speech	—Unverified
On Prosody Modeling for ASR+TTS based Voice Conversion	Jul 20, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging	Jul 12, 2021	PredictionSpeech Synthesis	CodeCode Available
Federated Learning with Dynamic Transformer for Text to Speech	Jul 9, 2021	Federated Learningtext-to-speech	—Unverified
Location, Location: Enhancing the Evaluation of Text-to-Speech Synthesis Using the Rapid Prosody Transcription Paradigm	Jul 6, 2021	Speech Synthesistext-to-speech	—Unverified
AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style	Jul 6, 2021	DecoderMixture-of-Experts	—Unverified
Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input	Jul 5, 2021	Speech Synthesistext-to-speech	CodeCode Available
GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis	Jun 29, 2021	Speech Synthesistext-to-speech	—Unverified
Multi-Scale Spectrogram Modelling for Neural Text-to-Speech	Jun 29, 2021	Sentencetext-to-speech	—Unverified
Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech	Jun 29, 2021	DecoderSentence	—Unverified
Non-Autoregressive TTS with Explicit Duration Modelling for Low-Resource Highly Expressive Speech	Jun 24, 2021	Generative Adversarial Networktext-to-speech	—Unverified
Non-native English lexicon creation for bilingual speech synthesis	Jun 21, 2021	Speech Synthesistext-to-speech	—Unverified
Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters	Jun 19, 2021	Speech Synthesistext-to-speech	—Unverified
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model	Jun 17, 2021	Emotional Speech SynthesisEmotion Classification	—Unverified
Improving the expressiveness of neural vocoding with non-affine Normalizing Flows	Jun 16, 2021	text-to-speechText to Speech	—Unverified
ADEPT: A Dataset for Evaluating Prosody Transfer	Jun 15, 2021	text-to-speechText to Speech	—Unverified
Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis	Jun 15, 2021	Speech Synthesistext-to-speech	—Unverified
A learned conditional prior for the VAE acoustic space of a TTS system	Jun 14, 2021	Sentencetext-to-speech	—Unverified
SynthASR: Unlocking Synthetic Data for Speech Recognition	Jun 14, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Improving multi-speaker TTS prosody variance with a residual encoder and normalizing flows	Jun 10, 2021	DisentanglementSentence	—Unverified
Speech BERT Embedding For Improving Prosody in Neural TTS	Jun 8, 2021	Decodertext-to-speech	—Unverified
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios	Jun 7, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Reinforce-Aligner: Reinforcement Alignment Search for Robust End-to-End Text-to-Speech	Jun 5, 2021	text-to-speechText to Speech	—Unverified
Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis	Jun 3, 2021	Data AugmentationSpeaker Verification	—Unverified
An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis	Jun 3, 2021	Speaker VerificationSpeech Synthesis	—Unverified
Dual Script E2E framework for Multilingual and Code-Switching ASR	Jun 2, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A Corpus of Neutral Voice Speech in Brazilian Portuguese	May 21, 2021	Speech Synthesistext-to-speech	—Unverified
Learning Robust Latent Representations for Controllable Speech Synthesis	May 10, 2021	Speech Synthesistext-to-speech	—Unverified
Talrómur: A large Icelandic TTS corpus	May 1, 2021	text-to-speechText to Speech	—Unverified
On Addressing Practical Challenges for RNN-Transducer	Apr 27, 2021	speech-recognitionSpeech Recognition	—Unverified
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis	Apr 26, 2021	Language ModelingLanguage Modelling	CodeCode Available
Non-autoregressive sequence-to-sequence voice conversion	Apr 14, 2021	text-to-speechText to Speech	—Unverified
Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis	Apr 14, 2021	Dependency ParsingRepresentation Learning	—Unverified
Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures	Apr 12, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Exploring Machine Speech Chain for Domain Adaptation and Few-Shot Speaker Adaptation	Apr 8, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Flavored Tacotron: Conditional Learning for Prosodic-linguistic Features	Apr 8, 2021	DecoderSpeech Synthesis	—Unverified
Grapheme-to-Phoneme Transformer Model for Transfer Learning Dialects	Apr 8, 2021	text-to-speechText to Speech	—Unverified
AI4D -- African Language Program	Apr 6, 2021	Machine Translationspeech-recognition	CodeCode Available
Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability	Apr 3, 2021	Emotion Recognitionreinforcement-learning	—Unverified

Show:10 25 50

← PrevPage 22 of 29Next →

No leaderboard results yet.