Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 326–350 of 1419 papers

Title	Date	Tasks	Status	Hype
Source Tracing of Audio Deepfake Systems	Jul 10, 2024	Face Swappingtext-to-speech	—Unverified	0
ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation	Jul 7, 2024	Sentencetext-to-speech	—Unverified	0
Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation	Jul 7, 2024	Text to Speech	CodeCode Available	0
CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens	Jul 7, 2024	Language ModellingLarge Language Model	CodeCode Available	11
Optimizing a-DCF for Spoofing-Robust Speaker Verification	Jul 4, 2024	Speaker VerificationText to Speech	—Unverified	0
Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis	Jul 4, 2024	Accented Speech RecognitionAutomatic Speech Recognition	—Unverified	0
On the Effectiveness of Acoustic BPE in Decoder-Only TTS	Jul 4, 2024	DecoderDiversity	—Unverified	0
CATT: Character-based Arabic Tashkeel Transformer	Jul 3, 2024	Arabic Text DiacritizationDecoder	CodeCode Available	2
TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations	Jul 2, 2024	Benchmarkingtext-to-speech	—Unverified	0
Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization	Jul 2, 2024	Inference OptimizationSpeech Synthesis	—Unverified	0
Lightweight Zero-shot Text-to-Speech with Mixture of Adapters	Jul 1, 2024	DecoderSpeech Synthesis	—Unverified	0
FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis	Jun 30, 2024	CPUDecoder	—Unverified	0
NAIST Simultaneous Speech Translation System for IWSLT 2024	Jun 30, 2024	Speech-to-Speech TranslationSpeech-to-Text	—Unverified	0
Open-Source Conversational AI with SpeechBrain 1.0	Jun 29, 2024	Language ModelingLanguage Modelling	—Unverified	0
Application of ASV for Voice Identification after VC and Duration Predictor Improvement in TTS Models	Jun 27, 2024	Speaker Verificationtext-to-speech	—Unverified	0
DEX-TTS: Diffusion-based EXpressive Text-to-Speech with Style Modeling on Time Variability	Jun 27, 2024	Speech Synthesistext-to-speech	CodeCode Available	2
E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS	Jun 26, 2024	text-to-speechText to Speech	CodeCode Available	1
Automatic Speech Recognition for Hindi	Jun 26, 2024	Action DetectionActivity Detection	—Unverified	0
LLM-Driven Multimodal Opinion Expression Identification	Jun 26, 2024	text-to-speechText to Speech	—Unverified	0
High Fidelity Text-to-Speech Via Discrete Tokens Using Token Transducer and Group Masked Language Model	Jun 25, 2024	Computational EfficiencyLanguage Modeling	—Unverified	0
Leveraging Parameter-Efficient Transfer Learning for Multi-Lingual Text-to-Speech Adaptation	Jun 25, 2024	Speech Synthesistext-to-speech	—Unverified	0
Improving Robustness of LLM-based Speech Synthesis by Learning Monotonic Alignment	Jun 25, 2024	DecoderLanguage Modeling	—Unverified	0
Towards Zero-Shot Text-To-Speech for Arabic Dialects	Jun 24, 2024	Dialect IdentificationSpeech Synthesis	—Unverified	0
A multi-speaker multi-lingual voice cloning system based on vits2 for limmits 2024 challenge	Jun 22, 2024	Speech Synthesistext-to-speech	—Unverified	0
TacoLM: GaTed Attention Equipped Codec Language Model are Efficient Zero-Shot Text to Speech Synthesizers	Jun 22, 2024	DecoderLanguage Modeling	CodeCode Available	1

Show:10 25 50

← PrevPage 14 of 57Next →

No leaderboard results yet.