SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 151175 of 1419 papers

TitleStatusHype
Accent Estimation of Japanese Words from Their Surfaces and Romanizations for Building Large Vocabulary Accent DictionariesCode1
HUI-Audio-Corpus-German: A high quality TTS datasetCode1
Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer LearningCode1
GUIRoboTron-Speech: Towards Automated GUI Agents Based on Speech InstructionsCode1
Accented Text-to-Speech Synthesis with a Conditional Variational AutoencoderCode1
AdaSpeech: Adaptive Text to Speech for Custom VoiceCode1
An End-to-end Chinese Text Normalization Model based on Rule-guided Flat-Lattice TransformerCode1
Grad-TTS: A Diffusion Probabilistic Model for Text-to-SpeechCode1
HiFi-WaveGAN: Generative Adversarial Network with Auxiliary Spectrogram-Phase Loss for High-Fidelity Singing Voice GenerationCode1
Improving fairness for spoken language understanding in atypical speech with Text-to-SpeechCode1
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech RecognitionCode1
g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark DatasetCode1
FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech DetectionCode1
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech SynthesisCode1
Fine-grained style control in Transformer-based Text-to-speech SynthesisCode1
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback ConstraintCode1
Glow-TTS: A Generative Flow for Text-to-Speech via Monotonic Alignment SearchCode1
AudioMarkBench: Benchmarking Robustness of Audio WatermarkingCode1
An Efficient Membership Inference Attack for the Diffusion Model by Proximal InitializationCode1
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech SynthesisCode1
FastSpeech 2: Fast and High-Quality End-to-End Text to SpeechCode1
AdaSpeech 2: Adaptive Text to Speech with Untranscribed DataCode1
Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language ModelsCode1
FastPitch: Parallel Text-to-speech with Pitch PredictionCode1
Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech UnderstandingCode1
Show:102550
← PrevPage 7 of 57Next →

No leaderboard results yet.