SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 151175 of 1419 papers

TitleStatusHype
Accent Estimation of Japanese Words from Their Surfaces and Romanizations for Building Large Vocabulary Accent DictionariesCode1
HM-Conformer: A Conformer-based audio deepfake detection system with hierarchical pooling and multi-level classification token aggregation methodsCode1
Imaginary Voice: Face-styled Diffusion Model for Text-to-SpeechCode1
Grad-TTS: A Diffusion Probabilistic Model for Text-to-SpeechCode1
Accented Text-to-Speech Synthesis with a Conditional Variational AutoencoderCode1
GUIRoboTron-Speech: Towards Automated GUI Agents Based on Speech InstructionsCode1
AdaSpeech: Adaptive Text to Speech for Custom VoiceCode1
An End-to-end Chinese Text Normalization Model based on Rule-guided Flat-Lattice TransformerCode1
Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An OverviewCode1
Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer LearningCode1
g2pM: A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark DatasetCode1
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech RecognitionCode1
From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback ConstraintCode1
An Efficient Membership Inference Attack for the Diffusion Model by Proximal InitializationCode1
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech SynthesisCode1
FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech SynthesisCode1
FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech DetectionCode1
AudioMarkBench: Benchmarking Robustness of Audio WatermarkingCode1
FastPitch: Parallel Text-to-speech with Pitch PredictionCode1
Fine-grained style control in Transformer-based Text-to-speech SynthesisCode1
AdaSpeech 2: Adaptive Text to Speech with Untranscribed DataCode1
Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language ModelsCode1
FastPitchFormant: Source-filter based Decomposed Modeling for Speech SynthesisCode1
FastSpeech 2: Fast and High-Quality End-to-End Text to SpeechCode1
Evaluating Speech Synthesis by Training Recognizers on Synthetic SpeechCode1
Show:102550
← PrevPage 7 of 57Next →

No leaderboard results yet.