SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 12011250 of 1419 papers

TitleStatusHype
Speculative End-Turn Detector for Efficient Speech Chatbot Assistant0
Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction0
Speech Aware Dialog System Technology Challenge (DSTC11)0
Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks0
Speech BERT Embedding For Improving Prosody in Neural TTS0
Speech denoising by parametric resynthesis0
Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody?0
Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis0
Speech Synthesis along Perceptual Voice Quality Dimensions0
Speech Synthesis for Low Resource Languages using Transliteration Enabled Transfer Learning0
Speech Synthesis of Code-Mixed Text0
Speech Synthesis with Mixed Emotions0
Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation0
Speech to Speech Translation with Translatotron: A State of the Art Review0
Speech to text and text to speech recognition systems-Areview0
Speech-T: Transducer for Text to Speech and Beyond0
Speech vocoding for laboratory phonology0
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer0
SpMis: An Investigation of Synthetic Spoken Misinformation Detection0
Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models0
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild0
Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech0
SQuId: Measuring Speech Naturalness in Many Languages0
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech0
Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting0
StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations0
Streaming Non-Autoregressive Model for Accent Conversion and Pronunciation Improvement0
Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation0
StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling0
Structural Analysis of Hindi Phonetics and A Method for Extraction of Phonetically Rich Sentences from a Very Large Hindi Text Corpus0
Structured State Space Decoder for Speech Recognition and Synthesis0
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions0
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent0
Study of Indian English Pronunciation Variabilities relative to Received Pronunciation0
Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech0
Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN0
Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models0
StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis0
Style Mixture of Experts for Expressive Text-To-Speech Synthesis0
STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech0
Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation0
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion0
Style Variation as a Vantage Point for Code-Switching0
SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System0
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition0
SyncSpeech: Low-Latency and Efficient Dual-Stream Text-to-Speech based on Temporal Masked Transformer0
Syntactic representation learning for neural network based TTS with syntactic parse tree traversal0
Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech0
Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments0
SynthASR: Unlocking Synthetic Data for Speech Recognition0
Show:102550
← PrevPage 25 of 29Next →

No leaderboard results yet.