SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 801850 of 1419 papers

TitleStatusHype
Singing Synthesis: with a little help from my attention0
SlimSpeech: Lightweight and Efficient Text-to-Speech with Slim Rectified Flow0
SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs0
Smart Summarizer for Blind People0
SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech0
SNIPER Training: Single-Shot Sparse Training for Text-to-Speech0
SoK: A Study of the Security on Voice Processing Systems0
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis0
Source Tracing of Audio Deepfake Systems0
MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis0
SpeakEasy: Enhancing Text-to-Speech Interactions for Expressive Content Creation0
Speaker-adaptive neural vocoders for parametric speech synthesis systems0
Speaker Generation0
Speaker-independent raw waveform model for glottal excitation0
Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis0
Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention0
SpeakStream: Streaming Text-to-Speech with Interleaved Data0
Speak While You Think: Streaming Speech Synthesis During Text Generation0
Spectral Codecs: Improving Non-Autoregressive Speech Synthesis with Spectrogram-Based Audio Codecs0
Speculative End-Turn Detector for Efficient Speech Chatbot Assistant0
Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction0
Speech Aware Dialog System Technology Challenge (DSTC11)0
Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks0
Speech BERT Embedding For Improving Prosody in Neural TTS0
Speech denoising by parametric resynthesis0
Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody?0
Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis0
Speech Synthesis along Perceptual Voice Quality Dimensions0
Speech Synthesis for Low Resource Languages using Transliteration Enabled Transfer Learning0
Speech Synthesis of Code-Mixed Text0
Speech Synthesis with Mixed Emotions0
Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation0
Speech to Speech Translation with Translatotron: A State of the Art Review0
Speech to text and text to speech recognition systems-Areview0
Speech-T: Transducer for Text to Speech and Beyond0
Speech vocoding for laboratory phonology0
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer0
SpMis: An Investigation of Synthetic Spoken Misinformation Detection0
Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models0
SpoofCeleb: Speech Deepfake Detection and SASV In The Wild0
Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech0
SQuId: Measuring Speech Naturalness in Many Languages0
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech0
Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting0
StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations0
Streaming Non-Autoregressive Model for Accent Conversion and Pronunciation Improvement0
Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation0
StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling0
Structural Analysis of Hindi Phonetics and A Method for Extraction of Phonetically Rich Sentences from a Very Large Hindi Text Corpus0
Structured State Space Decoder for Speech Recognition and Synthesis0
Show:102550
← PrevPage 17 of 29Next →

No leaderboard results yet.