SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 901950 of 1419 papers

TitleStatusHype
The ILMT-s2s Corpus ― A Multimodal Interlingual Map Task Corpus0
The Impact of Silence on Speech Anti-Spoofing0
The MSXF TTS System for ICASSP 2022 ADD Challenge0
The Nós Project: Opening routes for the Galician language in the field of language technologies0
The NTU-AISG Text-to-speech System for Blizzard Challenge 20200
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance0
The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach0
The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains0
The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge0
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain0
TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality0
T-Modules: Translation Modules for Zero-Shot Cross-Modal Machine Translation0
Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion0
Total-Duration-Aware Duration Modeling for Text-to-Speech Systems0
Towards Accurate Lip-to-Speech Synthesis in-the-Wild0
Towards a Japanese Full-duplex Spoken Dialogue System0
Towards a Language Service Infrastructure for Mobile Environments0
Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer0
Towards Flow-Matching-based TTS without Classifier-Free Guidance0
Towards Fully Automatic Annotation of Audio Books for TTS0
Towards human-like spoken dialogue generation between AI agents from written dialogue0
Towards Lightweight and Stable Zero-shot TTS with Self-distilled Representation Disentanglement0
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale0
Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram0
Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion0
Towards Optimizing OCR for Accessibility0
Towards Robust FastSpeech 2 by Modelling Residual Multimodality0
Towards Robust Neural Vocoding for Speech Generation: A Survey0
Towards Selection of Text-to-speech Data to Augment ASR Training0
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis0
Towards Transfer Learning for End-to-End Speech Synthesis from Deep Pre-Trained Language Models0
Towards zero-shot Text-based voice editing using acoustic context conditioning, utterance embeddings, and reference encoders0
Towards Zero-Shot Text-To-Speech for Arabic Dialects0
Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora0
Training Universal Vocoders with Feature Smoothing-Based Augmentation Methods for High-Quality TTS Systems0
Training Wake Word Detection with Synthesized Speech Data on Confusion Words0
Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation0
Transduce and Speak: Neural Transducer for Text-to-Speech with Semantic Token Prediction0
Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus0
Transfer the linguistic representations from TTS to accent conversion with non-parallel data0
Transformer-based Models of Text Normalization for Speech Applications0
Transplantation of Conversational Speaking Style with Interjections in Sequence-to-Sequence Speech Synthesis0
Triple M: A Practical Text-to-speech Synthesis System With Multi-guidance Attention And Multi-band Multi-time LPCNet0
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder0
TTSDS2: Resources and Benchmark for Evaluating Human-Quality Text to Speech Systems0
TTS for Low Resource Languages: A Bangla Synthesizer0
TTS-Guided Training for Accent Conversion Without Parallel Data0
TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations0
TTS-Transducer: End-to-End Speech Synthesis with Neural Transducer0
UmbraTTS: Adapting Text-to-Speech to Environmental Contexts with Flow Matching0
Show:102550
← PrevPage 19 of 29Next →

No leaderboard results yet.