SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 501550 of 1419 papers

TitleStatusHype
Development of an Inclusive Educational Platform Using Open Technologies and Machine Learning: A Case Study on Accessibility Enhancement0
Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model0
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech0
Explicit Intensity Control for Accented Text-to-speech0
A New Approach to Voice Authenticity0
Exploiting Transliterated Words for Finding Similarity in Inter-Language News Articles using Machine Learning0
Exploring an Inter-Pausal Unit (IPU) based Approach for Indic End-to-End TTS Systems0
Exploring Machine Speech Chain for Domain Adaptation and Few-Shot Speaker Adaptation0
Exploring Speech Enhancement for Low-resource Speech Synthesis0
Exploring speech style spaces with language models: Emotional TTS without emotion labels0
Development and Evaluation of Speech Synthesis Corpora for Latvian0
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability0
Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners0
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention0
Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis0
A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music0
Designing French Tale Corpora for Entertaining Text To Speech Synthesis0
Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control0
Automatic Evaluation of Speaker Similarity0
Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis0
Denoising Text to Speech with Frame-Level Noise Modeling0
Automatic Arabic Dialect Identification Systems for Written Texts: A Survey0
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition0
A Unified Transformer-based Framework for Duplex Text Normalization0
An End-to-End Neural Network for Image-to-Audio Transformation0
Deliberation Model for On-Device Spoken Language Understanding0
A unified sequence-to-sequence front-end model for Mandarin text-to-speech synthesis0
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis0
Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese0
FoundationTTS: Text-to-Speech for ASR Customization with Generative Language Model0
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios0
A Unified Model For Voice and Accent Conversion In Speech and Singing using Self-Supervised Learning and Feature Extraction0
UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech0
A unified front-end framework for English text-to-speech synthesis0
Deep Text-to-Speech System with Seq2Seq Model0
An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space0
FlexSpeech: Towards Stable, Controllable and Expressive Text-to-Speech0
Deep Shallow Fusion for RNN-T Personalization0
Deep Performer: Score-to-Audio Music Performance Synthesis0
A Unified Framework for Collecting Text-to-Speech Synthesis Datasets for 22 Indian Languages0
Deep Feed-forward Sequential Memory Networks for Speech Synthesis0
Augmenting text for spoken language understanding with Large Language Models0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
Deep Denoising Auto-encoder for Statistical Speech Synthesis0
DeepAudio-V1:Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation0
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework0
Debatts: Zero-Shot Debating Text-to-Speech Synthesis0
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack0
Augmentation through Laundering Attacks for Audio Spoof Detection0
Data Redaction from Conditional Generative Models0
Show:102550
← PrevPage 11 of 29Next →

No leaderboard results yet.