SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 601650 of 1419 papers

TitleStatusHype
Development of an Inclusive Educational Platform Using Open Technologies and Machine Learning: A Case Study on Accessibility Enhancement0
A New Approach to Voice Authenticity0
Development and Evaluation of Speech Synthesis Corpora for Latvian0
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability0
Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners0
Detection of Lexical Stress Errors in Non-Native (L2) English with Data Augmentation and Attention0
Automatic Evaluation of Turn-taking Cues in Conversational Speech Synthesis0
A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music0
Designing French Tale Corpora for Entertaining Text To Speech Synthesis0
Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control0
Automatic Evaluation of Speaker Similarity0
Enhancing Word-Level Semantic Representation via Dependency Structure for Expressive Text-to-Speech Synthesis0
Denoising Text to Speech with Frame-Level Noise Modeling0
Automatic Arabic Dialect Identification Systems for Written Texts: A Survey0
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition0
A Unified Transformer-based Framework for Duplex Text Normalization0
Deliberation Model for On-Device Spoken Language Understanding0
A unified sequence-to-sequence front-end model for Mandarin text-to-speech synthesis0
An End-to-End Neural Network for Image-to-Audio Transformation0
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis0
Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese0
A Unified Model For Voice and Accent Conversion In Speech and Singing using Self-Supervised Learning and Feature Extraction0
Deep Text-to-Speech System with Seq2Seq Model0
A unified front-end framework for English text-to-speech synthesis0
An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space0
Deep Shallow Fusion for RNN-T Personalization0
Deep Performer: Score-to-Audio Music Performance Synthesis0
A Unified Framework for Collecting Text-to-Speech Synthesis Datasets for 22 Indian Languages0
Deep Feed-forward Sequential Memory Networks for Speech Synthesis0
Augmenting text for spoken language understanding with Large Language Models0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios0
Deep Denoising Auto-encoder for Statistical Speech Synthesis0
DeepAudio-V1:Towards Multi-Modal Multi-Stage End-to-End Video to Speech and Audio Generation0
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework0
Debatts: Zero-Shot Debating Text-to-Speech Synthesis0
D-CAPTCHA++: A Study of Resilience of Deepfake CAPTCHA under Transferable Imperceptible Adversarial Attack0
Augmentation through Laundering Attacks for Audio Spoof Detection0
Data Redaction from Conditional Generative Models0
Data Processing for Optimizing Naturalness of Vietnamese Text-to-speech System0
Data Efficient Voice Cloning for Neural Singing Synthesis0
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech0
AudioVisual Speech Synthesis: A brief literature review0
AdaSpeech 3: Adaptive Text to Speech for Spontaneous Style0
Accented Text-to-Speech Synthesis with Limited Data0
Data Center Audio/Video Intelligence on Device (DAVID) -- An Edge-AI Platform for Smart-Toys0
Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios0
DASB -- Discrete Audio and Speech Benchmark0
DART: Disentanglement of Accent and Speaker Representation in Multispeaker Text-to-Speech0
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue0
Show:102550
← PrevPage 13 of 29Next →

No leaderboard results yet.