SOTAVerified

Text to Speech

import gTTS import os def text_to_speech_kurdish(text, output_file="output.mp3"): # گۆڕینی نووسین بۆ دەنگ بە زمانی کوردی (هەڵبژاردنی زمانی "ku" بۆ کوردی) tts = gTTS(text=text, lang='ku', slow=False) tts.save(output_file) os.system(f"start {output_file}") # کردنەوەی فایلە دەنگییەکە (لە Windows) # نموونە: text_to_speech_kurdish("سڵاو، ئەمە دەنگی منە بە زمانی کوردی.")

Papers

Showing 801850 of 1419 papers

TitleStatusHype
An Empirical Study on L2 Accents of Cross-lingual Text-to-Speech Systems via Vowel Space0
An End-to-End Neural Network for Image-to-Audio Transformation0
A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music0
A New Approach to Voice Authenticity0
An Exhaustive Evaluation of TTS- and VC-based Data Augmentation for ASR0
An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis0
An Expert System for Automatic Reading of A Text Written in Standard Arabic0
An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS0
An Implementation of Back-Propagation Learning on GF11, a Large SIMD Parallel Computer0
An In-depth Analysis of the Effect of Text Normalization in Social Media0
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS0
An objective evaluation of the effects of recording conditions and speaker characteristics in multi-speaker deep neural speech synthesis0
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy0
A Novel Approach to OCR using Image Recognition based Classification for Ancient Tamil Inscriptions in Temples0
A Novel Chinese Dialect TTS Frontend with Non-Autoregressive Neural Machine Translation0
A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions0
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era0
An overview of text-to-speech systems and media applications0
Anti-Spoofing Using Transfer Learning with Variational Information Bottleneck0
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person0
Grad-StyleSpeech: Any-speaker Adaptive Text-to-Speech Synthesis with Diffusion Models0
基於字元階層之語音合成用文脈訊息擷取 (Character-Level Linguistic Features Extraction for Text-to-Speech System) [In Chinese]0
基於字元階層之語音合成用文脈訊息擷取(Character-Level Linguistic Features Extraction for Text-to-Speech System) [In Chinese]0
A Polyphone BERT for Polyphone Disambiguation in Mandarin Chinese0
Application of ASV for Voice Identification after VC and Duration Predictor Improvement in TTS Models0
Applying Automated Machine Translation to Educational Video Courses0
Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech0
Applying Syntaxx2013Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis0
A Practical Guide to Logical Access Voice Presentation Attack Detection0
A Preliminary Analysis of Automatic Word and Syllable Prominence Detection in Non-Native Speech With Text-to-Speech Prosody Embeddings0
A Proposal of Automatic Error Correction in Text0
Arabic Text-To-Speech (TTS) Data Preparation0
A review-based study on different Text-to-Speech technologies0
A Review of Deep Learning Techniques for Speech Processing0
A Review of Multi-Modal Large Language and Vision Models0
ArmanTTS single-speaker Persian dataset0
Artificial Eye for the Blind0
A Self-Refining Framework for Enhancing ASR Using TTS-Synthesized Data0
A Simple Baseline for Domain Adaptation in End to End ASR Systems Using Synthetic Data0
A Speech-enabled Fixed-phrase Translator for Healthcare Accessibility0
ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation0
AS-Speech: Adaptive Style For Speech Synthesis0
A Study of Modeling Rising Intonation in Cantonese Neural Speech Synthesis0
A Study of Non-autoregressive Model for Sequence Generation0
A Study on Altering the Latent Space of Pretrained Text to Speech Models for Improved Expressiveness0
A study on the efficacy of model pre-training in developing neural text-to-speech system0
A Survey on Audio Synthesis and Audio-Visual Multimodal Processing0
ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech0
Asynchronous Tool Usage for Real-Time Agents0
A System for Diacritizing Four Varieties of Arabic0
Show:102550
← PrevPage 17 of 29Next →

No leaderboard results yet.