Rhythm-controllable Attention with High Robustness for Long Sentence Speech Synthesis Jun 5, 2023 Rhythm Sentence
— Unverified 00 R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS Jun 30, 2022 Decoder GPU
— Unverified 00 Robotic Speech Synthesis: Perspectives on Interactions, Scenarios, and Ethics Mar 17, 2022 Ethics Speech Synthesis
— Unverified 00 RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations Jul 3, 2023 Lip to Speech Synthesis Speaker-Specific Lip to Speech Synthesis
— Unverified 00 Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization Jul 2, 2024 Inference Optimization Speech Synthesis
— Unverified 00 RoVo: Robust Voice Protection Against Unauthorized Speech Synthesis with Embedding-Level Perturbations May 19, 2025 Speaker Verification Speech Enhancement
— Unverified 00 RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus May 1, 2014 Speech Synthesis text-to-speech
— Unverified 00 RUSLAN: Russian Spoken Language Corpus for Speech Synthesis Jun 26, 2019 Speech Synthesis text-to-speech
— Unverified 00 Russian Stress Prediction using Maximum Entropy Ranking Oct 1, 2013 Machine Translation Prediction
— Unverified 00 S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation Jun 11, 2025 Reading Comprehension Speech Synthesis
— Unverified 00 SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction Jun 2, 2025 Speech Synthesis text-to-speech
— Unverified 00 SALTTS: Leveraging Self-Supervised Speech Representations for improved Text-to-Speech Synthesis Aug 2, 2023 Decoder Self-Supervised Learning
— Unverified 00 Sampling-based speech parameter generation using moment-matching networks Apr 12, 2017 Speech Synthesis
— Unverified 00 Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems Jul 31, 2018 Speech Synthesis
— Unverified 00 Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis Dec 6, 2023 Speech Synthesis text-to-speech
— Unverified 00 Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks Feb 18, 2019 Speech Synthesis Voice Cloning
— Unverified 00 Seeing Voices: Generating A-Roll Video from Audio with Mirage Jun 9, 2025 Speech Synthesis text-to-speech
— Unverified 00 SelectTTS: Synthesizing Anyone's Voice via Discrete Unit-Based Frame Selection Aug 30, 2024 Self-Supervised Learning Speech Synthesis
— Unverified 00 Self-Attention Linguistic-Acoustic Decoder Aug 31, 2018 CPU Decoder
— Unverified 00 Self-supervised Context-aware Style Representation for Expressive Speech Synthesis Jun 25, 2022 Contrastive Learning Deep Clustering
— Unverified 00 Self-supervised learning for robust voice cloning Apr 7, 2022 Self-Supervised Learning Speech Synthesis
— Unverified 00 SelfVC: Voice Conversion With Iterative Refinement using Self Transformations Oct 14, 2023 Self-Supervised Learning Speaker Verification
— Unverified 00 Semantics and Discourse Processing for Expressive TTS Sep 1, 2015 Speech Synthesis
— Unverified 00 SE-MelGAN -- Speaker Agnostic Rapid Speech Enhancement Jun 13, 2020 CPU GPU
— Unverified 00 Semi-Supervised Generative Modeling for Controllable Speech Synthesis Oct 3, 2019 Speech Synthesis text-to-speech
— Unverified 00 Semi-Supervised Learning Based on Reference Model for Low-resource TTS Oct 25, 2022 Speech Synthesis text-to-speech
— Unverified 00 Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation May 16, 2020 Decoder Speech Synthesis
— Unverified 00 Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis Aug 30, 2018 Decoder Speech Synthesis
— Unverified 00 Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System Dec 1, 2020 Articles Emotional Speech Synthesis
— Unverified 00 Sequence Modeling using Gated Recurrent Neural Networks Jan 1, 2015 Machine Translation Speech Synthesis
— Unverified 00 Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis May 18, 2025 Speech Synthesis text-to-speech
— Unverified 00 Significance of Maximum Spectral Amplitude in Sub-bands for Spectral Envelope Estimation and Its Application to Statistical Parametric Speech Synthesis Aug 3, 2015 Speech Synthesis
— Unverified 00 Silent Speech Interfaces for Speech Restoration: A Review Sep 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Simple and Effective Unsupervised Speech Synthesis Apr 6, 2022 speech-recognition Speech Recognition
— Unverified 00 Simple and Effective Unsupervised Speech Translation Oct 18, 2022 Domain Adaptation Machine Translation
— Unverified 00 Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS Nov 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Simultaneous Translation Nov 1, 2020 Machine Translation speech-recognition
— Unverified 00 SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation Oct 14, 2021 Generative Adversarial Network GPU
— Unverified 00 Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation Sep 17, 2024 Knowledge Distillation Speech Synthesis
— Unverified 00 Situated Incremental Natural Language Understanding using a Multimodal, Linguistically-driven Update Model Aug 1, 2014 Dialogue Management Natural Language Understanding
— Unverified 00 SlimSpeech: Lightweight and Efficient Text-to-Speech with Slim Rectified Flow Apr 10, 2025 Speech Synthesis text-to-speech
— Unverified 00 SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs Jul 18, 2023 Generative Adversarial Network Language Modeling
— Unverified 00 SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech Nov 30, 2022 Speech Synthesis text-to-speech
— Unverified 00 SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation Apr 21, 2025 parameter-efficient fine-tuning Speech Synthesis
— Unverified 00 SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis Apr 6, 2022 Speech Synthesis text-to-speech
— Unverified 00 SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation Jul 27, 2022 Language Modeling Language Modelling
— Unverified 00 MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis Feb 26, 2025 Speech Synthesis text-to-speech
— Unverified 00 Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis Mar 2, 2022 Speech Synthesis
— Unverified 00 Speaker-adaptive neural vocoders for parametric speech synthesis systems Nov 8, 2018 Speech Synthesis text-to-speech
— Unverified 00 Speaker Anonymization Using X-vector and Neural Waveform Models May 30, 2019 Speaker anonymization Speaker Verification
— Unverified 00