R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS Jun 30, 2022 Decoder GPU
— Unverified 0Robotic Speech Synthesis: Perspectives on Interactions, Scenarios, and Ethics Mar 17, 2022 Ethics Speech Synthesis
— Unverified 0RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations Jul 3, 2023 Lip to Speech Synthesis Speaker-Specific Lip to Speech Synthesis
— Unverified 0Robust Zero-Shot Text-to-Speech Synthesis with Reverse Inference Optimization Jul 2, 2024 Inference Optimization Speech Synthesis
— Unverified 0RoVo: Robust Voice Protection Against Unauthorized Speech Synthesis with Embedding-Level Perturbations May 19, 2025 Speaker Verification Speech Enhancement
— Unverified 0RSS-TOBI - A Prosodically Enhanced Romanian Speech Corpus May 1, 2014 Speech Synthesis text-to-speech
— Unverified 0RUSLAN: Russian Spoken Language Corpus for Speech Synthesis Jun 26, 2019 Speech Synthesis text-to-speech
— Unverified 0Russian Stress Prediction using Maximum Entropy Ranking Oct 1, 2013 Machine Translation Prediction
— Unverified 0S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation Jun 11, 2025 Reading Comprehension Speech Synthesis
— Unverified 0SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction Jun 2, 2025 Speech Synthesis text-to-speech
— Unverified 0SALTTS: Leveraging Self-Supervised Speech Representations for improved Text-to-Speech Synthesis Aug 2, 2023 Decoder Self-Supervised Learning
— Unverified 0Sampling-based speech parameter generation using moment-matching networks Apr 12, 2017 Speech Synthesis
— Unverified 0Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems Jul 31, 2018 Speech Synthesis
— Unverified 0Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis Dec 6, 2023 Speech Synthesis text-to-speech
— Unverified 0Securing Voice-driven Interfaces against Fake (Cloned) Audio Attacks Feb 18, 2019 Speech Synthesis Voice Cloning
— Unverified 0Seeing Voices: Generating A-Roll Video from Audio with Mirage Jun 9, 2025 Speech Synthesis text-to-speech
— Unverified 0SelectTTS: Synthesizing Anyone's Voice via Discrete Unit-Based Frame Selection Aug 30, 2024 Self-Supervised Learning Speech Synthesis
— Unverified 0Self-Attention Linguistic-Acoustic Decoder Aug 31, 2018 CPU Decoder
— Unverified 0Self-supervised Context-aware Style Representation for Expressive Speech Synthesis Jun 25, 2022 Contrastive Learning Deep Clustering
— Unverified 0Self-supervised learning for robust voice cloning Apr 7, 2022 Self-Supervised Learning Speech Synthesis
— Unverified 0SelfVC: Voice Conversion With Iterative Refinement using Self Transformations Oct 14, 2023 Self-Supervised Learning Speaker Verification
— Unverified 0Semantics and Discourse Processing for Expressive TTS Sep 1, 2015 Speech Synthesis
— Unverified 0SE-MelGAN -- Speaker Agnostic Rapid Speech Enhancement Jun 13, 2020 CPU GPU
— Unverified 0Semi-Supervised Generative Modeling for Controllable Speech Synthesis Oct 3, 2019 Speech Synthesis text-to-speech
— Unverified 0Semi-Supervised Learning Based on Reference Model for Low-resource TTS Oct 25, 2022 Speech Synthesis text-to-speech
— Unverified 0Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation May 16, 2020 Decoder Speech Synthesis
— Unverified 0Semi-Supervised Training for Improving Data Efficiency in End-to-End Speech Synthesis Aug 30, 2018 Decoder Speech Synthesis
— Unverified 0Sentiment Analysis for Emotional Speech Synthesis in a News Dialogue System Dec 1, 2020 Articles Emotional Speech Synthesis
— Unverified 0Sequence Modeling using Gated Recurrent Neural Networks Jan 1, 2015 Machine Translation Speech Synthesis
— Unverified 0Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis May 18, 2025 Speech Synthesis text-to-speech
— Unverified 0Significance of Maximum Spectral Amplitude in Sub-bands for Spectral Envelope Estimation and Its Application to Statistical Parametric Speech Synthesis Aug 3, 2015 Speech Synthesis
— Unverified 0Silent Speech Interfaces for Speech Restoration: A Review Sep 4, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Simple and Effective Unsupervised Speech Synthesis Apr 6, 2022 speech-recognition Speech Recognition
— Unverified 0Simple and Effective Unsupervised Speech Translation Oct 18, 2022 Domain Adaptation Machine Translation
— Unverified 0Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS Nov 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Simultaneous Translation Nov 1, 2020 Machine Translation speech-recognition
— Unverified 0SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation Oct 14, 2021 Generative Adversarial Network GPU
— Unverified 0Single-stage TTS with Masked Audio Token Modeling and Semantic Knowledge Distillation Sep 17, 2024 Knowledge Distillation Speech Synthesis
— Unverified 0Situated Incremental Natural Language Understanding using a Multimodal, Linguistically-driven Update Model Aug 1, 2014 Dialogue Management Natural Language Understanding
— Unverified 0SlimSpeech: Lightweight and Efficient Text-to-Speech with Slim Rectified Flow Apr 10, 2025 Speech Synthesis text-to-speech
— Unverified 0SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs Jul 18, 2023 Generative Adversarial Network Language Modeling
— Unverified 0SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speaker text-to-speech Nov 30, 2022 Speech Synthesis text-to-speech
— Unverified 0SOLIDO: A Robust Watermarking Method for Speech Synthesis via Low-Rank Adaptation Apr 21, 2025 parameter-efficient fine-tuning Speech Synthesis
— Unverified 0SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis Apr 6, 2022 Speech Synthesis text-to-speech
— Unverified 0SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation Jul 27, 2022 Language Modeling Language Modelling
— Unverified 0MegaTTS 3: Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis Feb 26, 2025 Speech Synthesis text-to-speech
— Unverified 0Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis Mar 2, 2022 Speech Synthesis
— Unverified 0Speaker-adaptive neural vocoders for parametric speech synthesis systems Nov 8, 2018 Speech Synthesis text-to-speech
— Unverified 0Speaker Anonymization Using X-vector and Neural Waveform Models May 30, 2019 Speaker anonymization Speaker Verification
— Unverified 0Speaker Anonymization with Phonetic Intermediate Representations Jul 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0