Speaker Anonymization with Phonetic Intermediate Representations | Jul 11, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Speaker-independent neural formant synthesis | Jun 2, 2023 | Speech Synthesis
Speaker-independent raw waveform model for glottal excitation | Apr 25, 2018 | model, Speech Synthesis
Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models | May 15, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis | Jun 3, 2021 | Data Augmentation, Speaker Verification
Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model | Feb 16, 2024 | Denoising, Speech Enhancement
Speaking rate attention-based duration prediction for speed control TTS | Oct 13, 2023 | Attribute, Speech Synthesis
Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention | Oct 29, 2018 | Speech Synthesis, text-to-speech
Speak While You Think: Streaming Speech Synthesis During Text Generation | Sep 20, 2023 | Speech Synthesis, Text Generation
SPEAK WITH YOUR HANDS Using Continuous Hand Gestures to control Articulatory Speech Synthesizer | Feb 2, 2021 | Speech Synthesis
SPEAK YOUR MIND! Towards Imagined Speech Recognition With Hierarchical Deep Learning | Apr 8, 2019 | Brain Computer Interface, General Classification
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis | Jan 30, 2024 | Generative Adversarial Network, Speech Synthesis
Spectral Codecs: Improving Non-Autoregressive Speech Synthesis with Spectrogram-Based Audio Codecs | Jun 7, 2024 | Quantization, Speech Synthesis
Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction | May 8, 2013 | Speech Synthesis, Speech-to-Text
Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks | Jul 26, 2024 | Generative Adversarial Network, Speech Enhancement
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition | Jan 31, 2024 | Decoder, Language Modeling
Speech denoising by parametric resynthesis | Apr 2, 2019 | Denoising, Resynthesis
Speech earthquakes: scaling and universality in human voice | Aug 5, 2014 | Speech Synthesis
Speech inpainting: Context-based speech synthesis guided by video | Jun 1, 2023 | speech-recognition, Speech Recognition
Speech-MLP: a simple MLP architecture for speech processing | Sep 29, 2021 | Keyword Spotting, Speech Enhancement
Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis | Jul 8, 2025 | Data Augmentation, Mixture-of-Experts
Speech Recognition with Augmented Synthesized Speech | Sep 25, 2019 | Data Augmentation, Diversity
Speech Rhythm-Based Speaker Embeddings Extraction from Phonemes and Phoneme Duration for Multi-Speaker Speech Synthesis | Feb 11, 2024 | Rhythm, Speaker Identification
Speech Synthesis along Perceptual Voice Quality Dimensions | Jan 15, 2025 | Expressive Speech Synthesis, Speech Synthesis
Speech Synthesis as Augmentation for Low-Resource ASR | Dec 23, 2020 | Data Augmentation, speech-recognition
Speech Synthesis for Low Resource Languages using Transliteration Enabled Transfer Learning | Nov 16, 2021 | speech-recognition, Speech Recognition
Speech Synthesis of Code-Mixed Text | May 1, 2016 | Language Identification, Speech Synthesis
Speech Synthesis using EEG | Feb 22, 2020 | EEG, Electroencephalogram (EEG)
Speech Synthesis with Mixed Emotions | Aug 11, 2022 | Attribute, Emotional Speech Synthesis
Speech vocoding for laboratory phonology | Jan 22, 2016 | Speech Synthesis, text-to-speech
Speech Synthesis By Unrolling Diffusion Process using Neural Network Layers | Sep 18, 2023 | Denoising, Speech Synthesis
Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models | Jul 18, 2024 | Language Modeling, Language Modelling
Sprachsynthese -- State-of-the-Art in englischer und deutscher Sprache | Jun 11, 2021 | Speech Synthesis
Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting | Dec 28, 2024 | Speech Synthesis, text-to-speech
Statistical Evaluation of Pronunciation Encoding | May 1, 2012 | Speech Recognition, Speech Synthesis
Statistical Parametric Speech Synthesis Using Bottleneck Representation From Sequence Auto-encoder | Jun 19, 2016 | Speech Synthesis
Strategies in Transfer Learning for Low-Resource Speech Synthesis: Phone Mapping, Features Input, and Source Language Selection | Jun 21, 2023 | Automatic Speech Recognition, speech-recognition
StreamVC: Real-Time Low-Latency Voice Conversion | Jan 5, 2024 | Speech Synthesis, Voice Conversion
StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis | Sep 24, 2024 | Speech Synthesis, text-to-speech
Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis | Dec 13, 2022 | Data Augmentation, Speech Synthesis
Style Mixture of Experts for Expressive Text-To-Speech Synthesis | Jun 5, 2024 | Mixture-of-Experts, Speech Synthesis
STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech | Mar 17, 2021 | Speech Synthesis, Style Transfer
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis | Dec 19, 2023 | Decoder, Speech Synthesis
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion | Sep 16, 2024 | Speech Synthesis, text-to-speech
Style Variation as a Vantage Point for Code-Switching | May 1, 2020 | Language Modeling, Language Modelling
SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System | Mar 29, 2025 | Speech Synthesis, text-to-speech
SUT System Description for Anti-Spoofing 2017 Challenge | Nov 1, 2017 | Quantization, Speaker Verification
SwissDial: Parallel Multidialectal Corpus of Spoken Swiss German | Mar 21, 2021 | Speech Synthesis
Syllabification by Phone Categorization | Jul 15, 2018 | Retrieval, speech-recognition
SynCLR: A Synthesis Framework for Contrastive Learning of out-of-domain Speech Representations | Sep 29, 2021 | Contrastive Learning, Data Augmentation