| Speculative End-Turn Detector for Efficient Speech Chatbot Assistant | Mar 30, 2025 | ChatbotCollaborative Inference | —Unverified | 0 |
| Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction | May 8, 2013 | Speech SynthesisSpeech-to-Text | —Unverified | 0 |
| Speech Aware Dialog System Technology Challenge (DSTC11) | Dec 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks | Jul 26, 2024 | Generative Adversarial NetworkSpeech Enhancement | —Unverified | 0 |
| Speech BERT Embedding For Improving Prosody in Neural TTS | Jun 8, 2021 | Decodertext-to-speech | —Unverified | 0 |
| Speech denoising by parametric resynthesis | Apr 2, 2019 | DenoisingResynthesis | —Unverified | 0 |
| Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody? | Oct 31, 2024 | Rhythmspeech-recognition | —Unverified | 0 |
| Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis | Jul 8, 2025 | Data AugmentationMixture-of-Experts | —Unverified | 0 |
| Speech Synthesis along Perceptual Voice Quality Dimensions | Jan 15, 2025 | Expressive Speech SynthesisSpeech Synthesis | —Unverified | 0 |
| Speech Synthesis for Low Resource Languages using Transliteration Enabled Transfer Learning | Nov 16, 2021 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Speech Synthesis of Code-Mixed Text | May 1, 2016 | Language IdentificationSpeech Synthesis | —Unverified | 0 |
| Speech Synthesis with Mixed Emotions | Aug 11, 2022 | AttributeEmotional Speech Synthesis | —Unverified | 0 |
| Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation | May 30, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Speech to Speech Translation with Translatotron: A State of the Art Review | Feb 9, 2025 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Speech to text and text to speech recognition systems-Areview | Mar 17, 2018 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Speech-T: Transducer for Text to Speech and Beyond | Dec 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speech vocoding for laboratory phonology | Jan 22, 2016 | Speech Synthesistext-to-speech | —Unverified | 0 |
| SpeechX: Neural Codec Language Model as a Versatile Speech Transformer | Aug 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SpMis: An Investigation of Synthetic Spoken Misinformation Detection | Sep 17, 2024 | Misinformationtext-to-speech | —Unverified | 0 |
| Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models | Jul 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SpoofCeleb: Speech Deepfake Detection and SASV In The Wild | Sep 18, 2024 | DeepFake DetectionDiversity | —Unverified | 0 |
| Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech | May 27, 2025 | Style Transfertext-to-speech | —Unverified | 0 |
| SQuId: Measuring Speech Naturalness in Many Languages | Oct 12, 2022 | Diversitytext-to-speech | —Unverified | 0 |
| kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech | Aug 20, 2024 | RetrievalSelf-Supervised Learning | —Unverified | 0 |
| Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting | Dec 28, 2024 | Speech Synthesistext-to-speech | —Unverified | 0 |
| StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations | Apr 23, 2024 | text-to-speechText to Speech | —Unverified | 0 |
| Streaming Non-Autoregressive Model for Accent Conversion and Pronunciation Improvement | Jun 19, 2025 | text-to-speechText to Speech | —Unverified | 0 |
| Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation | Feb 4, 2025 | Change DetectionGender Classification | —Unverified | 0 |
| StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling | Jun 14, 2025 | text-to-speechText to Speech | —Unverified | 0 |
| Structural Analysis of Hindi Phonetics and A Method for Extraction of Phonetically Rich Sentences from a Very Large Hindi Text Corpus | Jan 30, 2017 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Structured State Space Decoder for Speech Recognition and Synthesis | Oct 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions | May 30, 2023 | AllAutomatic Speech Recognition | —Unverified | 0 |
| STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent | Mar 28, 2022 | text-to-speechText to Speech | —Unverified | 0 |
| Study of Indian English Pronunciation Variabilities relative to Received Pronunciation | Apr 13, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech | Nov 4, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN | Oct 27, 2023 | DecoderDenoising | —Unverified | 0 |
| Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models | Oct 6, 2021 | text-to-speechText to Speech | —Unverified | 0 |
| StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis | Sep 24, 2024 | Speech Synthesistext-to-speech | —Unverified | 0 |
| Style Mixture of Experts for Expressive Text-To-Speech Synthesis | Jun 5, 2024 | Mixture-of-ExpertsSpeech Synthesis | —Unverified | 0 |
| STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech | Mar 17, 2021 | Speech SynthesisStyle Transfer | —Unverified | 0 |
| Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation | Aug 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion | Sep 16, 2024 | Speech Synthesistext-to-speech | —Unverified | 0 |
| Style Variation as a Vantage Point for Code-Switching | May 1, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System | Mar 29, 2025 | Speech Synthesistext-to-speech | —Unverified | 0 |
| Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition | Jun 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SyncSpeech: Low-Latency and Efficient Dual-Stream Text-to-Speech based on Temporal Masked Transformer | Feb 16, 2025 | text-to-speechText to Speech | —Unverified | 0 |
| Syntactic representation learning for neural network based TTS with syntactic parse tree traversal | Dec 13, 2020 | DiversityRepresentation Learning | —Unverified | 0 |
| Synth2Aug: Cross-domain speaker recognition with TTS synthesized speech | Nov 24, 2020 | Data AugmentationSpeaker Recognition | —Unverified | 0 |
| Synth4Kws: Synthesized Speech for User Defined Keyword Spotting in Low Resource Environments | Jul 23, 2024 | DiversityKeyword Spotting | —Unverified | 0 |
| SynthASR: Unlocking Synthetic Data for Speech Recognition | Jun 14, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |