Russian Stress Prediction using Maximum Entropy Ranking Oct 1, 2013 Machine Translation Prediction
— Unverified 0Zero-Shot Streaming Text to Speech Synthesis with Transducer and Auto-Regressive Modeling May 26, 2025 Sentence Speech Synthesis
— Unverified 0Zero-shot text-to-speech synthesis conditioned using self-supervised speech representation model Apr 24, 2023 Rhythm Self-Supervised Learning
— Unverified 0S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation Jun 11, 2025 Reading Comprehension Speech Synthesis
— Unverified 0SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction Jun 2, 2025 Speech Synthesis text-to-speech
— Unverified 0SALTTS: Leveraging Self-Supervised Speech Representations for improved Text-to-Speech Synthesis Aug 2, 2023 Decoder Self-Supervised Learning
— Unverified 0Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input Feb 19, 2021 Language Modeling Language Modelling
— Unverified 0Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis Dec 6, 2023 Speech Synthesis text-to-speech
— Unverified 0Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation May 16, 2020 Decoder Speech Synthesis
— Unverified 0Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain Jun 3, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models May 23, 2023 Speech Synthesis text-to-speech
— Unverified 0Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis May 18, 2025 Speech Synthesis text-to-speech
— Unverified 0Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS Nov 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs Jul 18, 2023 Generative Adversarial Network Language Modeling
— Unverified 0SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis Apr 6, 2022 Speech Synthesis text-to-speech
— Unverified 0Speaker-independent raw waveform model for glottal excitation Apr 25, 2018 model Speech Synthesis
— Unverified 0Speaker verification-derived loss and data augmentation for DNN-based multispeaker speech synthesis Jun 3, 2021 Data Augmentation Speaker Verification
— Unverified 0Aligning Opinions: Cross-Lingual Opinion Mining with Dependencies Jul 1, 2015 Coreference Resolution Named Entity Recognition (NER)
— Unverified 0Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention Oct 29, 2018 Speech Synthesis text-to-speech
— Unverified 0Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks Jul 26, 2024 Generative Adversarial Network Speech Enhancement
— Unverified 0Speech denoising by parametric resynthesis Apr 2, 2019 Denoising Resynthesis
— Unverified 0WebWOZ: A Platform for Designing and Conducting Web-based Wizard of Oz Experiments Aug 1, 2013 Machine Translation Speech Recognition
— Unverified 0Spontaneous Style Text-to-Speech Synthesis with Controllable Spontaneous Behaviors Based on Language Models Jul 18, 2024 Language Modeling Language Modelling
— Unverified 0A distributed cloud-based dialog system for conversational application development Sep 1, 2015 Speech Recognition Speech Synthesis
— Unverified 0Stable-TTS: Stable Speaker-Adaptive Text-to-Speech Synthesis via Prosody Prompting Dec 28, 2024 Speech Synthesis text-to-speech
— Unverified 0Dual Script E2E framework for Multilingual and Code-Switching ASR Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis Sep 24, 2024 Speech Synthesis text-to-speech
— Unverified 0Style Mixture of Experts for Expressive Text-To-Speech Synthesis Jun 5, 2024 Mixture-of-Experts Speech Synthesis
— Unverified 0What happens to diffusion model likelihood when your model is conditional? Sep 10, 2024 domain classification model
— Unverified 0A Challenge Set and Methods for Noun-Verb Ambiguity Oct 1, 2018 Speech Synthesis text-to-speech
— Unverified 0StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion Sep 16, 2024 Speech Synthesis text-to-speech
— Unverified 0Style Variation as a Vantage Point for Code-Switching May 1, 2020 Language Modeling Language Modelling
— Unverified 0SynPaFlex-Corpus: An Expressive French Audiobooks Corpus dedicated to expressive speech synthesis. May 1, 2018 Expressive Speech Synthesis Speech Synthesis
— Unverified 0What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS Sep 4, 2020 Decoder Sentence
— Unverified 0Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages Nov 1, 2022 Chunking Rhythm
— Unverified 0Text-free non-parallel many-to-many voice conversion using normalising flows Mar 15, 2022 Normalising Flows Speech Synthesis
— Unverified 0Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis Mar 27, 2023 All Automatic Speech Recognition
— Unverified 0Text Normalization and Unit Selection for a Memory Based Non Uniform Unit Selection TTS in Malayalam Dec 1, 2015 Speech Synthesis Text Normalization
— Unverified 0Texto4Science: a Quebec French Database of Annotated Short Text Messages May 1, 2012 Speech Synthesis Text-To-Speech Synthesis
— Unverified 0Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder Dec 16, 2022 Representation Learning Speech Synthesis
— Unverified 0Text-To-Speech Synthesis In The Wild Sep 13, 2024 Benchmarking Speaker Recognition
— Unverified 0The FruitShell French synthesis system at the Blizzard 2023 Challenge Sep 1, 2023 Data Augmentation Speech Synthesis
— Unverified 0The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance Apr 11, 2022 Speaker Verification Speech Synthesis
— Unverified 0The Theory behind Controllable Expressive Speech Synthesis: a Cross-disciplinary Approach Oct 14, 2019 Expressive Speech Synthesis Sociology
— Unverified 0The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains Oct 4, 2023 Speech Synthesis text-to-speech
— Unverified 0Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion Apr 6, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Accented Text-to-Speech Synthesis with Limited Data May 8, 2023 Speech Synthesis text-to-speech
— Unverified 0Towards Fully Automatic Annotation of Audio Books for TTS May 1, 2012 Speech Recognition Speech Synthesis
— Unverified 0AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis Apr 14, 2025 RAG Retrieval-augmented Generation
— Unverified 0Autoregressive Speech Synthesis without Vector Quantization Jul 11, 2024 Audio Compression Diversity
— Unverified 0