Source-Filter-Based Generative Adversarial Neural Vocoder for High Fidelity Speech Synthesis Apr 26, 2023 Speech Synthesis text-to-speech
Code Code Available 2RFWave: Multi-band Rectified Flow for Audio Waveform Reconstruction Mar 8, 2024 Audio Generation Computational Efficiency
Code Code Available 2Sample-Efficient Diffusion for Text-To-Speech Synthesis Sep 1, 2024 Language Modeling Language Modelling
Code Code Available 2P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting Sep 22, 2023 Decoder Speech Synthesis
Code Code Available 2Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram Oct 25, 2019 Generative Adversarial Network GPU
Code Code Available 2Neural Speech Synthesis with Transformer Network Sep 19, 2018 Decoder Machine Translation
Code Code Available 2FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis Apr 21, 2022 Denoising GPU
Code Code Available 2SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description Aug 24, 2024 Descriptive Speech Synthesis
Code Code Available 2OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis Jan 8, 2025 Decoder Emotional Speech Synthesis
Code Code Available 2Conditional Diffusion Probabilistic Model for Speech Enhancement Feb 10, 2022 model Speech Enhancement
Code Code Available 2RapFlow-TTS: Rapid and High-Fidelity Text-to-Speech with Improved Consistency Flow Matching Jun 20, 2025 Scheduling Speech Synthesis
Code Code Available 2LPCNet: Improving Neural Speech Synthesis Through Linear Prediction Oct 28, 2018 Prediction Speech Synthesis
Code Code Available 2NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers Apr 18, 2023 In-Context Learning Speech Synthesis
Code Code Available 2BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis Mar 25, 2022 Image Generation Speech Synthesis
Code Code Available 2A Vector Quantized Approach for Text to Speech Synthesis on Real-World Spontaneous Speech Feb 8, 2023 Code Generation Diversity
Code Code Available 2BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network Sep 6, 2023 Generative Adversarial Network Speech Synthesis
Code Code Available 2LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT Oct 7, 2023 Audio captioning Automatic Speech Recognition
Code Code Available 2Lina-Speech: Gated Linear Attention is a Fast and Parameter-Efficient Learner for text-to-speech synthesis Oct 30, 2024 Speech Synthesis text-to-speech
Code Code Available 2Improving Opus Low Bit Rate Quality with Neural Speech Synthesis Aug 10, 2020 Decoder Speech Synthesis
Code Code Available 2HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform Sep 18, 2023 Speech Synthesis
Code Code Available 2iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform Mar 4, 2022 Speech Synthesis text-to-speech
Code Code Available 2GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech May 15, 2022 Speech Synthesis Style Transfer
Code Code Available 2Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows Mar 3, 2022 Speech Synthesis text-to-speech
Code Code Available 2CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model May 11, 2023 Denoising GPU
Code Code Available 2CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers and Consistency Models Mar 31, 2024 Denoising Speech Synthesis
Code Code Available 2Generative Adversarial Training for Text-to-Speech Synthesis Based on Raw Phonetic Input and Explicit Prosody Modelling Oct 14, 2023 Speech Synthesis text-to-speech
Code Code Available 2HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis Oct 12, 2020 CPU GPU
Code Code Available 2Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness Apr 10, 2024 Speech Synthesis text-to-speech
Code Code Available 2NaturalSpeech: End-to-End Text to Speech Synthesis with Human-Level Quality May 9, 2022 Sentence Speech Synthesis
Code Code Available 2Using Speech Synthesis to Train End-to-End Spoken Language Understanding Models Oct 21, 2019 Data Augmentation Natural Language Understanding
Code Code Available 2Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis May 12, 2020 Speech Synthesis Style Transfer
Code Code Available 1FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection Oct 18, 2021 Speech Synthesis Synthetic Speech Detection
Code Code Available 1Fine-Grained and Interpretable Neural Speech Editing Jul 7, 2024 Data Augmentation Speech Synthesis
Code Code Available 1FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis Oct 27, 2022 Speech Synthesis text-to-speech
Code Code Available 1Fine-grained style control in Transformer-based Text-to-speech Synthesis Oct 12, 2021 Inductive Bias Speech Synthesis
Code Code Available 1FonBund: A Library for Combining Cross-lingual Phonological Segment Data May 1, 2018 Language Modeling Language Modelling
Code Code Available 1FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis Jun 29, 2021 Speech Synthesis text-to-speech
Code Code Available 1FastSpeech 2: Fast and High-Quality End-to-End Text to Speech Jun 8, 2020 Knowledge Distillation Speech Synthesis
Code Code Available 1Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech Oct 1, 2023 speech-recognition Speech Recognition
Code Code Available 1Generative Expressive Conversational Speech Synthesis Jul 31, 2024 Speech Synthesis
Code Code Available 1Exploring Transfer Learning for Low Resource Emotional TTS Jan 14, 2019 Deep Learning Emotional Speech Synthesis
Code Code Available 1From Speaker Verification to Multispeaker Speech Synthesis, Deep Transfer with Feedback Constraint May 10, 2020 Speaker Verification Speech Synthesis
Code Code Available 1Enhancing Speech Intelligibility in Text-To-Speech Synthesis using Speaking Style Conversion Aug 13, 2020 Speech Synthesis text-to-speech
Code Code Available 1TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese May 11, 2020 Denoising Speech Synthesis
Code Code Available 1End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions May 19, 2022 Speech Synthesis Style Transfer
Code Code Available 1EmoSpeech: Guiding FastSpeech2 Towards Emotional Text to Speech Jun 28, 2023 Emotion Recognition Speech Synthesis
Code Code Available 1EMNS /Imz/ Corpus: An emotive single-speaker dataset for narrative storytelling in games, television and graphic novels May 22, 2023 Expressive Speech Synthesis Speech Synthesis
Code Code Available 1Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling Dec 19, 2023 Contrastive Learning Speech Synthesis
Code Code Available 1APNet2: High-quality and High-efficiency Neural Vocoder with Direct Prediction of Amplitude and Phase Spectra Nov 20, 2023 Speech Synthesis
Code Code Available 1EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting Dec 31, 2020 Keyword Spotting Keyword Spotting CSS
Code Code Available 1