Building Open Javanese and Sundanese Corpora for Multilingual Text-to-Speech | May 1, 2018 | Automatic Speech Recognition (ASR), Speech Recognition | Unverified 0
Speaker-independent raw waveform model for glottal excitation | Apr 25, 2018 | model, Speech Synthesis | Unverified 0
A comparison of recent waveform generation and acoustic modeling methods for neural-network-based speech synthesis | Apr 7, 2018 | Speech Synthesis | Unverified 0
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder | Apr 6, 2018 | Expressive Speech Synthesis, Speech Synthesis | Unverified 0
Neural Autoregressive Flows | Apr 3, 2018 | Density Estimation, Speech Synthesis | Code Available 0
Speech waveform synthesis from MFCC sequences with generative adversarial networks | Apr 3, 2018 | Generative Adversarial Network, Speech Synthesis | Code Available 0
High-quality nonparallel voice conversion based on cycle-consistent adversarial network | Apr 2, 2018 | Generative Adversarial Network, Image-to-Image Translation | Unverified 0
Machine Speech Chain with One-shot Speaker Adaptation | Mar 28, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron | Mar 24, 2018 | Expressive Speech Synthesis, Speech Synthesis | Code Available 1
Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis | Mar 23, 2018 | Speech Synthesis, Style Transfer | Code Available 1
Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data | Mar 2, 2018 | Generative Adversarial Network, Speech Enhancement | Unverified 0
Deep Feed-forward Sequential Memory Networks for Speech Synthesis | Feb 26, 2018 | speech-recognition, Speech Recognition | Unverified 0
Efficient Neural Audio Synthesis | Feb 23, 2018 | Audio Synthesis, CPU | Code Available 2
Fitting New Speakers Based on a Short Untranscribed Sample | Feb 20, 2018 | Speech Synthesis, text-to-speech | Unverified 0
Tools and resources for Romanian text-to-speech and speech-to-text applications | Feb 15, 2018 | speech-recognition, Speech Recognition | Code Available 0
Neural Voice Cloning with a Few Samples | Feb 14, 2018 | Speech Synthesis, Voice Cloning | Code Available 0
Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals | Jan 19, 2018 | Speech Synthesis, Voice Conversion | Code Available 0
Synthesizing Audio for Hindi WordNet | Jan 1, 2018 | Speech Synthesis | Unverified 0
pyiwn: A Python based API to access Indian Language WordNets | Jan 1, 2018 | Speech Synthesis | Unverified 0
POLICY DRIVEN GENERATIVE ADVERSARIAL NETWORKS FOR ACCENTED SPEECH GENERATION | Jan 1, 2018 | Speech Synthesis | Unverified 0
HybridNet: A Hybrid Neural Architecture to Speed-up Autoregressive Models | Jan 1, 2018 | Speech Synthesis, text-to-speech | Unverified 0
Merging K-means with hierarchical clustering for identifying general-shaped groups | Dec 23, 2017 | Clustering, Density Estimation | Unverified 0
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions | Dec 16, 2017 | Speech Synthesis | Code Available 1
Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform | Dec 13, 2017 | Speech Synthesis, text-to-speech | Unverified 0
Parallel WaveNet: Fast High-Fidelity Speech Synthesis | Nov 28, 2017 | Speech Synthesis, Vocal Bursts Intensity Prediction | Code Available 0
完全基於類神經網路之語音合成系統初步研究 (A Preliminary Study on Fully Neural Network-based Speech Synthesis System) [In Chinese] | Nov 1, 2017 | Speech Synthesis | Unverified 0
SUT System Description for Anti-Spoofing 2017 Challenge | Nov 1, 2017 | Quantization, Speaker Verification | Unverified 0
Uncovering Latent Style Factors for Expressive Speech Synthesis | Nov 1, 2017 | Expressive Speech Synthesis, Speech Synthesis | Unverified 0
JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis | Oct 28, 2017 | BIG-bench Machine Learning, Speech Synthesis | Code Available 0
Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning | Oct 20, 2017 | GPU, Speech Synthesis | Code Available 0
Statistical Parametric Speech Synthesis Incorporating Generative Adversarial Networks | Sep 23, 2017 | Speech Synthesis, text-to-speech | Code Available 0
Fast and Accurate Decision Trees for Natural Language Processing Tasks | Sep 1, 2017 | Attribute, BIG-bench Machine Learning | Unverified 0
Using hyperlinks to improve multilingual partial parsers | Sep 1, 2017 | Machine Translation, Speech Synthesis | Code Available 0
Refer-iTTS: A System for Referring in Spoken Installments to Objects in Real-World Images | Sep 1, 2017 | Referring Expression, Referring expression generation | Unverified 0
Lexicon for Natural Language Generation in Spanish Adapted to Alternative and Augmentative Communication | Sep 1, 2017 | Speech Recognition, Speech Synthesis | Unverified 0
Listening while Speaking: Speech Chain by Deep Learning | Jul 16, 2017 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Hidden-Markov-Model Based Speech Enhancement | Jul 4, 2017 | model, Speech Enhancement | Unverified 0
PyDial: A Multi-domain Statistical Dialogue System Toolkit | Jul 1, 2017 | Dialogue Management, Speech Recognition | Unverified 0
A Variational EM Method for Pole-Zero Modeling of Speech with Mixed Block Sparse and Gaussian Excitation | Jun 24, 2017 | speech-recognition, Speech Recognition | Unverified 0
Deep Voice 2: Multi-Speaker Neural Text-to-Speech | May 24, 2017 | Speech Synthesis, text-to-speech | Code Available 0
I Probe, Therefore I Am: Designing a Virtual Journalist with Human Emotions | May 18, 2017 | Speech Synthesis | Unverified 0
Building and using language resources and infrastructure to develop e-learning programs for a minority language | May 1, 2017 | Language Acquisition, Speech Synthesis | Unverified 0
Aligning phonemes using finte-state methods | May 1, 2017 | Speech Synthesis, Spelling Correction | Unverified 0
Sampling-based speech parameter generation using moment-matching networks | Apr 12, 2017 | Speech Synthesis | Unverified 0
Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities | Apr 10, 2017 | speech-recognition, Speech Recognition | Unverified 0
Toward a Web-based Speech Corpus for Algerian Dialectal Arabic Varieties | Apr 1, 2017 | Speech Recognition, Speech Synthesis | Unverified 0
CASSANDRA: A multipurpose configurable voice-enabled human-computer-interface | Apr 1, 2017 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Tacotron: Towards End-to-End Speech Synthesis | Mar 29, 2017 | Audio Synthesis, Speech Synthesis | Code Available 1
Deep Voice: Real-time Neural Text-to-Speech | Feb 25, 2017 | Audio Synthesis, Boundary Detection | Code Available 0
DNN Filter Bank Cepstral Coefficients for Spoofing Detection | Feb 13, 2017 | Speaker Verification, Speech Synthesis | Unverified 0