PoeticTTS -- Controllable Poetry Reading for Literary Studies Jul 11, 2022 Speech Synthesis — Unverified 0
Speaker Anonymization with Phonetic Intermediate Representations Jul 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR) — Unverified 0
FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis Jul 8, 2022 Lip to Speech Synthesis Speech Synthesis Code Code Available 0
End-to-End Binaural Speech Synthesis Jul 8, 2022 Decoder Speech Synthesis — Unverified 0
Mix and Match: An Empirical Study on Training Corpus Composition for Polyglot Text-To-Speech (TTS) Jul 4, 2022 Speech Synthesis text-to-speech — Unverified 0
BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model Jul 4, 2022 Language Modeling Language Modelling — Unverified 0
Computer-assisted Pronunciation Training -- Speech synthesis is almost all you need Jul 2, 2022 All Speech Synthesis — Unverified 0
TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder Jun 30, 2022 Speech Synthesis text-to-speech — Unverified 0
R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS Jun 30, 2022 Decoder GPU — Unverified 0
iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis based on Disentanglement between Prosody and Timbre Jun 29, 2022 Disentanglement Speaker Identification — Unverified 0
Expressive, Variable, and Controllable Duration Modelling in TTS Jun 28, 2022 Normalising Flows Speech Synthesis — Unverified 0
Self-supervised Context-aware Style Representation for Expressive Speech Synthesis Jun 25, 2022 Contrastive Learning Deep Clustering — Unverified 0
WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis Jun 20, 2022 CPU Speech Synthesis — Unverified 0
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History Jun 16, 2022 Self-Supervised Learning Sentence — Unverified 0
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection Jun 15, 2022 feature selection Speech Synthesis — Unverified 0
Unsupervised TTS Acoustic Modeling for TTS with Conditional Disentangled Sequential VAE Jun 6, 2022 Representation Learning Speech Representation Learning — Unverified 0
Pronunciation Dictionary-Free Multilingual Speech Synthesis by Combining Unsupervised and Supervised Phonetic Representations Jun 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR) — Unverified 0
Building Open-source Speech Technology for Low-resource Minority Languages with Sámi as an Example – Tools, Methods and Experiments Jun 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR) — Unverified 0
BU-TTS: An Open-Source, Bilingual Welsh-English, Text-to-Speech Corpus Jun 1, 2022 Speech Synthesis text-to-speech — Unverified 0
SyntAct: A Synthesized Database of Basic Emotions Jun 1, 2022 Emotion Recognition Speech Emotion Recognition — Unverified 0
AiRO - an Interactive Learning Tool for Children at Risk of Dyslexia Jun 1, 2022 Speech Synthesis — Unverified 0
Exploring Transfer Learning for Urdu Speech Synthesis Jun 1, 2022 Speech Synthesis text-to-speech — Unverified 0
Investigating Inter- and Intra-speaker Voice Conversion using Audiobooks Jun 1, 2022 Speech Synthesis text-to-speech — Unverified 0
Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish May 31, 2022 Machine Translation Speech Synthesis Code Code Available 0
SDS-200: A Swiss German Speech to Standard German Text Corpus May 19, 2022 Speech Synthesis Translation Code Code Available 0
Macedonian Speech Synthesis for Assistive Technology Applications May 18, 2022 Deep Learning Pitch control — Unverified 0
Read the Room: Adapting a Robot's Voice to Ambient and Social Contexts May 10, 2022 Speech Synthesis Voice Conversion Code Code Available 0
ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence May 9, 2022 Speech Synthesis text-to-speech — Unverified 0
Attentive activation function for improving end-to-end spoofing countermeasure systems May 3, 2022 Speech Synthesis Voice Conversion — Unverified 0
Systematic Inequalities in Language Technology Performance across the World’s Languages May 1, 2022 Dependency Parsing Machine Translation Code Code Available 0
Improving Self-Supervised Learning-based MOS Prediction Networks Apr 23, 2022 Prediction Quantization Code Code Available 0
Exploration strategies for articulatory synthesis of complex syllable onsets Apr 20, 2022 Speech Synthesis Code Code Available 0
A Post Auto-regressive GAN Vocoder Focused on Spectrum Fracture Apr 12, 2022 Speech Synthesis — Unverified 0
Fine-grained Noise Control for Multispeaker Speech Synthesis Apr 11, 2022 Expressive Speech Synthesis Speech Synthesis — Unverified 0
The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance Apr 11, 2022 Speaker Verification Speech Synthesis — Unverified 0
MAESTRO: Matched Speech Text Representations through Modality Matching Apr 7, 2022 Language Modelling Self-Supervised Learning — Unverified 0
DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores Apr 7, 2022 Self-Supervised Learning Speech Synthesis — Unverified 0
Unsupervised Quantized Prosody Representation for Controllable Speech Synthesis Apr 7, 2022 Quantization Speech Synthesis — Unverified 0
Self-supervised learning for robust voice cloning Apr 7, 2022 Self-Supervised Learning Speech Synthesis — Unverified 0
SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis Apr 6, 2022 Speech Synthesis text-to-speech — Unverified 0
Simple and Effective Unsupervised Speech Synthesis Apr 6, 2022 speech-recognition Speech Recognition — Unverified 0
A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality Apr 5, 2022 Benchmarking Self-Supervised Learning — Unverified 0
VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature Apr 2, 2022 Speech Synthesis text-to-speech — Unverified 0
Universal Adaptor: Converting Mel-Spectrograms Between Different Configurations for Speech Synthesis Apr 1, 2022 Speech Synthesis Voice Conversion Code Code Available 0
Residual-guided Personalized Speech Synthesis based on Face Image Apr 1, 2022 Speech Synthesis — Unverified 0
AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios Apr 1, 2022 Speech Synthesis text-to-speech — Unverified 0
WavThruVec: Latent speech representation as intermediate features for neural speech synthesis Mar 31, 2022 Speech Synthesis text-to-speech — Unverified 0
Applying Syntax–Prosody Mapping Hypothesis and Prosodic Well-Formedness Constraints to Neural Sequence-to-Sequence Speech Synthesis Mar 29, 2022 Speech Synthesis text-to-speech — Unverified 0
Analysis of Voice Conversion and Code-Switching Synthesis Using VQ-VAE Mar 28, 2022 Speech Synthesis Voice Conversion — Unverified 0
Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis Mar 23, 2022 Expressive Speech Synthesis Knowledge Distillation — Unverified 0