Assessing Evaluation Metrics for Speech-to-Speech Translation Oct 26, 2021 Machine Translation Open-Ended Question Answering
— Unverified 0DelightfulTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2021 Oct 25, 2021 Speech Synthesis text-to-speech
Code Code Available 0CaloFlow II: Even Faster and Still Accurate Generation of Calorimeter Showers with Normalizing Flows Oct 21, 2021 Speech Synthesis
Code Code Available 0Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition Oct 21, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection Oct 18, 2021 Speech Synthesis Synthetic Speech Detection
Code Code Available 1From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation Oct 15, 2021 Data Augmentation Simultaneous Speech-to-Speech Translation
— Unverified 0Direct Simultaneous Speech-to-Speech Translation with Variational Monotonic Multihead Attention Oct 15, 2021 Simultaneous Speech-to-Speech Translation Speech Synthesis
— Unverified 0SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing Oct 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation Oct 14, 2021 Generative Adversarial Network GPU
— Unverified 0Systematic Inequalities in Language Technology Performance across the World's Languages Oct 13, 2021 Dependency Parsing Machine Translation
Code Code Available 0DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding Oct 13, 2021 Speech Synthesis Voice Conversion
— Unverified 0Fine-grained style control in Transformer-based Text-to-speech Synthesis Oct 12, 2021 Inductive Bias Speech Synthesis
Code Code Available 1LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example Oct 11, 2021 Speech Synthesis
— Unverified 0Using multiple reference audios and style embedding constraints for speech synthesis Oct 9, 2021 Sentence Sentence Similarity
— Unverified 0Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis Oct 9, 2021 Lifelong learning Speech Synthesis
Code Code Available 0Environment Aware Text-to-Speech Synthesis Oct 8, 2021 Attribute Disentanglement
— Unverified 0Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech Oct 8, 2021 Emotion Interpretation Expressive Speech Synthesis
Code Code Available 1VisualTTS: TTS with Accurate Lip-Speech Synchronization for Automatic Voice Over Oct 7, 2021 Speech Synthesis text-to-speech
— Unverified 0StrengthNet: Deep Learning-based Emotion Strength Assessment for Emotional Speech Synthesis Oct 7, 2021 Attribute Data Augmentation
Code Code Available 1Cloning one's voice using very limited data in the wild Oct 7, 2021 Speech Synthesis
— Unverified 0Mixer-TTS: non-autoregressive, fast and compact text-to-speech model conditioned on language model embeddings Oct 7, 2021 Language Modeling Language Modelling
Code Code Available 1GANtron: Emotional Speech Synthesis with Generative Adversarial Networks Oct 6, 2021 Emotional Speech Synthesis Speech Synthesis
— Unverified 0EdiTTS: Score-based Editing for Controllable Text-to-Speech Oct 6, 2021 Speech Synthesis Speech-to-Text
Code Code Available 1Prosody-TTS: An end-to-end speech synthesis system with prosody control Oct 6, 2021 Rhythm Speech Synthesis
— Unverified 0On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis Oct 4, 2021 Knowledge Distillation Speech Synthesis
— Unverified 0Neural Speech Synthesis in German Oct 3, 2021 Speech Synthesis text-to-speech
— Unverified 0Incorporating speaker embedding and post-filter network for improving speaker similarity of personalized speech synthesis system Oct 1, 2021 Speaker Verification Speech Synthesis
— Unverified 0Conditioning Sequence-to-sequence Networks with Learned Activations Sep 29, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech-MLP: a simple MLP architecture for speech processing Sep 29, 2021 Keyword Spotting Speech Enhancement
— Unverified 0SynCLR: A Synthesis Framework for Contrastive Learning of out-of-domain Speech Representations Sep 29, 2021 Contrastive Learning Data Augmentation
— Unverified 0Guided-TTS:Text-to-Speech with Untranscribed Speech Sep 29, 2021 Speech Synthesis text-to-speech
— Unverified 0Diffusion-Based Voice Conversion with Fast Maximum Likelihood Sampling Scheme Sep 28, 2021 Speech Synthesis Voice Conversion
Code Code Available 1FlowVocoder: A small Footprint Neural Vocoder based Normalizing flow for Speech Synthesis Sep 27, 2021 Density Estimation Speech Synthesis
— Unverified 0Low-Latency Incremental Text-to-Speech Synthesis with Distilled Context Prediction Network Sep 22, 2021 Knowledge Distillation Language Modeling
— Unverified 0"Hello, It's Me": Deep Learning-based Speech Synthesis Attacks in the Real World Sep 20, 2021 Deep Learning Speaker Recognition
— Unverified 0On-device neural speech synthesis Sep 17, 2021 GPU Speech Synthesis
— Unverified 0fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit Sep 14, 2021 Speech Synthesis text-to-speech
Code Code Available 0Referee: Towards reference-free cross-speaker style transfer with low-quality data for expressive speech synthesis Sep 8, 2021 Expressive Speech Synthesis Sentence
— Unverified 0Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection Sep 1, 2021 Speaker Verification Speech Synthesis
— Unverified 0Neural Sequence-to-Sequence Speech Synthesis Using a Hidden Semi-Markov Model Based Structured Attention Mechanism Aug 31, 2021 Speech Synthesis
— Unverified 0Neural HMMs are all you need (for high-quality attention-free TTS) Aug 30, 2021 All Speech Synthesis
Code Code Available 1Full Attention Bidirectional Deep Learning Structure for Single Channel Speech Enhancement Aug 27, 2021 Audio Signal Processing Speech Enhancement
— Unverified 0Integrated Speech and Gesture Synthesis Aug 25, 2021 Speech Synthesis text-to-speech
Code Code Available 0One TTS Alignment To Rule Them All Aug 23, 2021 All Speech Synthesis
Code Code Available 1A Unified Transformer-based Framework for Duplex Text Normalization Aug 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing audio quality for expressive Neural Text-to-Speech Aug 13, 2021 Acoustic Modelling Speech Synthesis
— Unverified 0A Streamwise GAN Vocoder for Wideband Speech Coding at Very Low Bit Rate Aug 9, 2021 Speech Synthesis
— Unverified 0Improved pronunciation prediction accuracy using morphology Aug 1, 2021 LEMMA Morphological Inflection
— Unverified 0Cross-speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis Jul 27, 2021 Expressive Speech Synthesis Speech Synthesis
— Unverified 0Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning Jul 21, 2021 Diversity Music Generation
Code Code Available 1