| EdiTTS: Score-based Editing for Controllable Text-to-Speech | Oct 6, 2021 | Speech SynthesisSpeech-to-Text | CodeCode Available | 1 |
| Late reverberation suppression using U-nets | Oct 5, 2021 | DecoderSpeech Dereverberation | CodeCode Available | 1 |
| Challenges and Opportunities of Speech Recognition for Bengali Language | Sep 27, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Audio Interval Retrieval using Convolutional Neural Networks | Sep 21, 2021 | Audio ClassificationRetrieval | —Unverified | 0 |
| Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition | Sep 19, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Infusing Future Information into Monotonic Attention Through Language Models | Sep 7, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Speech Emotion Recognition with Multi-Task Learning | Sep 6, 2021 | Emotion ClassificationEmotion Recognition | CodeCode Available | 1 |
| One TTS Alignment To Rule Them All | Aug 23, 2021 | AllSpeech Synthesis | CodeCode Available | 1 |
| With One Voice: Composing a Travel Voice Assistant from Re-purposed Models | Aug 4, 2021 | BIG-bench Machine Learningnamed-entity-recognition | —Unverified | 0 |
| Corpus Creation and Evaluation for Speech-to-Text and Speech Translation | Aug 1, 2021 | Machine TranslationSpeech-to-Text | —Unverified | 0 |