| Training end-to-end speech-to-text models on mobile phones | Dec 7, 2021 | CPUSpeech-to-Text | —Unverified | 0 |
| Improve Sinhala Speech Recognition Through e2e LF-MMI Model | Dec 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An Experiment on Speech-to-Text Translation Systems for Manipuri to English on Low Resource Setting | Dec 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Impact of Microphone position Measurement Error on Multi Channel Distant Speech Recognition & Intelligibility | Dec 1, 2021 | Distant Speech RecognitionPosition | —Unverified | 0 |
| Cross Attention Augmented Transducer Networks for Simultaneous Translation | Nov 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Scribosermo: Fast Speech-to-Text models for German and other Languages | Oct 15, 2021 | Speech RecognitionSpeech-to-Text | CodeCode Available | 0 |
| Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems | Oct 13, 2021 | SentenceSimultaneous Speech-to-Text Translation | —Unverified | 0 |
| Comparison of SVD and factorized TDNN approaches for speech to text | Oct 13, 2021 | Speech-to-Text | —Unverified | 0 |
| A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation | Oct 11, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Automated Testing of AI Models | Oct 7, 2021 | FairnessSpeech-to-Text | —Unverified | 0 |
| EdiTTS: Score-based Editing for Controllable Text-to-Speech | Oct 6, 2021 | Speech SynthesisSpeech-to-Text | CodeCode Available | 1 |
| Late reverberation suppression using U-nets | Oct 5, 2021 | DecoderSpeech Dereverberation | CodeCode Available | 1 |
| Challenges and Opportunities of Speech Recognition for Bengali Language | Sep 27, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Audio Interval Retrieval using Convolutional Neural Networks | Sep 21, 2021 | Audio ClassificationRetrieval | —Unverified | 0 |
| Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition | Sep 19, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Infusing Future Information into Monotonic Attention Through Language Models | Sep 7, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Speech Emotion Recognition with Multi-Task Learning | Sep 6, 2021 | Emotion ClassificationEmotion Recognition | CodeCode Available | 1 |
| One TTS Alignment To Rule Them All | Aug 23, 2021 | AllSpeech Synthesis | CodeCode Available | 1 |
| With One Voice: Composing a Travel Voice Assistant from Re-purposed Models | Aug 4, 2021 | BIG-bench Machine Learningnamed-entity-recognition | —Unverified | 0 |
| Corpus Creation and Evaluation for Speech-to-Text and Speech Translation | Aug 1, 2021 | Machine TranslationSpeech-to-Text | —Unverified | 0 |
| BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text | Aug 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multilingual Speech Translation from Efficient Finetuning of Pretrained Models | Aug 1, 2021 | DecoderSpeech-to-Text | —Unverified | 0 |
| A Large-Scale Chinese Multimodal NER Dataset with Speech Clues | Aug 1, 2021 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 1 |
| Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task | Jul 12, 2021 | DecoderKnowledge Distillation | —Unverified | 0 |
| Kosp2e: Korean Speech to English Translation Corpus | Jul 6, 2021 | speech-recognitionSpeech Recognition | CodeCode Available | 1 |
| The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 | Jul 1, 2021 | Data AugmentationSpeech-to-Text | —Unverified | 0 |
| Towards Automatic Speech to Sign Language Generation | Jun 24, 2021 | Speech-to-TextText Generation | CodeCode Available | 1 |
| Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling | Jun 21, 2021 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR | Jun 11, 2021 | Simultaneous Speech-to-Text TranslationSpeech-to-Text | —Unverified | 0 |
| TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS | Jun 10, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| On the Design of Strategic Task Recommendations for Sustainable Crowdsourcing-Based Content Moderation | Jun 4, 2021 | Recommendation SystemsSpeech-to-Text | —Unverified | 0 |
| Findings of the Second Workshop on Automatic Simultaneous Translation | Jun 1, 2021 | Machine TranslationSpeech-to-Text | —Unverified | 0 |
| Worldly Wise (WoW) - Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering | Jun 1, 2021 | Knowledge GraphsQuestion Answering | —Unverified | 0 |
| Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation | May 11, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| What shall we do with an hour of data? Speech recognition for the un- and under-served languages of Common Voice | May 10, 2021 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Learning Shared Semantic Space for Speech-to-Text Translation | May 7, 2021 | Machine TranslationSpeech-to-Text | CodeCode Available | 1 |
| A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect | May 7, 2021 | BenchmarkingSpeech-to-Text | —Unverified | 0 |
| Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models | Apr 25, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-end Speech Translation via Cross-modal Progressive Training | Apr 21, 2021 | Machine TranslationSpeech-to-Text | CodeCode Available | 1 |
| Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers | Apr 21, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition | Apr 5, 2021 | speech-recognitionSpeech Recognition | CodeCode Available | 0 |
| Multi-Discriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems | Mar 15, 2021 | Speech-to-Text | —Unverified | 0 |
| Towards Robust Speech-to-Text Adversarial Attack | Mar 15, 2021 | Adversarial AttackRoom Impulse Response (RIR) | —Unverified | 0 |
| Towards the evaluation of automatic simultaneous speech translation from a communicative perspective | Mar 15, 2021 | automatic-speech-translationInformativeness | —Unverified | 0 |
| Inductive biases, pretraining and fine-tuning jointly account for brain responses to speech | Feb 25, 2021 | Scene ClassificationSpeech-to-Text | —Unverified | 0 |
| NUVA: A Naming Utterance Verifier for Aphasia Treatment | Feb 10, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Audio Adversarial Examples: Attacks Using Vocal Masks | Feb 4, 2021 | Adversarial AttackSpeech-to-Text | —Unverified | 0 |
| Graph Neural Networks to Predict Customer Satisfaction Following Interactions with a Corporate Call Center | Jan 31, 2021 | Graph Neural NetworkSpeech-to-Text | —Unverified | 0 |
| BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge | Jan 29, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm | Jan 14, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |