| Pre-training for Speech Translation: CTC Meets Optimal Transport | Jan 27, 2023 | Multi-Task LearningSpeech-to-Text | CodeCode Available | 1 |
| Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition | Jan 6, 2023 | Domain AdaptationGPU | —Unverified | 0 |
| Pushing the performances of ASR models on English and Spanish accents | Dec 22, 2022 | Speech-to-Text | —Unverified | 0 |
| WACO: Word-Aligned Contrastive Learning for Speech Translation | Dec 19, 2022 | Contrastive LearningSpeech-to-Text | CodeCode Available | 0 |
| M3ST: Mix at Three Levels for Speech Translation | Dec 7, 2022 | Data AugmentationDiversity | —Unverified | 0 |
| MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition | Nov 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Handling and extracting key entities from customer conversations using Speech recognition and Named Entity recognition | Nov 28, 2022 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search | Oct 31, 2022 | Emotion RecognitionNeural Architecture Search | —Unverified | 0 |
| Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili | Oct 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficient Speech Translation with Dynamic Latent Perceivers | Oct 28, 2022 | Speech-to-TextSpeech-to-Text Translation | CodeCode Available | 0 |