| Seamless: Multilingual Expressive and Streaming Speech Translation | Dec 8, 2023 | automatic-speech-translationMachine Translation | CodeCode Available | 6 | 5 |
| MooER: LLM-based Speech Recognition and Translation Models from Moore Threads | Aug 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 3 | 5 |
| BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages | Nov 7, 2024 | automatic-speech-translationSynthetic Data Generation | CodeCode Available | 1 | 5 |
| End-to-End Automatic Speech Translation of Audiobooks | Feb 12, 2018 | automatic-speech-translationSpeech-to-Text | CodeCode Available | 0 | 5 |
| Word Level Timestamp Generation for Automatic Speech Recognition and Translation | May 21, 2025 | Automatic Speech Recognitionautomatic-speech-translation | CodeCode Available | 0 | 5 |
| Improving End-to-End Speech Translation by Imitation-Based Knowledge Distillation with Synthetic Transcripts | Jul 17, 2023 | automatic-speech-translationImitation Learning | CodeCode Available | 0 | 5 |
| SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation | Feb 27, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| EMMeTT: Efficient Multimodal Machine Translation Training | Sep 20, 2024 | automatic-speech-translationDecoder | —Unverified | 0 | 0 |
| ELITR Multilingual Live Subtitling: Demo and Strategy | Apr 1, 2021 | automatic-speech-translationTranslation | —Unverified | 0 | 0 |
| Chain-of-Thought Prompting for Speech Translation | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |