| Seamless: Multilingual Expressive and Streaming Speech Translation | Dec 8, 2023 | automatic-speech-translationMachine Translation | CodeCode Available | 6 |
| MooER: LLM-based Speech Recognition and Translation Models from Moore Threads | Aug 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 3 |
| BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages | Nov 7, 2024 | automatic-speech-translationSynthetic Data Generation | CodeCode Available | 1 |
| Chain-of-Thought Prompting for Speech Translation | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ELITR Multilingual Live Subtitling: Demo and Strategy | Apr 1, 2021 | automatic-speech-translationTranslation | —Unverified | 0 |
| EMMeTT: Efficient Multimodal Machine Translation Training | Sep 20, 2024 | automatic-speech-translationDecoder | —Unverified | 0 |
| Development of Hybrid ASR Systems for Low Resource Medical Domain Conversational Telephone Speech | Oct 24, 2022 | automatic-speech-translationTranslation | —Unverified | 0 |
| Granite-speech: open-source speech-aware LLMs with strong English ASR capabilities | May 13, 2025 | automatic-speech-translationBenchmarking | —Unverified | 0 |
| Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Deep Net Features for Complex Emotion Recognition | Oct 31, 2018 | automatic-speech-translationEmotion Recognition | —Unverified | 0 |