| Seamless: Multilingual Expressive and Streaming Speech Translation | Dec 8, 2023 | automatic-speech-translationMachine Translation | CodeCode Available | 6 | 5 |
| MooER: LLM-based Speech Recognition and Translation Models from Moore Threads | Aug 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 3 | 5 |
| BhasaAnuvaad: A Speech Translation Dataset for 13 Indian Languages | Nov 7, 2024 | automatic-speech-translationSynthetic Data Generation | CodeCode Available | 1 | 5 |
| Word Level Timestamp Generation for Automatic Speech Recognition and Translation | May 21, 2025 | Automatic Speech Recognitionautomatic-speech-translation | CodeCode Available | 0 | 5 |
| End-to-End Automatic Speech Translation of Audiobooks | Feb 12, 2018 | automatic-speech-translationSpeech-to-Text | CodeCode Available | 0 | 5 |
| Improving End-to-End Speech Translation by Imitation-Based Knowledge Distillation with Synthetic Transcripts | Jul 17, 2023 | automatic-speech-translationImitation Learning | CodeCode Available | 0 | 5 |
| SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation | Feb 27, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Breeding Gender-aware Direct Speech Translation Systems | Dec 9, 2020 | automatic-speech-translationMachine Translation | —Unverified | 0 | 0 |
| Granite-speech: open-source speech-aware LLMs with strong English ASR capabilities | May 13, 2025 | automatic-speech-translationBenchmarking | —Unverified | 0 | 0 |
| Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Improved Cross-Lingual Transfer Learning For Automatic Speech Translation | Jun 1, 2023 | automatic-speech-translationCross-Lingual Transfer | —Unverified | 0 | 0 |
| Towards the evaluation of automatic simultaneous speech translation from a communicative perspective | Mar 15, 2021 | automatic-speech-translationInformativeness | —Unverified | 0 | 0 |
| LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models | Jun 5, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade | Sep 14, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| LiSTra, Automatic Speech Translation: English to Lingala case study | May 16, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| LiSTra Automatic Speech Translation: English to Lingala Case Study | Jun 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Deep Net Features for Complex Emotion Recognition | Oct 31, 2018 | automatic-speech-translationEmotion Recognition | —Unverified | 0 | 0 |
| Mu^2SLAM: Multitask, Multilingual Speech and Language Models | Dec 19, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Robustness of Multi-Source MT to Transcription Errors | May 26, 2023 | automatic-speech-translationMachine Translation | —Unverified | 0 | 0 |
| Chain-of-Thought Prompting for Speech Translation | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Development of Hybrid ASR Systems for Low Resource Medical Domain Conversational Telephone Speech | Oct 24, 2022 | automatic-speech-translationTranslation | —Unverified | 0 | 0 |
| ELITR Multilingual Live Subtitling: Demo and Strategy | Apr 1, 2021 | automatic-speech-translationTranslation | —Unverified | 0 | 0 |
| EMMeTT: Efficient Multimodal Machine Translation Training | Sep 20, 2024 | automatic-speech-translationDecoder | —Unverified | 0 | 0 |