| Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation | May 17, 2020 | Computational Efficiencyspeech-recognition | —Unverified | 0 | 0 |
| Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding | Jun 8, 2023 | dialog state trackingLanguage Modeling | —Unverified | 0 | 0 |
| Speech-to-Text and Evaluation of Multiple Machine Translation Systems | Sep 1, 2022 | Machine TranslationSpeech-to-Text | —Unverified | 0 | 0 |
| Speech to text and text to speech recognition systems-Areview | Mar 17, 2018 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| Speech-to-Text Translation with Phoneme-Augmented CoT: Enhancing Cross-Lingual Transfer in Low-Resource Scenarios | May 30, 2025 | Cross-Lingual TransferPhoneme Recognition | —Unverified | 0 | 0 |
| Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing? | Feb 19, 2024 | Speech-to-TextSpeech-to-Text Translation | —Unverified | 0 | 0 |
| SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation | Nov 3, 2024 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| SpiCE: A New Open-Access Corpus of Conversational Bilingual Speech in Cantonese and English | May 1, 2020 | SentenceSpeech-to-Text | —Unverified | 0 | 0 |
| Strategies for improving low resource speech to text translation relying on pre-trained ASR models | May 31, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 | 0 |
| StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection | Jun 10, 2024 | Speech-to-TextSpeech-to-Text Translation | —Unverified | 0 | 0 |