| Attention-Based End-to-End Speech Recognition on Voice Search | Jul 22, 2017 | DecoderL2 Regularization | —Unverified | 0 |
| Improving Metrics for Speech Translation | May 22, 2023 | Speech-to-TextTranslation | —Unverified | 0 |
| Improving RNN-Transducers with Acoustic LookAhead | Jul 11, 2023 | HallucinationSpeech-to-Text | —Unverified | 0 |
| CoLLD: Contrastive Layer-to-layer Distillation for Compressing Multilingual Pre-trained Speech Encoders | Sep 14, 2023 | Contrastive LearningKnowledge Distillation | —Unverified | 0 |
| Improving Speech Recognition Accuracy Using Custom Language Models with the Vosk Toolkit | Mar 26, 2025 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task | Jul 12, 2021 | DecoderKnowledge Distillation | —Unverified | 0 |
| Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach | Oct 6, 2023 | Simultaneous Speech-to-Text TranslationSpeech-to-Text | —Unverified | 0 |
| Extending RNN-T-based speech recognition systems with emotion and language classification | Jul 28, 2022 | Emotion ClassificationEmotion Recognition | —Unverified | 0 |
| IMS-Speech: A Speech to Text Tool | Aug 13, 2019 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments | Jul 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Exploring Transfer Learning For End-to-End Spoken Language Understanding | Dec 15, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Exploration of End-to-End ASR for OpenSTT -- Russian Open Speech-to-Text Dataset | Jun 15, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems | Mar 10, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Infusing Future Information into Monotonic Attention Through Language Models | Sep 7, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multilingual Speech Translation with Efficient Finetuning of Pretrained Models | Oct 24, 2020 | Cross-Lingual TransferDecoder | —Unverified | 0 |
| A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation | Oct 11, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Interpreting Strategies Annotation in the WAW Corpus | Sep 1, 2017 | Machine TranslationSpeech-to-Text | —Unverified | 0 |
| Investigating Decoder-only Large Language Models for Speech-to-text Translation | Jul 3, 2024 | Decoderparameter-efficient fine-tuning | —Unverified | 0 |
| Jointly Trained Transformers models for Spoken Language Translation | Apr 25, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Language Model Augmented Monotonic Attention for Simultaneous Translation | Jul 1, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Isochrony-Controlled Speech-to-Text Translation: A study on translating from Sino-Tibetan to Indo-European Languages | Nov 11, 2024 | DecoderMachine Translation | —Unverified | 0 |
| I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs | Jun 17, 2025 | 3D visual groundingContrastive Learning | —Unverified | 0 |
| Existential Crisis: A Social Robot's Reason for Being | Jan 6, 2025 | Speech-to-Text | —Unverified | 0 |
| Evaluation of real-time transcriptions using end-to-end ASR models | Sep 9, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| CMU's IWSLT 2024 Simultaneous Speech Translation System | Aug 14, 2024 | DecoderSpeech-to-Text | —Unverified | 0 |