| Kurdish (Sorani) Speech to Text: Presenting an Experimental Dataset | Nov 29, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | Jun 29, 2022 | Intent ClassificationSlot Filling | CodeCode Available | 0 |
| CoVoSwitch: Machine Translation of Synthetic Code-Switched Text Based on Intonation Units | Jul 19, 2024 | Machine TranslationSpeech-to-Text | CodeCode Available | 0 |
| Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning | Sep 21, 2016 | DecoderMulti-Task Learning | CodeCode Available | 0 |
| Let's Give a Voice to Conversational Agents in Virtual Reality | Aug 4, 2023 | Speech-to-Texttext-to-speech | CodeCode Available | 0 |
| fairseq S2T: Fast Speech-to-Text Modeling with fairseq | Oct 11, 2020 | Machine TranslationMulti-Task Learning | CodeCode Available | 0 |
| BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation System | May 29, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Tools and resources for Romanian text-to-speech and speech-to-text applications | Feb 15, 2018 | speech-recognitionSpeech Recognition | CodeCode Available | 0 |
| LibriS2S: A German-English Speech-to-Speech Translation Corpus | Apr 22, 2022 | Speech-to-Speech TranslationSpeech-to-Text | CodeCode Available | 0 |
| Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects | Jun 27, 2024 | Automatic Speech RecognitionMachine Translation | CodeCode Available | 0 |
| Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models | Jul 9, 2024 | coreference-resolutionCoreference Resolution | CodeCode Available | 0 |
| Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation | Dec 6, 2016 | Speech-to-TextSpeech-to-Text Translation | CodeCode Available | 0 |
| Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision | Dec 30, 2023 | Speech-to-TextSpeech-to-Text Translation | CodeCode Available | 0 |
| Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair | Apr 18, 2024 | Machine TranslationSpeech-to-Text | CodeCode Available | 0 |
| Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments | Feb 21, 2022 | Data AugmentationPhoneme Recognition | CodeCode Available | 0 |
| SparQLe: Speech Queries to Text Translation Through LLMs | Feb 13, 2025 | Speech-to-TextSpeech-to-Text Translation | CodeCode Available | 0 |
| Streaming Sequence Transduction through Dynamic Compression | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation | Jul 3, 2022 | DecoderSpeech-to-Text | CodeCode Available | 0 |
| A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion | Jul 21, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks | Feb 19, 2025 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition | Dec 23, 2021 | BenchmarkingDeep Learning | CodeCode Available | 0 |
| End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic Hands | Sep 22, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Pre-training on high-resource speech recognition improves low-resource speech-to-text translation | Sep 5, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture | Jan 6, 2022 | Speech-to-Texttext-to-speech | CodeCode Available | 0 |
| Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation | Dec 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |