| CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR | Nov 7, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| DARTS: Dialectal Arabic Transcription System | Sep 26, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SpeechAlign: a Framework for Speech Translation Alignment Evaluation | Sep 20, 2023 | Speech-to-TextSpeech-to-Text Translation | —Unverified | 0 |
| Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody? | Oct 31, 2024 | Rhythmspeech-recognition | —Unverified | 0 |
| Speech Recognition Web Services for Dutch | May 1, 2014 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speech to Speech Translation with Translatotron: A State of the Art Review | Feb 9, 2025 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation | May 17, 2020 | Computational Efficiencyspeech-recognition | —Unverified | 0 |
| Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding | Jun 8, 2023 | dialog state trackingLanguage Modeling | —Unverified | 0 |
| Speech-to-Text and Evaluation of Multiple Machine Translation Systems | Sep 1, 2022 | Machine TranslationSpeech-to-Text | —Unverified | 0 |
| Speech to text and text to speech recognition systems-Areview | Mar 17, 2018 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Speech-to-Text Translation with Phoneme-Augmented CoT: Enhancing Cross-Lingual Transfer in Low-Resource Scenarios | May 30, 2025 | Cross-Lingual TransferPhoneme Recognition | —Unverified | 0 |
| Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing? | Feb 19, 2024 | Speech-to-TextSpeech-to-Text Translation | —Unverified | 0 |
| SpiCE: A New Open-Access Corpus of Conversational Bilingual Speech in Cantonese and English | May 1, 2020 | SentenceSpeech-to-Text | —Unverified | 0 |
| Strategies for improving low resource speech to text translation relying on pre-trained ASR models | May 31, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions | May 30, 2023 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Subtitles to Segmentation: Improving Low-Resource Speech-to-TextTranslation Pipelines | May 1, 2020 | Cross-Lingual Information RetrievalInformation Retrieval | —Unverified | 0 |
| Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines | Oct 19, 2020 | Cross-Lingual Information RetrievalInformation Retrieval | —Unverified | 0 |
| Swiss German Speech to Text system evaluation | Jul 1, 2022 | Speech-to-Text | —Unverified | 0 |
| Syllable based DNN-HMM Cantonese Speech to Text System | Feb 13, 2024 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Synthetic Query Generation using Large Language Models for Virtual Assistants | Jun 10, 2024 | Information Retrievalspeech-recognition | —Unverified | 0 |
| System Description on Automatic Simultaneous Translation Workshop | Jul 1, 2022 | SentenceSpeech-to-Text | —Unverified | 0 |
| TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS | Jun 10, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale | Feb 27, 2025 | AI AgentLarge Language Model | —Unverified | 0 |
| Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise | Jun 13, 2019 | Data AugmentationDecoder | —Unverified | 0 |
| The 2016 KIT IWSLT Speech-to-Text Systems for English and German | Dec 1, 2016 | Speech-to-Text | —Unverified | 0 |
| The 2017 KIT IWSLT Speech-to-Text Systems for English and German | Dec 1, 2017 | Speech-to-Text | —Unverified | 0 |
| The AISP-SJTU Simultaneous Translation System for IWSLT 2022 | May 1, 2022 | Speech-to-TextTranslation | —Unverified | 0 |
| The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation | May 1, 2022 | ChunkingSentence | —Unverified | 0 |
| The IWSLT 2019 Evaluation Campaign | Nov 1, 2019 | Speech-to-TextTranslation | —Unverified | 0 |
| The MIT Voice Name System | Mar 28, 2022 | Speech-to-Text | —Unverified | 0 |
| The Nós Project: Opening routes for the Galician language in the field of language technologies | Jun 1, 2022 | Cultural Vocal Bursts Intensity PredictionMachine Translation | —Unverified | 0 |
| The Spotify Podcast Dataset | Apr 8, 2020 | Speech-to-Text | —Unverified | 0 |
| The USFD Spoken Language Translation System for IWSLT 2014 | Sep 13, 2015 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 | Jul 1, 2021 | Data AugmentationSpeech-to-Text | —Unverified | 0 |
| Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck | Oct 15, 2024 | Speech-to-Text | —Unverified | 0 |
| Toward Automated Clinical Transcriptions | Sep 20, 2024 | Speech-to-Text | —Unverified | 0 |
| Toward Joint Language Modeling for Speech Units and Text | Oct 12, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool | Jun 1, 2022 | Sign Language TranslationSpeech-to-Text | —Unverified | 0 |
| Towards Robust Speech-to-Text Adversarial Attack | Mar 15, 2021 | Adversarial AttackRoom Impulse Response (RIR) | —Unverified | 0 |
| Towards speech-to-text translation without speech recognition | Feb 13, 2017 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards the evaluation of automatic simultaneous speech translation from a communicative perspective | Mar 15, 2021 | automatic-speech-translationInformativeness | —Unverified | 0 |
| Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders | Jul 2, 2024 | Clusteringspeaker-diarization | —Unverified | 0 |
| Towards Unsupervised Speech-to-Text Translation | Nov 4, 2018 | DenoisingLanguage Modeling | —Unverified | 0 |
| Training end-to-end speech-to-text models on mobile phones | Dec 7, 2021 | CPUSpeech-to-Text | —Unverified | 0 |
| Transducer Consistency Regularization for Speech to Text Applications | Oct 9, 2024 | Model OptimizationSpeech-to-Text | —Unverified | 0 |
| Transferable speech-to-text large language model alignment module | Jun 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces | May 18, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unsupervised Data Validation Methods for Efficient Model Training | Oct 10, 2024 | Data Augmentationmodel | —Unverified | 0 |
| Unveiling the Role of Pretraining in Direct Speech Translation | Sep 26, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition | Jan 6, 2023 | Domain AdaptationGPU | —Unverified | 0 |