| Robust Semantic Communications for Speech Transmission | Mar 8, 2024 | Generative Adversarial NetworkSemantic Communication | —Unverified | 0 | 0 |
| Role of Intonation in Scoring Spoken English | Aug 23, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks | Jul 14, 2022 | Speech-to-Text | —Unverified | 0 | 0 |
| S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation | Jun 11, 2025 | Reading ComprehensionSpeech Synthesis | —Unverified | 0 | 0 |
| SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation | Oct 13, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation | May 17, 2022 | Representation LearningRetrieval | —Unverified | 0 | 0 |
| Self-Supervised Representations Improve End-to-End Speech Translation | Jun 22, 2020 | Cross-Lingual Transferspeech-recognition | —Unverified | 0 | 0 |
| Semantic-aware Speech to Text Transmission with Redundancy Removal | Feb 7, 2022 | Semantic CommunicationSpeech-to-Text | —Unverified | 0 | 0 |
| Semantic MIMO Systems for Speech-to-Text Transmission | May 13, 2024 | Semantic CommunicationSpeech-to-Text | —Unverified | 0 | 0 |
| Semantic-preserved Communication System for Highly Efficient Speech Transmission | May 25, 2022 | Semantic Communicationspeech-recognition | —Unverified | 0 | 0 |
| Simple and Effective Unsupervised Speech Translation | Oct 18, 2022 | Domain AdaptationMachine Translation | —Unverified | 0 | 0 |
| SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation | Jun 20, 2024 | Speech-to-TextSpeech-to-Text Translation | —Unverified | 0 | 0 |
| SimulSpeech: End-to-End Simultaneous Speech to Text Translation | Jul 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset | Apr 13, 2020 | Gaze PredictionSpeech-to-Text | —Unverified | 0 | 0 |
| Speaker Independent Continuous Speech to Text Converter for Mobile Application | Jul 19, 2013 | Action DetectionActivity Detection | —Unverified | 0 | 0 |
| Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction | May 8, 2013 | Speech SynthesisSpeech-to-Text | —Unverified | 0 | 0 |
| SpeechAlign: a Framework for Speech Translation Alignment Evaluation | Sep 20, 2023 | Speech-to-TextSpeech-to-Text Translation | —Unverified | 0 | 0 |
| Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody? | Oct 31, 2024 | Rhythmspeech-recognition | —Unverified | 0 | 0 |
| Speech Recognition Web Services for Dutch | May 1, 2014 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Speech to Speech Translation with Translatotron: A State of the Art Review | Feb 9, 2025 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation | May 17, 2020 | Computational Efficiencyspeech-recognition | —Unverified | 0 | 0 |
| Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding | Jun 8, 2023 | dialog state trackingLanguage Modeling | —Unverified | 0 | 0 |
| Speech-to-Text and Evaluation of Multiple Machine Translation Systems | Sep 1, 2022 | Machine TranslationSpeech-to-Text | —Unverified | 0 | 0 |
| Speech to text and text to speech recognition systems-Areview | Mar 17, 2018 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| Speech-to-Text Translation with Phoneme-Augmented CoT: Enhancing Cross-Lingual Transfer in Low-Resource Scenarios | May 30, 2025 | Cross-Lingual TransferPhoneme Recognition | —Unverified | 0 | 0 |
| Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing? | Feb 19, 2024 | Speech-to-TextSpeech-to-Text Translation | —Unverified | 0 | 0 |
| SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation | Nov 3, 2024 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| SpiCE: A New Open-Access Corpus of Conversational Bilingual Speech in Cantonese and English | May 1, 2020 | SentenceSpeech-to-Text | —Unverified | 0 | 0 |
| Strategies for improving low resource speech to text translation relying on pre-trained ASR models | May 31, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 | 0 |
| StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection | Jun 10, 2024 | Speech-to-TextSpeech-to-Text Translation | —Unverified | 0 | 0 |
| STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions | May 30, 2023 | AllAutomatic Speech Recognition | —Unverified | 0 | 0 |
| Subtitles to Segmentation: Improving Low-Resource Speech-to-TextTranslation Pipelines | May 1, 2020 | Cross-Lingual Information RetrievalInformation Retrieval | —Unverified | 0 | 0 |
| Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines | Oct 19, 2020 | Cross-Lingual Information RetrievalInformation Retrieval | —Unverified | 0 | 0 |
| Swiss German Speech to Text system evaluation | Jul 1, 2022 | Speech-to-Text | —Unverified | 0 | 0 |
| Syllable based DNN-HMM Cantonese Speech to Text System | Feb 13, 2024 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| Synthetic Query Generation using Large Language Models for Virtual Assistants | Jun 10, 2024 | Information Retrievalspeech-recognition | —Unverified | 0 | 0 |
| System Description on Automatic Simultaneous Translation Workshop | Jul 1, 2022 | SentenceSpeech-to-Text | —Unverified | 0 | 0 |
| TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS | Jun 10, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale | Feb 27, 2025 | AI AgentLarge Language Model | —Unverified | 0 | 0 |
| Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise | Jun 13, 2019 | Data AugmentationDecoder | —Unverified | 0 | 0 |
| The 2016 KIT IWSLT Speech-to-Text Systems for English and German | Dec 1, 2016 | Speech-to-Text | —Unverified | 0 | 0 |
| The 2017 KIT IWSLT Speech-to-Text Systems for English and German | Dec 1, 2017 | Speech-to-Text | —Unverified | 0 | 0 |
| The AISP-SJTU Simultaneous Translation System for IWSLT 2022 | May 1, 2022 | Speech-to-TextTranslation | —Unverified | 0 | 0 |
| The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation | May 1, 2022 | ChunkingSentence | —Unverified | 0 | 0 |
| The IWSLT 2019 Evaluation Campaign | Nov 1, 2019 | Speech-to-TextTranslation | —Unverified | 0 | 0 |
| The MIT Voice Name System | Mar 28, 2022 | Speech-to-Text | —Unverified | 0 | 0 |
| The Nós Project: Opening routes for the Galician language in the field of language technologies | Jun 1, 2022 | Cultural Vocal Bursts Intensity PredictionMachine Translation | —Unverified | 0 | 0 |
| The Spotify Podcast Dataset | Apr 8, 2020 | Speech-to-Text | —Unverified | 0 | 0 |
| The USFD Spoken Language Translation System for IWSLT 2014 | Sep 13, 2015 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 | Jul 1, 2021 | Data AugmentationSpeech-to-Text | —Unverified | 0 | 0 |