| The Warmup Dilemma: How Learning Rate Strategies Impact Speech-to-Text Model Convergence | May 29, 2025 | Speech-to-Text | —Unverified | 0 | 0 |
| Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck | Oct 15, 2024 | Speech-to-Text | —Unverified | 0 | 0 |
| Toward Automated Clinical Transcriptions | Sep 20, 2024 | Speech-to-Text | —Unverified | 0 | 0 |
| Toward Joint Language Modeling for Speech Units and Text | Oct 12, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool | Jun 1, 2022 | Sign Language Translation, Speech-to-Text | —Unverified | 0 | 0 |
| Towards Robust Speech-to-Text Adversarial Attack | Mar 15, 2021 | Adversarial Attack, Room Impulse Response (RIR) | —Unverified | 0 | 0 |
| Towards speech-to-text translation without speech recognition | Feb 13, 2017 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Towards the evaluation of automatic simultaneous speech translation from a communicative perspective | Mar 15, 2021 | automatic-speech-translation, Informativeness | —Unverified | 0 | 0 |
| Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders | Jul 2, 2024 | Clustering, speaker-diarization | —Unverified | 0 | 0 |
| Towards Unsupervised Speech-to-Text Translation | Nov 4, 2018 | Denoising, Language Modeling | —Unverified | 0 | 0 |
| Training end-to-end speech-to-text models on mobile phones | Dec 7, 2021 | CPU, Speech-to-Text | —Unverified | 0 | 0 |
| Transducer Consistency Regularization for Speech to Text Applications | Oct 9, 2024 | Model Optimization, Speech-to-Text | —Unverified | 0 | 0 |
| Transferable speech-to-text large language model alignment module | Jun 19, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces | May 18, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Unsupervised Data Validation Methods for Efficient Model Training | Oct 10, 2024 | Data Augmentation, model | —Unverified | 0 | 0 |
| Unveiling the Role of Pretraining in Direct Speech Translation | Sep 26, 2024 | Automatic Speech Recognition, Decoder | —Unverified | 0 | 0 |
| Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition | Jan 6, 2023 | Domain Adaptation, GPU | —Unverified | 0 | 0 |
| Using of heterogeneous corpora for training of an ASR system | Jun 1, 2017 | speech-recognition, Speech Recognition | —Unverified | 0 | 0 |
| VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation | May 25, 2023 | Decoder, Language Modeling | —Unverified | 0 | 0 |
| Visual Features for Context-Aware Speech Recognition | Dec 1, 2017 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Voice based self help System: User Experience Vs Accuracy | Apr 7, 2015 | speech-recognition, Speech Recognition | —Unverified | 0 | 0 |
| VR-GPT: Visual Language Model for Intelligent Virtual Reality Applications | May 19, 2024 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment | Apr 22, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts | Mar 6, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition | Sep 19, 2021 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |