Speech-to-Text

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–275 of 403 papers

Title	Date	Tasks	Status
The Warmup Dilemma: How Learning Rate Strategies Impact Speech-to-Text Model Convergence	May 29, 2025	Speech-to-Text	—Unverified
Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck	Oct 15, 2024	Speech-to-Text	—Unverified
Toward Automated Clinical Transcriptions	Sep 20, 2024	Speech-to-Text	—Unverified
Toward Joint Language Modeling for Speech Units and Text	Oct 12, 2023	Language ModelingLanguage Modelling	—Unverified
Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool	Jun 1, 2022	Sign Language TranslationSpeech-to-Text	—Unverified
Towards Robust Speech-to-Text Adversarial Attack	Mar 15, 2021	Adversarial AttackRoom Impulse Response (RIR)	—Unverified
Towards speech-to-text translation without speech recognition	Feb 13, 2017	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Towards the evaluation of automatic simultaneous speech translation from a communicative perspective	Mar 15, 2021	automatic-speech-translationInformativeness	—Unverified
Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders	Jul 2, 2024	Clusteringspeaker-diarization	—Unverified
Towards Unsupervised Speech-to-Text Translation	Nov 4, 2018	DenoisingLanguage Modeling	—Unverified
Training end-to-end speech-to-text models on mobile phones	Dec 7, 2021	CPUSpeech-to-Text	—Unverified
Transducer Consistency Regularization for Speech to Text Applications	Oct 9, 2024	Model OptimizationSpeech-to-Text	—Unverified
Transferable speech-to-text large language model alignment module	Jun 19, 2024	Language ModelingLanguage Modelling	—Unverified
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces	May 18, 2018	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Unsupervised Data Validation Methods for Efficient Model Training	Oct 10, 2024	Data Augmentationmodel	—Unverified
Unveiling the Role of Pretraining in Direct Speech Translation	Sep 26, 2024	Automatic Speech RecognitionDecoder	—Unverified
Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition	Jan 6, 2023	Domain AdaptationGPU	—Unverified
Using of heterogeneous corpora for training of an ASR system	Jun 1, 2017	speech-recognitionSpeech Recognition	—Unverified
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation	May 25, 2023	DecoderLanguage Modeling	—Unverified
Visual Features for Context-Aware Speech Recognition	Dec 1, 2017	Language ModelingLanguage Modelling	—Unverified
Voice based self help System: User Experience Vs Accuracy	Apr 7, 2015	speech-recognitionSpeech Recognition	—Unverified
VR-GPT: Visual Language Model for Intelligent Virtual Reality Applications	May 19, 2024	Language ModelingLanguage Modelling	—Unverified
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment	Apr 22, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts	Mar 6, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition	Sep 19, 2021	Language ModelingLanguage Modelling	—Unverified

Show:10 25 50

← PrevPage 11 of 17Next →

No leaderboard results yet.