Speech-to-Text

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 251–300 of 403 papers

Title	Date	Tasks	Status
The Warmup Dilemma: How Learning Rate Strategies Impact Speech-to-Text Model Convergence	May 29, 2025	Speech-to-Text	—Unverified
Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck	Oct 15, 2024	Speech-to-Text	—Unverified
Toward Automated Clinical Transcriptions	Sep 20, 2024	Speech-to-Text	—Unverified
Toward Joint Language Modeling for Speech Units and Text	Oct 12, 2023	Language ModelingLanguage Modelling	—Unverified
Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool	Jun 1, 2022	Sign Language TranslationSpeech-to-Text	—Unverified
Towards Robust Speech-to-Text Adversarial Attack	Mar 15, 2021	Adversarial AttackRoom Impulse Response (RIR)	—Unverified
Towards speech-to-text translation without speech recognition	Feb 13, 2017	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Towards the evaluation of automatic simultaneous speech translation from a communicative perspective	Mar 15, 2021	automatic-speech-translationInformativeness	—Unverified
Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders	Jul 2, 2024	Clusteringspeaker-diarization	—Unverified
Towards Unsupervised Speech-to-Text Translation	Nov 4, 2018	DenoisingLanguage Modeling	—Unverified
Training end-to-end speech-to-text models on mobile phones	Dec 7, 2021	CPUSpeech-to-Text	—Unverified
Transducer Consistency Regularization for Speech to Text Applications	Oct 9, 2024	Model OptimizationSpeech-to-Text	—Unverified
Transferable speech-to-text large language model alignment module	Jun 19, 2024	Language ModelingLanguage Modelling	—Unverified
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces	May 18, 2018	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Unsupervised Data Validation Methods for Efficient Model Training	Oct 10, 2024	Data Augmentationmodel	—Unverified
Unveiling the Role of Pretraining in Direct Speech Translation	Sep 26, 2024	Automatic Speech RecognitionDecoder	—Unverified
Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition	Jan 6, 2023	Domain AdaptationGPU	—Unverified
Using of heterogeneous corpora for training of an ASR system	Jun 1, 2017	speech-recognitionSpeech Recognition	—Unverified
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation	May 25, 2023	DecoderLanguage Modeling	—Unverified
Visual Features for Context-Aware Speech Recognition	Dec 1, 2017	Language ModelingLanguage Modelling	—Unverified
Voice based self help System: User Experience Vs Accuracy	Apr 7, 2015	speech-recognitionSpeech Recognition	—Unverified
VR-GPT: Visual Language Model for Intelligent Virtual Reality Applications	May 19, 2024	Language ModelingLanguage Modelling	—Unverified
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment	Apr 22, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts	Mar 6, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition	Sep 19, 2021	Language ModelingLanguage Modelling	—Unverified
WER-BERT: Automatic WER Estimation with BERT in a Balanced Ordinal Classification Paradigm	Jan 14, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
What shall we do with an hour of data? Speech recognition for the un- and under-served languages of Common Voice	May 10, 2021	speech-recognitionSpeech Recognition	—Unverified
When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation	Feb 1, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Which French speech recognition system for assistant robots?	Mar 4, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Whisper Finetuning on Nepali Language	Nov 19, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages	Dec 31, 2024	Automatic Speech RecognitionData Augmentation	—Unverified
With One Voice: Composing a Travel Voice Assistant from Re-purposed Models	Aug 4, 2021	BIG-bench Machine Learningnamed-entity-recognition	—Unverified
Worldly Wise (WoW) - Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering	Jun 1, 2021	Knowledge GraphsQuestion Answering	—Unverified
XTREME-S: Evaluating Cross-lingual Speech Representations	Mar 21, 2022	Representation LearningRetrieval	—Unverified
NUVA: A Naming Utterance Verifier for Aphasia Treatment	Feb 10, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect	May 7, 2021	BenchmarkingSpeech-to-Text	—Unverified
A Case Study on Filtering for End-to-End Speech Translation	Feb 2, 2024	Speech-to-Speech TranslationSpeech-to-Text	—Unverified
A combined approach to the analysis of speech conversations in a contact center domain	Mar 12, 2022	Speech-to-Text	—Unverified
A Comparative Study on End-to-end Speech to Text Translation	Nov 20, 2019	Speech-to-TextSpeech-to-Text Translation	—Unverified
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation	Oct 11, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Acquisition of high-quality images for camera calibration in robotics applications via speech prompts	Apr 15, 2025	Camera CalibrationSpeech-to-Text	—Unverified
AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation	Mar 18, 2025	DecoderSpeech-to-Text	—Unverified
A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research	May 1, 2016	SentenceSpeech-to-Text	—Unverified
Adversarial Attacks against Neural Networks in Audio Domain: Exploiting Principal Components	Jul 14, 2020	ClassificationGeneral Classification	—Unverified
AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR	Sep 30, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks	Oct 21, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
A.I. based Embedded Speech to Text Using Deepspeech	Feb 25, 2020	Raspberry Pi 3speech-recognition	—Unverified
AI-Based IVR	Aug 20, 2024	Speech SynthesisSpeech-to-Text	—Unverified
AI-Powered Immersive Assistance for Interactive Task Execution in Industrial Environments	Jul 12, 2024	Language ModelingLanguage Modelling	—Unverified
Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems	Oct 3, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified

Show:10 25 50

← PrevPage 6 of 9Next →

No leaderboard results yet.