Speech-to-Text

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 403 papers

Title	Date	Tasks	Status
Contextualized Translation of Automatically Segmented Speech	Aug 5, 2020	SegmentationSentence	—Unverified
Conversational Recommendation System using NLP and Sentiment Analysis	May 17, 2025	Conversational RecommendationDynamic Time Warping	—Unverified
Corpus Creation and Evaluation for Speech-to-Text and Speech Translation	Aug 1, 2021	Machine TranslationSpeech-to-Text	—Unverified
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning	Nov 3, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving	Jun 16, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Crossing the SSH Bridge with Interview Data	May 1, 2020	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Cross-modal Contrastive Learning for Speech Translation	Dec 17, 2021	Contrastive LearningRetrieval	—Unverified
Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing	Sep 27, 2023	DecoderMachine Translation	—Unverified
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models	Oct 24, 2020	Cross-Lingual TransferDecoder	—Unverified
CTC Alignments Improve Autoregressive Translation	Oct 11, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Subtitles to Segmentation: Improving Low-Resource Speech-to-TextTranslation Pipelines	May 1, 2020	Cross-Lingual Information RetrievalInformation Retrieval	—Unverified
Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines	Oct 19, 2020	Cross-Lingual Information RetrievalInformation Retrieval	—Unverified
Swiss German Speech to Text system evaluation	Jul 1, 2022	Speech-to-Text	—Unverified
Syllable based DNN-HMM Cantonese Speech to Text System	Feb 13, 2024	speech-recognitionSpeech Recognition	—Unverified
Synthetic Query Generation using Large Language Models for Virtual Assistants	Jun 10, 2024	Information Retrievalspeech-recognition	—Unverified
System Description on Automatic Simultaneous Translation Workshop	Jul 1, 2022	SentenceSpeech-to-Text	—Unverified
TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS	Jun 10, 2021	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale	Feb 27, 2025	AI AgentLarge Language Model	—Unverified
Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise	Jun 13, 2019	Data AugmentationDecoder	—Unverified
The 2016 KIT IWSLT Speech-to-Text Systems for English and German	Dec 1, 2016	Speech-to-Text	—Unverified
The 2017 KIT IWSLT Speech-to-Text Systems for English and German	Dec 1, 2017	Speech-to-Text	—Unverified
The AISP-SJTU Simultaneous Translation System for IWSLT 2022	May 1, 2022	Speech-to-TextTranslation	—Unverified
The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation	May 1, 2022	ChunkingSentence	—Unverified
The IWSLT 2019 Evaluation Campaign	Nov 1, 2019	Speech-to-TextTranslation	—Unverified
The MIT Voice Name System	Mar 28, 2022	Speech-to-Text	—Unverified
The Nós Project: Opening routes for the Galician language in the field of language technologies	Jun 1, 2022	Cultural Vocal Bursts Intensity PredictionMachine Translation	—Unverified
The Spotify Podcast Dataset	Apr 8, 2020	Speech-to-Text	—Unverified
The USFD Spoken Language Translation System for IWSLT 2014	Sep 13, 2015	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021	Jul 1, 2021	Data AugmentationSpeech-to-Text	—Unverified
The Warmup Dilemma: How Learning Rate Strategies Impact Speech-to-Text Model Convergence	May 29, 2025	Speech-to-Text	—Unverified
Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck	Oct 15, 2024	Speech-to-Text	—Unverified
Toward Automated Clinical Transcriptions	Sep 20, 2024	Speech-to-Text	—Unverified
Toward Joint Language Modeling for Speech Units and Text	Oct 12, 2023	Language ModelingLanguage Modelling	—Unverified
Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool	Jun 1, 2022	Sign Language TranslationSpeech-to-Text	—Unverified
Towards Robust Speech-to-Text Adversarial Attack	Mar 15, 2021	Adversarial AttackRoom Impulse Response (RIR)	—Unverified
Towards speech-to-text translation without speech recognition	Feb 13, 2017	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Towards the evaluation of automatic simultaneous speech translation from a communicative perspective	Mar 15, 2021	automatic-speech-translationInformativeness	—Unverified
Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders	Jul 2, 2024	Clusteringspeaker-diarization	—Unverified
Towards Unsupervised Speech-to-Text Translation	Nov 4, 2018	DenoisingLanguage Modeling	—Unverified
Training end-to-end speech-to-text models on mobile phones	Dec 7, 2021	CPUSpeech-to-Text	—Unverified
Transducer Consistency Regularization for Speech to Text Applications	Oct 9, 2024	Model OptimizationSpeech-to-Text	—Unverified
Transferable speech-to-text large language model alignment module	Jun 19, 2024	Language ModelingLanguage Modelling	—Unverified
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces	May 18, 2018	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Unsupervised Data Validation Methods for Efficient Model Training	Oct 10, 2024	Data Augmentationmodel	—Unverified
Unveiling the Role of Pretraining in Direct Speech Translation	Sep 26, 2024	Automatic Speech RecognitionDecoder	—Unverified
Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition	Jan 6, 2023	Domain AdaptationGPU	—Unverified
Using of heterogeneous corpora for training of an ASR system	Jun 1, 2017	speech-recognitionSpeech Recognition	—Unverified
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation	May 25, 2023	DecoderLanguage Modeling	—Unverified
Visual Features for Context-Aware Speech Recognition	Dec 1, 2017	Language ModelingLanguage Modelling	—Unverified
Voice based self help System: User Experience Vs Accuracy	Apr 7, 2015	speech-recognitionSpeech Recognition	—Unverified

Show:10 25 50

← PrevPage 5 of 9Next →

No leaderboard results yet.