Speech-to-Text

Papers

Showing 201–250 of 403 papers

Title	Date	Tasks	Status
Robust Semantic Communications for Speech Transmission	Mar 8, 2024	Generative Adversarial Network, Semantic Communication	Unverified
Role of Intonation in Scoring Spoken English	Aug 23, 2018	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks	Jul 14, 2022	Speech-to-Text	Unverified
S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation	Jun 11, 2025	Reading Comprehension, Speech Synthesis	Unverified
SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation	Oct 13, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation	May 17, 2022	Representation Learning, Retrieval	Unverified
Self-Supervised Representations Improve End-to-End Speech Translation	Jun 22, 2020	Cross-Lingual Transfer, speech-recognition	Unverified
Semantic-aware Speech to Text Transmission with Redundancy Removal	Feb 7, 2022	Semantic Communication, Speech-to-Text	Unverified
Semantic MIMO Systems for Speech-to-Text Transmission	May 13, 2024	Semantic Communication, Speech-to-Text	Unverified
Semantic-preserved Communication System for Highly Efficient Speech Transmission	May 25, 2022	Semantic Communication, speech-recognition	Unverified
Simple and Effective Unsupervised Speech Translation	Oct 18, 2022	Domain Adaptation, Machine Translation	Unverified
SimulSeamless: FBK at IWSLT 2024 Simultaneous Speech Translation	Jun 20, 2024	Speech-to-Text, Speech-to-Text Translation	Unverified
SimulSpeech: End-to-End Simultaneous Speech to Text Translation	Jul 1, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Speak2Label: Using Domain Knowledge for Creating a Large Scale Driver Gaze Zone Estimation Dataset	Apr 13, 2020	Gaze Prediction, Speech-to-Text	Unverified
Speaker Independent Continuous Speech to Text Converter for Mobile Application	Jul 19, 2013	Action Detection, Activity Detection	Unverified
Speech: A Challenge to Digital Signal Processing Technology for Human-to-Computer Interaction	May 8, 2013	Speech Synthesis, Speech-to-Text	Unverified
SpeechAlign: a Framework for Speech Translation Alignment Evaluation	Sep 20, 2023	Speech-to-Text, Speech-to-Text Translation	Unverified
Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody?	Oct 31, 2024	Rhythm, speech-recognition	Unverified
Speech Recognition Web Services for Dutch	May 1, 2014	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Speech to Speech Translation with Translatotron: A State of the Art Review	Feb 9, 2025	speech-recognition, Speech Recognition	Unverified
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation	May 17, 2020	Computational Efficiency, speech-recognition	Unverified
Speech-to-Text Adapter and Speech-to-Entity Retriever Augmented LLMs for Speech Understanding	Jun 8, 2023	dialog state tracking, Language Modeling	Unverified
Speech-to-Text and Evaluation of Multiple Machine Translation Systems	Sep 1, 2022	Machine Translation, Speech-to-Text	Unverified
Speech to text and text to speech recognition systems - A review	Mar 17, 2018	speech-recognition, Speech Recognition	Unverified
Speech-to-Text Translation with Phoneme-Augmented CoT: Enhancing Cross-Lingual Transfer in Low-Resource Scenarios	May 30, 2025	Cross-Lingual Transfer, Phoneme Recognition	Unverified
Speech Translation with Speech Foundation Models and Large Language Models: What is There and What is Missing?	Feb 19, 2024	Speech-to-Text, Speech-to-Text Translation	Unverified
SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation	Nov 3, 2024	speech-recognition, Speech Recognition	Unverified
SpiCE: A New Open-Access Corpus of Conversational Bilingual Speech in Cantonese and English	May 1, 2020	Sentence, Speech-to-Text	Unverified
Strategies for improving low resource speech to text translation relying on pre-trained ASR models	May 31, 2023	Automatic Speech Recognition, Decoder	Unverified
StreamAtt: Direct Streaming Speech-to-Text Translation with Attention-based Audio History Selection	Jun 10, 2024	Speech-to-Text, Speech-to-Text Translation	Unverified
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions	May 30, 2023	All, Automatic Speech Recognition	Unverified
Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines	May 1, 2020	Cross-Lingual Information Retrieval, Information Retrieval	Unverified
Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines	Oct 19, 2020	Cross-Lingual Information Retrieval, Information Retrieval	Unverified
Swiss German Speech to Text system evaluation	Jul 1, 2022	Speech-to-Text	Unverified
Syllable based DNN-HMM Cantonese Speech to Text System	Feb 13, 2024	speech-recognition, Speech Recognition	Unverified
Synthetic Query Generation using Large Language Models for Virtual Assistants	Jun 10, 2024	Information Retrieval, speech-recognition	Unverified
System Description on Automatic Simultaneous Translation Workshop	Jul 1, 2022	Sentence, Speech-to-Text	Unverified
TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS	Jun 10, 2021	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Telephone Surveys Meet Conversational AI: Evaluating a LLM-Based Telephone Survey System at Scale	Feb 27, 2025	AI Agent, Large Language Model	Unverified
Telephonetic: Making Neural Language Models Robust to ASR and Semantic Noise	Jun 13, 2019	Data Augmentation, Decoder	Unverified
The 2016 KIT IWSLT Speech-to-Text Systems for English and German	Dec 1, 2016	Speech-to-Text	Unverified
The 2017 KIT IWSLT Speech-to-Text Systems for English and German	Dec 1, 2017	Speech-to-Text	Unverified
The AISP-SJTU Simultaneous Translation System for IWSLT 2022	May 1, 2022	Speech-to-Text, Translation	Unverified
The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation	May 1, 2022	Chunking, Sentence	Unverified
The IWSLT 2019 Evaluation Campaign	Nov 1, 2019	Speech-to-Text, Translation	Unverified
The MIT Voice Name System	Mar 28, 2022	Speech-to-Text	Unverified
The Nós Project: Opening routes for the Galician language in the field of language technologies	Jun 1, 2022	Cultural Vocal Bursts Intensity Prediction, Machine Translation	Unverified
The Spotify Podcast Dataset	Apr 8, 2020	Speech-to-Text	Unverified
The USFD Spoken Language Translation System for IWSLT 2014	Sep 13, 2015	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021	Jul 1, 2021	Data Augmentation, Speech-to-Text	Unverified

