Speech-to-Text

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–325 of 403 papers

Title	Date	Tasks	Status
ON-TRAC Consortium for End-to-End and Simultaneous Speech Translation Challenge Tasks at IWSLT 2020	May 24, 2020	Data AugmentationDecoder	—Unverified
Optimization of a Real-Time Wavelet-Based Algorithm for Improving Speech Intelligibility	Feb 5, 2022	Speech EnhancementSpeech-to-Text	—Unverified
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification	Feb 20, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction	Feb 10, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling	Jun 21, 2021	speech-recognitionSpeech Recognition	—Unverified
AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks	Oct 21, 2019	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Performance Comparison of Pre-trained Models for Speech-to-Text in Turkish: Whisper-Small and Wav2Vec2-XLS-R-300M	Jul 6, 2023	Speech-to-Text	—Unverified
PhantomSound: Black-Box, Query-Efficient Audio Adversarial Attack via Split-Second Phoneme Injection	Sep 13, 2023	Adversarial AttackSpeech-to-Text	—Unverified
Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili	Oct 29, 2022	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Polish Read Speech Corpus for Speech Tools and Services	Jun 1, 2017	Action DetectionActivity Detection	—Unverified
Prepending or Cross-Attention for Speech-to-Text? An Empirical Comparison	Jan 4, 2025	DecoderKnowledge Distillation	—Unverified
Prosody in Cascade and Direct Speech-to-Text Translation: a case study on Korean Wh-Phrases	Feb 1, 2024	speech-recognitionSpeech Recognition	—Unverified
Punctuation restoration in Swedish through fine-tuned KB-BERT	Feb 14, 2022	Language ModellingPunctuation Restoration	—Unverified
Pushing the performances of ASR models on English and Spanish accents	Dec 22, 2022	Speech-to-Text	—Unverified
Recent Advances in Direct Speech-to-text Translation	Jun 20, 2023	Data AugmentationDecoder	—Unverified
Representation Purification for End-to-End Speech Translation	Dec 5, 2024	Machine TranslationRhythm	—Unverified
Revisiting End-to-End Speech-to-Text Translation From Scratch	Jun 9, 2022	Decoderspeech-recognition	—Unverified
Revisiting the Entropy Semiring for Neural Speech Recognition	Dec 13, 2023	speech-recognitionSpeech Recognition	—Unverified
Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking	Mar 13, 2024	Chinese Spell CheckingIn-Context Learning	—Unverified
Robust Semantic Communications for Speech Transmission	Mar 8, 2024	Generative Adversarial NetworkSemantic Communication	—Unverified
Role of Intonation in Scoring Spoken English	Aug 23, 2018	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks	Jul 14, 2022	Speech-to-Text	—Unverified
S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation	Jun 11, 2025	Reading ComprehensionSpeech Synthesis	—Unverified
SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation	Oct 13, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation	May 17, 2022	Representation LearningRetrieval	—Unverified

Show:10 25 50

← PrevPage 13 of 17Next →

No leaderboard results yet.