SOTAVerified

Speech-to-Text

Papers

Showing 5175 of 403 papers

TitleStatusHype
A Large-Scale Chinese Multimodal NER Dataset with Speech CluesCode1
Kosp2e: Korean Speech to English Translation CorpusCode1
Towards Automatic Speech to Sign Language GenerationCode1
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech TranslationCode1
Learning Shared Semantic Space for Speech-to-Text TranslationCode1
End-to-end Speech Translation via Cross-modal Progressive TrainingCode1
IESTAC: English-Italian Parallel Corpus for End-to-End Speech-to-Text Machine TranslationCode1
"Listen, Understand and Translate": Triple Supervision Decouples End-to-end Speech-to-text TranslationCode1
Consecutive Decoding for Speech-to-text TranslationCode1
CoVoST 2 and Massively Multilingual Speech-to-Text TranslationCode1
CoVoST: A Diverse Multilingual Speech-To-Text Translation CorpusCode1
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization Approach for Deep Neural NetworksCode1
Stacked DeBERT: All Attention in Incomplete Data for Text ClassificationCode1
Common Voice: A Massively-Multilingual Speech CorpusCode1
Clotho: An Audio Captioning DatasetCode1
Deep Reinforcement Learning For Sequence to Sequence ModelsCode1
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization0
End-to-End Speech Translation for Low-Resource Languages Using Weakly Labeled Data0
I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs0
S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation0
Advancing STT for Low-Resource Real-World Speech0
Improving Language and Modality Transfer in Translation by Character-level Modeling0
Speech-to-Text Translation with Phoneme-Augmented CoT: Enhancing Cross-Lingual Transfer in Low-Resource Scenarios0
BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation SystemCode0
Show:102550
← PrevPage 3 of 17Next →

No leaderboard results yet.