SOTAVerified

Speech-to-Text Translation

Speech-to-text translation converts speech audio in one language into text in another language, using either an end-to-end model or a cascade of speech recognition followed by machine translation.

Papers

Showing 76-100 of 146 papers

| Title | Status | Hype |
|---|---|---|
| Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR | | 0 |
| Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation | | 0 |
| Towards speech-to-text translation without speech recognition | | 0 |
| Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems | | 0 |
| Towards the evaluation of automatic simultaneous speech translation from a communicative perspective | | 0 |
| Towards Unsupervised Speech-to-Text Translation | | 0 |
| Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning | | 0 |
| Low-Resource Speech-to-Text Translation | | 0 |
| M3ST: Mix at Three Levels for Speech Translation | | 0 |
| Analyzing ASR pretraining for low-resource speech-to-text translation | | 0 |
| MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation | | 0 |
| CTC Alignments Improve Autoregressive Translation | | 0 |
| Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer | | 0 |
| Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing | | 0 |
| NAIST Simultaneous Speech-to-Text Translation System for IWSLT 2022 | | 0 |
| NAIST Simultaneous Speech Translation System for IWSLT 2024 | | 0 |
| Cross-modal Contrastive Learning for Speech Translation | | 0 |
| Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision | | 0 |
| On decoder-only architecture for speech-to-text and large language model integration | | 0 |
| COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning | | 0 |
| Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling | | 0 |
| Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model | | 0 |
| Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces | | 0 |
| Prosody in Cascade and Direct Speech-to-Text Translation: a case study on Korean Wh-Phrases | | 0 |
| Compact Speech Translation Models via Discrete Speech Units Pretraining | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Task Modulation + Multitask Learning (ASR/MT) + Data Augmentation | Case-sensitive sacreBLEU | 28.88 | | Unverified |
| 2 | Wav2Vec2.0+mBART+Adaptors | Case-sensitive sacreBLEU | 28.22 | | Unverified |
| 3 | Transformer + Meta Learning (ASR/MT) + Data Augmentation | Case-sensitive sacreBLEU | 27.51 | | Unverified |
| 4 | Transformer with Adapters | Case-sensitive sacreBLEU | 24.63 | | Unverified |
| 5 | Dual-decoder Transformer | Case-sensitive sacreBLEU | 23.63 | | Unverified |
| 6 | Speechformer | Case-sensitive sacreBLEU | 23.6 | | Unverified |
| 7 | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 22.8 | | Unverified |
| 8 | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 22.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Transformer with Adapters | Case-sensitive sacreBLEU | 28.73 | | Unverified |
| 2 | Speechformer | Case-sensitive sacreBLEU | 28.5 | | Unverified |
| 3 | Dual-decoder Transformer | Case-sensitive sacreBLEU | 28.12 | | Unverified |
| 4 | Transformer + ASR Pretrain + SpecAug | Case-sensitive sacreBLEU | 27.4 | | Unverified |
| 5 | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 26.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Dual-decoder Transformer | Case-sensitive sacreBLEU | 33.45 | | Unverified |
| 2 | Transformer + ASR Pretrain + SpecAug | Case-sensitive sacreBLEU | 33.3 | | Unverified |
| 3 | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 32.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SeamlessM4T Large | BLEU | 30.6 | | Unverified |
| 2 | SeamlessM4T Medium | BLEU | 26.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SeamlessM4T Large | BLEU | 34.1 | | Unverified |
| 2 | SeamlessM4T Medium | BLEU | 29.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SeamlessM4T Large | BLEU | 21.5 | | Unverified |
| 2 | SeamlessM4T Medium | BLEU | 19.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SeamlessM4T Large | BLEU | 24 | | Unverified |
| 2 | SeamlessM4T Medium | BLEU | 20.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Transformer + ASR Pretrain + SpecAug | Case-insensitive sacreBLEU | 17.2 | | Unverified |
| 2 | Transformer + ASR Pretrain | Case-insensitive sacreBLEU | 16.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MediBeng Whisper Tiny | BLEU | 0.98 | | Unverified |
| 2 | Whisper Tiny | BLEU | 0.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Transformer with Adapters | SacreBLEU | 26.61 | | Unverified |
| 2 | Dual-decoder Transformer | SacreBLEU | 25.62 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Speechformer | Case-sensitive sacreBLEU | 27.7 | | Unverified |