SOTAVerified

Speech-to-Text Translation

Translate audio signals of speech in one language into text in a foreign language, either in an end-to-end or cascade manner.

Papers

Showing 101125 of 146 papers

TitleStatusHype
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation0
XTREME-S: Evaluating Cross-lingual Speech Representations0
Cross-modal Contrastive Learning for Speech Translation0
Improve Sinhala Speech Recognition Through e2e LF-MMI Model0
An Experiment on Speech-to-Text Translation Systems for Manipuri to English on Low Resource Setting0
Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems0
Speechformer: Reducing Information Loss in Direct Speech TranslationCode0
Infusing Future Information into Monotonic Attention Through Language ModelsCode0
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task0
Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling0
Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR0
TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS0
Towards Measuring Fairness in AI: the Casual Conversations Dataset0
Towards the evaluation of automatic simultaneous speech translation from a communicative perspective0
NeurST: Neural Speech Translation ToolkitCode0
Bridging the Modality Gap for Speech-to-Text Translation0
MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation0
Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines0
fairseq S2T: Fast Speech-to-Text Modeling with fairseqCode0
Contextualized Translation of Automatically Segmented SpeechCode0
End-to-End Offline Speech Translation System for IWSLT 2020 using Modality Agnostic Meta-Learning0
SimulSpeech: End-to-End Simultaneous Speech to Text Translation0
Self-Supervised Representations Improve End-to-End Speech Translation0
Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation0
Subtitles to Segmentation: Improving Low-Resource Speech-to-TextTranslation Pipelines0
Show:102550
← PrevPage 5 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Task Modulation + Multitask Learning(ASR/MT) + Data AugmentationCase-sensitive sacreBLEU28.88Unverified
2Wav2Vec2.0+mBART+AdaptorsCase-sensitive sacreBLEU28.22Unverified
3Transformer + Meta Learning(ASR/MT) + Data AugmentationCase-sensitive sacreBLEU27.51Unverified
4Transformer with AdaptersCase-sensitive sacreBLEU24.63Unverified
5Dual-decoder TransformerCase-sensitive sacreBLEU23.63Unverified
6SpeechformerCase-sensitive sacreBLEU23.6Unverified
7Transformer + ASR PretrainCase-sensitive sacreBLEU22.8Unverified
8Transformer + ASR PretrainCase-sensitive sacreBLEU22.7Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer with AdaptersCase-sensitive sacreBLEU28.73Unverified
2SpeechformerCase-sensitive sacreBLEU28.5Unverified
3Dual-decoder TransformerCase-sensitive sacreBLEU28.12Unverified
4Transformer + ASR Pretrain + SpecAugCase-sensitive sacreBLEU27.4Unverified
5Transformer + ASR PretrainCase-sensitive sacreBLEU26.8Unverified
#ModelMetricClaimedVerifiedStatus
1Dual-decoder TransformerCase-sensitive sacreBLEU33.45Unverified
2Transformer + ASR Pretrain + SpecAugCase-sensitive sacreBLEU33.3Unverified
3Transformer + ASR PretrainCase-sensitive sacreBLEU32.3Unverified
#ModelMetricClaimedVerifiedStatus
1SeamlessM4T LargeBLEU30.6Unverified
2SeamlessM4T MediumBLEU26.6Unverified
#ModelMetricClaimedVerifiedStatus
1SeamlessM4T LargeBLEU34.1Unverified
2SeamlessM4T MediumBLEU29.8Unverified
#ModelMetricClaimedVerifiedStatus
1SeamlessM4T LargeBLEU21.5Unverified
2SeamlessM4T MediumBLEU19.2Unverified
#ModelMetricClaimedVerifiedStatus
1SeamlessM4T LargeBLEU24Unverified
2SeamlessM4T MediumBLEU20.9Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer + ASR Pretrain + SpecAugCase-insensitive sacreBLEU17.2Unverified
2Transformer + ASR PretrainCase-insensitive sacreBLEU16.5Unverified
#ModelMetricClaimedVerifiedStatus
1MediBeng Whisper TinyBleu0.98Unverified
2Whisper TinyBleu0.3Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer with AdaptersSacreBLEU26.61Unverified
2Dual-decoder TransformerSacreBLEU25.62Unverified
#ModelMetricClaimedVerifiedStatus
1SpeechformerCase-sensitive sacreBLEU27.7Unverified