SOTAVerified

Speech-to-Text Translation

Speech-to-text translation converts spoken audio in one language into text in another language, either end-to-end with a single model or with a cascade that runs speech recognition followed by machine translation.
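As a rough illustration of the two approaches named above, the sketch below contrasts a cascade (speech recognition, then text translation) with an end-to-end system (one model from audio to target text). The `Audio` alias, the function names, and the toy components are all assumptions made up for this sketch. A real system would plug in trained ASR, MT, or speech translation models.

```python
from typing import Callable, List

# Hypothetical stand-in for a waveform: a list of float samples.
Audio = List[float]


def cascade_st(audio: Audio,
               asr: Callable[[Audio], str],
               mt: Callable[[str], str]) -> str:
    """Cascade: transcribe in the source language, then translate the transcript."""
    transcript = asr(audio)   # intermediate source-language text
    return mt(transcript)     # target-language text


def end_to_end_st(audio: Audio, st_model: Callable[[Audio], str]) -> str:
    """End-to-end: a single model maps audio directly to target-language text."""
    return st_model(audio)


# Toy components (assumptions for illustration only, not real models).
def toy_asr(audio: Audio) -> str:
    return "hallo welt"


def toy_mt(text: str) -> str:
    lexicon = {"hallo": "hello", "welt": "world"}
    return " ".join(lexicon.get(word, word) for word in text.split())


def toy_st(audio: Audio) -> str:
    return "hello world"


waveform = [0.0, 0.1, -0.1]
cascade_out = cascade_st(waveform, toy_asr, toy_mt)
e2e_out = end_to_end_st(waveform, toy_st)
print(cascade_out)
print(e2e_out)
```

The cascade exposes an intermediate transcript, so recognition errors can be inspected but also propagate into the translation step. The end-to-end variant has no such intermediate text.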

Papers

Showing 101–146 of 146 papers

| Title | Status | Hype |
| --- | --- | --- |
| TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS | | 0 |
| Lightweight Adapter Tuning for Multilingual Speech Translation | Code | 1 |
| Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation | Code | 1 |
| End-to-End Speech Translation with Pre-trained Models and Adapters: UPC at IWSLT 2021 | Code | 1 |
| Learning Shared Semantic Space for Speech-to-Text Translation | Code | 1 |
| End-to-end Speech Translation via Cross-modal Progressive Training | Code | 1 |
| Towards Measuring Fairness in AI: the Casual Conversations Dataset | | 0 |
| Towards the evaluation of automatic simultaneous speech translation from a communicative perspective | | 0 |
| NeurST: Neural Speech Translation Toolkit | Code | 0 |
| Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation | Code | 1 |
| Bridging the Modality Gap for Speech-to-Text Translation | | 0 |
| MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation | | 0 |
| Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines | | 0 |
| fairseq S2T: Fast Speech-to-Text Modeling with fairseq | Code | 0 |
| Consecutive Decoding for Speech-to-text Translation | Code | 1 |
| "Listen, Understand and Translate": Triple Supervision Decouples End-to-end Speech-to-text Translation | Code | 1 |
| Contextualized Translation of Automatically Segmented Speech | Code | 0 |
| CoVoST 2 and Massively Multilingual Speech-to-Text Translation | Code | 1 |
| End-to-End Offline Speech Translation System for IWSLT 2020 using Modality Agnostic Meta-Learning | | 0 |
| SimulSpeech: End-to-End Simultaneous Speech to Text Translation | | 0 |
| Self-Supervised Representations Improve End-to-End Speech Translation | | 0 |
| Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation | | 0 |
| Subtitles to Segmentation: Improving Low-Resource Speech-to-Text Translation Pipelines | | 0 |
| CoVoST: A Diverse Multilingual Speech-To-Text Translation Corpus | Code | 1 |
| FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization Approach for Deep Neural Networks | Code | 1 |
| Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding | Code | 0 |
| A Comparative Study on End-to-end Speech to Text Translation | | 0 |
| Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning | | 0 |
| Europarl-ST: A Multilingual Corpus For Speech Translation Of Parliamentary Debates | | 0 |
| Analyzing ASR pretraining for low-resource speech-to-text translation | | 0 |
| Instance-Based Model Adaptation For Direct Speech Translation | | 0 |
| Cross-lingual topic prediction for speech using translations | | 0 |
| Enhancing Transformer for End-to-end Speech-to-Text Translation | | 0 |
| Direct speech-to-speech translation with a sequence-to-sequence model | Code | 0 |
| Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation | | 0 |
| Towards Unsupervised Speech-to-Text Translation | | 0 |
| Pre-training on high-resource speech recognition improves low-resource speech-to-text translation | Code | 0 |
| Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces | | 0 |
| Low-Resource Speech-to-Text Translation | | 0 |
| End-to-End Automatic Speech Translation of Audiobooks | Code | 0 |
| Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation | Code | 0 |
| Interpreting Strategies Annotation in the WAW Corpus | | 0 |
| Using of heterogeneous corpora for training of an ASR system | | 0 |
| Towards speech-to-text translation without speech recognition | | 0 |
| Listen and Translate: A Proof of Concept for End-to-End Speech-to-Text Translation | Code | 0 |
| The USFD Spoken Language Translation System for IWSLT 2014 | | 0 |

Benchmark Results

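| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Task Modulation + Multitask Learning(ASR/MT) + Data Augmentation | Case-sensitive sacreBLEU | 28.88 | | Unverified |
| 2 | Wav2Vec2.0+mBART+Adaptors | Case-sensitive sacreBLEU | 28.22 | | Unverified |
| 3 | Transformer + Meta Learning(ASR/MT) + Data Augmentation | Case-sensitive sacreBLEU | 27.51 | | Unverified |
| 4 | Transformer with Adapters | Case-sensitive sacreBLEU | 24.63 | | Unverified |
| 5 | Dual-decoder Transformer | Case-sensitive sacreBLEU | 23.63 | | Unverified |
| 6 | Speechformer | Case-sensitive sacreBLEU | 23.6 | | Unverified |
| 7 | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 22.8 | | Unverified |
| 8 | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 22.7 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Transformer with Adapters | Case-sensitive sacreBLEU | 28.73 | | Unverified |
| 2 | Speechformer | Case-sensitive sacreBLEU | 28.5 | | Unverified |
| 3 | Dual-decoder Transformer | Case-sensitive sacreBLEU | 28.12 | | Unverified |
| 4 | Transformer + ASR Pretrain + SpecAug | Case-sensitive sacreBLEU | 27.4 | | Unverified |
| 5 | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 26.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Dual-decoder Transformer | Case-sensitive sacreBLEU | 33.45 | | Unverified |
| 2 | Transformer + ASR Pretrain + SpecAug | Case-sensitive sacreBLEU | 33.3 | | Unverified |
| 3 | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 32.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SeamlessM4T Large | BLEU | 30.6 | | Unverified |
| 2 | SeamlessM4T Medium | BLEU | 26.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SeamlessM4T Large | BLEU | 34.1 | | Unverified |
| 2 | SeamlessM4T Medium | BLEU | 29.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SeamlessM4T Large | BLEU | 21.5 | | Unverified |
| 2 | SeamlessM4T Medium | BLEU | 19.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SeamlessM4T Large | BLEU | 24 | | Unverified |
| 2 | SeamlessM4T Medium | BLEU | 20.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Transformer + ASR Pretrain + SpecAug | Case-insensitive sacreBLEU | 17.2 | | Unverified |
| 2 | Transformer + ASR Pretrain | Case-insensitive sacreBLEU | 16.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MediBeng Whisper Tiny | Bleu | 0.98 | | Unverified |
| 2 | Whisper Tiny | Bleu | 0.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Transformer with Adapters | SacreBLEU | 26.61 | | Unverified |
| 2 | Dual-decoder Transformer | SacreBLEU | 25.62 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Speechformer | Case-sensitive sacreBLEU | 27.7 | | Unverified |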