Speech-to-Text Translation
Translate speech audio in one language into text in another language, using either a single end-to-end model or a cascade of automatic speech recognition (ASR) followed by machine translation (MT).
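The difference between the two approaches named above can be sketched in a few lines. This is a minimal illustration, not any listed paper's method: the `Audio` type, `make_cascade`, and the `toy_*` functions are hypothetical stand-ins for real models, and the German output is hard-coded.

```python
from typing import Callable, Sequence

# Assumed representation: a waveform as a sequence of float samples.
Audio = Sequence[float]
SpeechTranslator = Callable[[Audio], str]


def make_cascade(asr: Callable[[Audio], str],
                 mt: Callable[[str], str]) -> SpeechTranslator:
    """Build a cascade system: transcribe first, then translate the transcript."""
    def translate(audio: Audio) -> str:
        transcript = asr(audio)   # source-language text
        return mt(transcript)     # target-language text
    return translate


# Hypothetical toy components that return fixed strings.
def toy_asr(audio: Audio) -> str:
    return "hello world"


def toy_mt(text: str) -> str:
    return {"hello world": "hallo welt"}.get(text, text)


def toy_end_to_end(audio: Audio) -> str:
    # An end-to-end system maps audio to target text directly,
    # with no intermediate transcript exposed.
    return "hallo welt"


cascade = make_cascade(toy_asr, toy_mt)
samples = [0.0, 0.1, -0.1]
print(cascade(samples))
print(toy_end_to_end(samples))
```

Both systems expose the same audio-to-text interface, so they can be scored with the same metric. The cascade additionally produces an intermediate transcript, which is where ASR errors can propagate into the translation.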
Datasets: MuST-C EN->DE, MuST-C EN->ES, MuST-C EN->FR, CoVoST 2 eng-X, CoVoST 2 X-eng, FLEURS eng-X, FLEURS X-eng, libri-trans, MediBeng, MuST-C, MuST-C EN->NL
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Task Modulation + Multitask Learning(ASR/MT) + Data Augmentation | Case-sensitive sacreBLEU | 28.88 | — | Unverified |
| 2 | Wav2Vec2.0+mBART+Adaptors | Case-sensitive sacreBLEU | 28.22 | — | Unverified |
| 3 | Transformer + Meta Learning(ASR/MT) + Data Augmentation | Case-sensitive sacreBLEU | 27.51 | — | Unverified |
| 4 | Transformer with Adapters | Case-sensitive sacreBLEU | 24.63 | — | Unverified |
| 5 | Dual-decoder Transformer | Case-sensitive sacreBLEU | 23.63 | — | Unverified |
| 6 | Speechformer | Case-sensitive sacreBLEU | 23.6 | — | Unverified |
| 7 | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 22.8 | — | Unverified |
| 8 | Transformer + ASR Pretrain | Case-sensitive sacreBLEU | 22.7 | — | Unverified |