Machine Translation
Machine translation is the task of translating a sentence in a source language to a different target language.
Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.
One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.
( Image credit: Google seq2seq )
Papers
Showing 1–10 of 10752 papers
All datasetsWMT2014 English-GermanWMT2014 English-FrenchIWSLT2014 German-EnglishACESWMT2016 English-RomanianWMT2016 Romanian-EnglishWMT2014 German-EnglishIWSLT2015 German-EnglishWMT2016 English-GermanIWSLT2015 English-VietnameseIWSLT2015 English-GermanWMT2016 German-English
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | fast-noisy-channel-modeling | BLEU score | 40.3 | — | Unverified |
| 2 | FLAN 137B (few-shot, k=9) | BLEU score | 38.1 | — | Unverified |
| 3 | BART (TextBox 2.0) | BLEU-4 | 37.48 | — | Unverified |
| 4 | FLAN 137B (zero-shot) | BLEU score | 37.3 | — | Unverified |
| 5 | MLM pretraining | BLEU score | 35.3 | — | Unverified |
| 6 | GenTranslate | BLEU score | 33.5 | — | Unverified |
| 7 | Attentional encoder-decoder + BPE | BLEU score | 33.3 | — | Unverified |
| 8 | CMLM+LAT+4 iterations | BLEU score | 33.26 | — | Unverified |
| 9 | Levenshtein Transformer (distillation) | BLEU score | 33.26 | — | Unverified |
| 10 | Adaptively Sparse Transformer (1.5-entmax) | BLEU score | 33.1 | — | Unverified |