SOTAVerified

Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Showing 90519100 of 10752 papers

TitleStatusHype
Orthographic and Morphological Processing for Persian-to-English Statistical Machine Translation0
Orthographic Features for Bilingual Lexicon Induction0
OSN-MDAD: Machine Translation Dataset for Arabic Multi-Dialectal Conversations on Online Social Media0
OSU Multimodal Machine Translation System Report0
Our Neural Machine Translation Systems for WAT 20190
Out-of-Order Decoding for Robust Neural Machine Translation0
Out-of-the-box Universal Romanization Tool uroman0
Overcoming a Theoretical Limitation of Self-Attention0
Overcoming Catastrophic Forgetting During Domain Adaptation of Neural Machine Translation0
Overcoming Resistance: The Normalization of an Amazonian Tribal Language0
Overcoming the Curse of Sentence Length for Neural Machine Translation using Automatic Segmentation0
Overcoming the Rare Word Problem for Low-Resource Language Pairs in Neural Machine Translation0
Overcoming Vocabulary Constraints with Pixel-level Fallback0
Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation0
Overview of the 1st Workshop on Asian Translation0
Overview of the 2013 ALTA Shared Task0
Overview of the 2nd Workshop on Asian Translation0
Overview of the 3rd Workshop on Asian Translation0
Overview of the 4th Workshop on Asian Translation0
Overview of the IWSLT 2017 Evaluation Campaign0
Overview of the Second BUCC Shared Task: Spotting Parallel Sentences in Comparable Corpora0
Overview of the Shared Task on Machine Translation in Dravidian Languages0
PaCCSS-IT: A Parallel Corpus of Complex-Simple Sentences for Automatic Text Simplification0
PaCMan : Parallel Corpus Management Workbench0
PAD: Towards Efficient Data Generation for Transfer Learning Using Phrase Alignment0
Pair Language Models for Deriving Alternative Pronunciations and Spellings from Pronunciation Dictionaries0
PairReranker: Pairwise Reranking for Natural Language Generation0
Pairwise Neural Machine Translation Evaluation0
PangeaMT v 3 – customise your own machine translation environment0
PanGu-Σ: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing0
Panlingua-KMI MT System for Similar Language Translation Task at WMT 20190
papago: A Machine Translation Service with Word Sense Disambiguation and Currency Conversion0
ParaBank: Monolingual Bitext Generation and Sentential Paraphrasing via Lexically-constrained Neural Machine Translation0
ParaCotta: Synthetic Multilingual Paraphrase Corpora from the Most Diverse Translation Sample Pair0
ParaDi: Dictionary of Paraphrases of Czech Complex Predicates with Light Verbs0
PARADIGM: Paraphrase Diagnostics through Grammar Matching0
PARADISE”:" Exploiting Parallel Data for Multilingual Sequence-to-Sequence Pretraining0
Parallel Aligned Treebanks at LDC: New Challenges Interfacing Existing Infrastructures0
Parallel Attention Forcing for Machine Translation0
Parallel Attention Mechanisms in Neural Machine Translation0
Parallel Corpora for bi-Directional Statistical Machine Translation for Seven Ethiopian Language Pairs0
Parallel Corpora for bi-lingual English-Ethiopian Languages Statistical Machine Translation0
Parallel Corpora in Mboshi (Bantu C25, Congo-Brazzaville)0
Parallel Corpus Filtering via Pre-trained Language Models0
Parallel Data, Tools and Interfaces in OPUS0
Parallel FDA5 for Fast Deployment of Accurate Statistical Machine Translation Systems0
Parallelizing Word2Vec in Shared and Distributed Memory0
Parallel resources for Tunisian Arabic Dialect Translation0
Parallel Sentence Compression0
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features0
Show:102550
← PrevPage 182 of 216Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Transformer Cycle (Rev)BLEU score35.14Unverified
2Noisy back-translationBLEU score35Unverified
3Transformer+Rep(Uni)BLEU score33.89Unverified
4T5-11BBLEU score32.1Unverified
5BiBERTBLEU score31.26Unverified
6Transformer + R-DropBLEU score30.91Unverified
7Bi-SimCutBLEU score30.78Unverified
8BERT-fused NMTBLEU score30.75Unverified
9Data Diversification - TransformerBLEU score30.7Unverified
10SimCutBLEU score30.56Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer+BT (ADMIN init)BLEU score46.4Unverified
2Noisy back-translationBLEU score45.6Unverified
3mRASP+Fine-TuneBLEU score44.3Unverified
4Transformer + R-DropBLEU score43.95Unverified
5Transformer (ADMIN init)BLEU score43.8Unverified
6AdminBLEU score43.8Unverified
7BERT-fused NMTBLEU score43.78Unverified
8MUSE(Paralllel Multi-scale Attention)BLEU score43.5Unverified
9T5BLEU score43.4Unverified
10Local Joint Self-attentionBLEU score43.3Unverified
#ModelMetricClaimedVerifiedStatus
1PiNMTBLEU score40.43Unverified
2BiBERTBLEU score38.61Unverified
3Bi-SimCutBLEU score38.37Unverified
4Cutoff + Relaxed Attention + LMBLEU score37.96Unverified
5DRDABLEU score37.95Unverified
6Transformer + R-Drop + CutoffBLEU score37.9Unverified
7SimCutBLEU score37.81Unverified
8Cutoff+KneeBLEU score37.78Unverified
9CutoffBLEU score37.6Unverified
10CipherDAugBLEU score37.53Unverified
#ModelMetricClaimedVerifiedStatus
1HWTSC-Teacher-SimScore19.97Unverified
2MS-COMET-22Score19.89Unverified
3MS-COMET-QE-22Score19.76Unverified
4KG-BERTScoreScore17.28Unverified
5metricx_xl_DA_2019Score17.17Unverified
6COMET-QEScore16.8Unverified
7COMET-22Score16.31Unverified
8UniTE-srcScore15.68Unverified
9UniTE-refScore15.38Unverified
10metricx_xxl_DA_2019Score15.24Unverified