SOTAVerified

Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches to machine translation range from rule-based to statistical to neural. More recently, attention-based encoder-decoder architectures such as the Transformer have brought major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.
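To make the BLEU metric mentioned above more concrete, here is a minimal sketch of a sentence-level BLEU computation: clipped n-gram precisions for n = 1 to 4, combined by a geometric mean and scaled by a brevity penalty. The function names `ngrams` and `sentence_bleu` are illustrative, and the sketch assumes whitespace tokenization, a single reference, and no smoothing. Reported benchmark scores are typically corpus-level and computed with standardized tooling, so this sketch is not expected to reproduce them.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(hypothesis, reference, max_n=4):
    """Simplified BLEU: single reference, whitespace tokens, no smoothing (assumptions)."""
    hyp = hypothesis.split()
    ref = reference.split()
    if not hyp:
        return 0.0

    log_precisions = []
    for n in range(1, max_n + 1):
        hyp_counts = ngrams(hyp, n)
        ref_counts = ngrams(ref, n)
        # Clipped matches: a hypothesis n-gram is credited at most as many
        # times as it occurs in the reference.
        matches = sum(min(count, ref_counts[gram]) for gram, count in hyp_counts.items())
        total = sum(hyp_counts.values())
        # Without smoothing, any zero precision makes the whole score zero.
        if total == 0 or matches == 0:
            return 0.0
        log_precisions.append(math.log(matches / total))

    # Brevity penalty: penalize hypotheses shorter than the reference.
    bp = 1.0 if len(hyp) >= len(ref) else math.exp(1 - len(ref) / len(hyp))
    # Geometric mean of the n-gram precisions, scaled by the brevity penalty.
    return bp * math.exp(sum(log_precisions) / max_n)
```

Under these assumptions, a hypothesis identical to its reference scores 1.0, and a hypothesis that is a strict prefix of the reference is penalized only through the brevity term. Leaderboards usually report the score multiplied by 100.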

(Image credit: Google seq2seq)

Papers

Showing 9301-9350 of 10752 papers

| Title | Status | Hype |
| --- | --- | --- |
| Transforming trees into hedges and parsing with "hedgebank" grammars | | 0 |
| Translation Class Instruction as Collaboration in the Act of Translation | | 0 |
| Two Knives Cut Better Than One: Chinese Word Segmentation with Dual Decomposition | | 0 |
| Translation Assistance by Translation of L1 Fragments in an L2 Context | | 0 |
| The KIT-LIMSI Translation System for WMT 2014 | | 0 |
| The UA-Prompsit hybrid machine translation system for the 2014 Workshop on Statistical Machine Translation | | 0 |
| Tolerant BLEU: a Submission to the WMT14 Metrics Task | | 0 |
| VERTa participation in the WMT14 Metrics Task | | 0 |
| Towards a discourse relation-aware approach for Chinese-English machine translation | | 0 |
| Yandex School of Data Analysis Russian-English Machine Translation System for WMT14 | | 0 |
| Toward Better Chinese Word Segmentation for SMT via Bilingual Constraints | | 0 |
| Tuning a Grammar Correction System for Increased Precision | | 0 |
| Unsupervised Adaptation for Statistical Machine Translation | | 0 |
| The IIT Bombay Hindi-English Translation System at WMT 2014 | | 0 |
| Word Segmentation of Informal Arabic with Domain Adaptation | | 0 |
| Unsupervised Dependency Parsing with Transferring Distribution via Parallel Guidance and Entropy Regularization | | 0 |
| Unsupervised Techniques for Extracting and Clustering Complex Events in News | | 0 |
| Using Discourse Structure Improves Machine Translation Evaluation | | 0 |
| Unsupervised Feature Learning for Visual Sign Language Identification | | 0 |
| Using Comparable Corpora to Adapt MT Models to New Domains | | 0 |
| Treebank Translation for Cross-Lingual Parser Induction | | 0 |
| Unsupervised Morphology-Based Vocabulary Expansion | | 0 |
| Machine Translation Model based on Non-parallel Corpus and Semi-supervised Transductive Learning | | 0 |
| Fighting Authorship Linkability with Crowdsourcing | | 0 |
| Developing Corpus-based Translation Methods between Informal and Formal Mathematics: Project Description | | 0 |
| Coordinate System Selection for Minimum Error Rate Training in Statistical Machine Translation | | 0 |
| Learning Alternative Name Spellings | | 0 |
| Analysis Tool for UNL-Based Knowledge Representation | | 0 |
| Boosting the creation of a treebank | | 0 |
| Bootstrapping an Italian VerbNet: data-driven analysis of verb alternations | | 0 |
| PropBank: Semantics of New Predicate Types | | 0 |
| SWIFT Aligner, A Multifunctional Tool for Parallel Corpora: Visualization, Word Alignment, and (Morpho)-Syntactic Cross-Language Transfer | | 0 |
| Named Entity Tagging a Very Large Unbalanced Corpus: Training and Evaluating NE Classifiers | | 0 |
| Bilingual Dictionary Construction with Transliteration Filtering | | 0 |
| Bilingual dictionaries for all EU languages | Code | 0 |
| Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech | | 0 |
| Multiword Expressions in Machine Translation | | 0 |
| Bootstrapping Open-Source English-Bulgarian Computational Dictionary | | 0 |
| Cross-Language Authorship Attribution | | 0 |
| Shata-Anuvadak: Tackling Multiway Translation of Indian Languages | | 0 |
| Sharing resources between free/open-source rule-based machine translation systems: Grammatical Framework and Apertium | | 0 |
| All Fragments Count in Parser Evaluation | | 0 |
| Multilingual Test Sets for Machine Translation of Search Queries for Cross-Lingual Information Retrieval in the Medical Domain | | 0 |
| Creating and using large monolingual parallel corpora for sentential paraphrase generation | | 0 |
| Creating a massively parallel Bible corpus | | 0 |
| Benchmarking of English-Hindi parallel corpora | | 0 |
| On Complex Word Alignment Configurations | | 0 |
| Aggregation methods for efficient collocation detection | | 0 |
| Dual Subtitles as Parallel Corpora | | 0 |
| Quality Estimation for Synthetic Parallel Data Generation | | 0 |

Benchmark Results

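| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Transformer Cycle (Rev) | BLEU score | 35.14 | | Unverified |
| 2 | Noisy back-translation | BLEU score | 35 | | Unverified |
| 3 | Transformer+Rep(Uni) | BLEU score | 33.89 | | Unverified |
| 4 | T5-11B | BLEU score | 32.1 | | Unverified |
| 5 | BiBERT | BLEU score | 31.26 | | Unverified |
| 6 | Transformer + R-Drop | BLEU score | 30.91 | | Unverified |
| 7 | Bi-SimCut | BLEU score | 30.78 | | Unverified |
| 8 | BERT-fused NMT | BLEU score | 30.75 | | Unverified |
| 9 | Data Diversification - Transformer | BLEU score | 30.7 | | Unverified |
| 10 | SimCut | BLEU score | 30.56 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Transformer+BT (ADMIN init) | BLEU score | 46.4 | | Unverified |
| 2 | Noisy back-translation | BLEU score | 45.6 | | Unverified |
| 3 | mRASP+Fine-Tune | BLEU score | 44.3 | | Unverified |
| 4 | Transformer + R-Drop | BLEU score | 43.95 | | Unverified |
| 5 | Transformer (ADMIN init) | BLEU score | 43.8 | | Unverified |
| 6 | Admin | BLEU score | 43.8 | | Unverified |
| 7 | BERT-fused NMT | BLEU score | 43.78 | | Unverified |
| 8 | MUSE (Parallel Multi-scale Attention) | BLEU score | 43.5 | | Unverified |
| 9 | T5 | BLEU score | 43.4 | | Unverified |
| 10 | Local Joint Self-attention | BLEU score | 43.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PiNMT | BLEU score | 40.43 | | Unverified |
| 2 | BiBERT | BLEU score | 38.61 | | Unverified |
| 3 | Bi-SimCut | BLEU score | 38.37 | | Unverified |
| 4 | Cutoff + Relaxed Attention + LM | BLEU score | 37.96 | | Unverified |
| 5 | DRDA | BLEU score | 37.95 | | Unverified |
| 6 | Transformer + R-Drop + Cutoff | BLEU score | 37.9 | | Unverified |
| 7 | SimCut | BLEU score | 37.81 | | Unverified |
| 8 | Cutoff+Knee | BLEU score | 37.78 | | Unverified |
| 9 | Cutoff | BLEU score | 37.6 | | Unverified |
| 10 | CipherDAug | BLEU score | 37.53 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | HWTSC-Teacher-Sim | Score | 19.97 | | Unverified |
| 2 | MS-COMET-22 | Score | 19.89 | | Unverified |
| 3 | MS-COMET-QE-22 | Score | 19.76 | | Unverified |
| 4 | KG-BERTScore | Score | 17.28 | | Unverified |
| 5 | metricx_xl_DA_2019 | Score | 17.17 | | Unverified |
| 6 | COMET-QE | Score | 16.8 | | Unverified |
| 7 | COMET-22 | Score | 16.31 | | Unverified |
| 8 | UniTE-src | Score | 15.68 | | Unverified |
| 9 | UniTE-ref | Score | 15.38 | | Unverified |
| 10 | metricx_xxl_DA_2019 | Score | 15.24 | | Unverified |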