SOTAVerified

Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Showing 1055110600 of 10752 papers

TitleStatusHype
Error profiling for evaluation of machine-translated text: a Polish-English case study0
Assessing the Comparability of News Texts0
Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier0
Latvian and Lithuanian Named Entity Recognition with TildeNER0
Chinese Whispers: Cooperative Paraphrase Acquisition0
PET: a Tool for Post-editing and Assessing Machine Translation0
Linguistic Analysis Processing Line for Bulgarian0
A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora0
Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques0
CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations0
Le Petit Prince in UNL0
LDC Language Resource Database: Building a Bibliographic Database0
BUCEADOR, a multi-language search engine for digital libraries0
Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.0
New language resources for the Pashto language0
Expanding Parallel Resources for Medium-Density Languages for Free0
Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser0
Effort of Genre Variation and Prediction of System Performance0
Annotated Corpora for Word Alignment between Japanese and English and its Evaluation with MAP-based Word Aligner0
Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian0
A Repository of Data and Evaluation Resources for Natural Language Generation0
A Study of Word-Classing for MT Reordering0
Large aligned treebanks for syntax-based machine translation0
A GUI to Detect and Correct Errors in Hindi Dependency Treebank0
Service Composition Scenarios for Task-Oriented Translation0
Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs0
Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation0
On the practice of error analysis for machine translation evaluation0
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation0
An Analytical Model of Language Resource Sustainability0
Building a 70 billion word corpus of English from ClueWeb0
Collecting and Using Comparable Corpora for Statistical Machine Translation0
Customization of the Europarl Corpus for Translation Studies0
An implementation of a Latvian resource grammar in Grammatical Framework0
Alignment-based reordering for SMT0
Identifying bilingual Multi-Word Expressions for Statistical Machine Translation0
Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information0
Constraint Based Description of Polish Multiword Expressions0
Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.0
A finite-state morphological transducer for KyrgyzCode0
Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System0
Parsing Any Domain English text to CoNLL dependencies0
Development and Application of a Cross-language Document Comparability Metric0
Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms0
English to Indonesian Transliteration to Support English Pronunciation Practice0
Diversifiable Bootstrapping for Acquiring High-Coverage Paraphrase Resource0
Buildind a Resource of Patterns Using Semantic Types0
RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus0
Dealing with unknown words in statistical machine translation0
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation0
Show:102550
← PrevPage 212 of 216Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Transformer Cycle (Rev)BLEU score35.14Unverified
2Noisy back-translationBLEU score35Unverified
3Transformer+Rep(Uni)BLEU score33.89Unverified
4T5-11BBLEU score32.1Unverified
5BiBERTBLEU score31.26Unverified
6Transformer + R-DropBLEU score30.91Unverified
7Bi-SimCutBLEU score30.78Unverified
8BERT-fused NMTBLEU score30.75Unverified
9Data Diversification - TransformerBLEU score30.7Unverified
10SimCutBLEU score30.56Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer+BT (ADMIN init)BLEU score46.4Unverified
2Noisy back-translationBLEU score45.6Unverified
3mRASP+Fine-TuneBLEU score44.3Unverified
4Transformer + R-DropBLEU score43.95Unverified
5Transformer (ADMIN init)BLEU score43.8Unverified
6AdminBLEU score43.8Unverified
7BERT-fused NMTBLEU score43.78Unverified
8MUSE(Paralllel Multi-scale Attention)BLEU score43.5Unverified
9T5BLEU score43.4Unverified
10Local Joint Self-attentionBLEU score43.3Unverified
#ModelMetricClaimedVerifiedStatus
1PiNMTBLEU score40.43Unverified
2BiBERTBLEU score38.61Unverified
3Bi-SimCutBLEU score38.37Unverified
4Cutoff + Relaxed Attention + LMBLEU score37.96Unverified
5DRDABLEU score37.95Unverified
6Transformer + R-Drop + CutoffBLEU score37.9Unverified
7SimCutBLEU score37.81Unverified
8Cutoff+KneeBLEU score37.78Unverified
9CutoffBLEU score37.6Unverified
10CipherDAugBLEU score37.53Unverified
#ModelMetricClaimedVerifiedStatus
1HWTSC-Teacher-SimScore19.97Unverified
2MS-COMET-22Score19.89Unverified
3MS-COMET-QE-22Score19.76Unverified
4KG-BERTScoreScore17.28Unverified
5metricx_xl_DA_2019Score17.17Unverified
6COMET-QEScore16.8Unverified
7COMET-22Score16.31Unverified
8UniTE-srcScore15.68Unverified
9UniTE-refScore15.38Unverified
10metricx_xxl_DA_2019Score15.24Unverified