SOTAVerified

Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches to machine translation range from rule-based to statistical to neural. More recently, attention-based encoder-decoder architectures such as the Transformer have brought major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.
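To make the BLEU metric concrete, here is a minimal sketch of sentence-level BLEU: clipped n-gram precision for n = 1 to 4, combined by a geometric mean and scaled by a brevity penalty. It assumes whitespace tokenization, a single reference, and no smoothing. The function names `ngrams` and `sentence_bleu` are hypothetical, chosen for illustration. Published scores are normally computed at corpus level with standard tooling such as sacreBLEU, so this is not a way to reproduce leaderboard numbers.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(candidate, reference, max_n=4):
    """Unsmoothed, single-reference BLEU on whitespace tokens (illustrative only)."""
    cand, ref = candidate.split(), reference.split()
    if not cand:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        cand_counts, ref_counts = ngrams(cand, n), ngrams(ref, n)
        # Clip each candidate n-gram count by its count in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        total = sum(cand_counts.values())
        if overlap == 0 or total == 0:
            return 0.0  # without smoothing, any zero precision gives BLEU = 0
        log_precisions.append(math.log(overlap / total))
    # Brevity penalty: penalize candidates that are shorter than the reference.
    bp = 1.0 if len(cand) > len(ref) else math.exp(1 - len(ref) / len(cand))
    # Geometric mean of the n-gram precisions, scaled by the brevity penalty.
    return bp * math.exp(sum(log_precisions) / max_n)
```

The sketch returns a value between 0 and 1, whereas BLEU is usually reported multiplied by 100. A candidate that matches the reference exactly scores 1.0, and a candidate that drops the final word of a six-word reference keeps perfect n-gram precision but is scaled down by the brevity penalty.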


Papers

Showing 10601-10650 of 10752 papers

| Title | Status | Hype |
|---|---|---|
| Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser |  | 0 |
| Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier |  | 0 |
| Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian |  | 0 |
| A Study of Word-Classing for MT Reordering |  | 0 |
| Expanding Parallel Resources for Medium-Density Languages for Free |  | 0 |
| Experiences in Resource Generation for Machine Translation through Crowdsourcing |  | 0 |
| Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms |  | 0 |
| Rich Morphology Generation Using Statistical Machine Translation |  | 0 |
| Tajik-Farsi Persian Transliteration Using Statistical Machine Translation |  | 0 |
| Measuring Interlanguage: Native Language Identification with L1-influence Metrics |  | 0 |
| Annotated Corpora for Word Alignment between Japanese and English and its Evaluation with MAP-based Word Aligner |  | 0 |
| Assessing the Comparability of News Texts |  | 0 |
| Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System |  | 0 |
| Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies |  | 0 |
| Eye Tracking as a Tool for Machine Translation Error Analysis |  | 0 |
| Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information |  | 0 |
| RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus |  | 0 |
| An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT |  | 0 |
| Shared Task Proposal: Syntactic Paraphrase Ranking |  | 0 |
| Acquisition of Syntactic Simplification Rules for French |  | 0 |
| Service Composition Scenarios for Task-Oriented Translation |  | 0 |
| Identifying bilingual Multi-Word Expressions for Statistical Machine Translation |  | 0 |
| The FLaReNet Strategic Language Resource Agenda |  | 0 |
| Linguistic Analysis Processing Line for Bulgarian |  | 0 |
| Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation |  | 0 |
| A Rough Set Formalization of Quantitative Evaluation with Ambiguity |  | 0 |
| Linguagrid: a network of Linguistic and Semantic Services for the Italian Language. |  | 0 |
| A contrastive review of paraphrase acquisition techniques |  | 0 |
| A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation |  | 0 |
| Light Verb Constructions in the SzegedParalellFX English-Hungarian Parallel Corpus |  | 0 |
| Terra: a Collection of Translation Error-Annotated Corpora |  | 0 |
| Le Petit Prince in UNL |  | 0 |
| A Repository of Data and Evaluation Resources for Natural Language Generation |  | 0 |
| Annotating dropped pronouns in Chinese newswire text |  | 0 |
| A new semantically annotated corpus with syntactic-semantic and cross-lingual senses |  | 0 |
| Text Simplification Tools for Spanish |  | 0 |
| Arabic-Segmentation Combination Strategies for Statistical Machine Translation |  | 0 |
| An implementation of a Latvian resource grammar in Grammatical Framework |  | 0 |
| Holaaa!! writin like u talk is kewl but kinda hard 4 NLP |  | 0 |
| Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs |  | 0 |
| LDC Language Resource Database: Building a Bibliographic Database |  | 0 |
| HunOr: A Hungarian-Russian Parallel Corpus |  | 0 |
| Latvian and Lithuanian Named Entity Recognition with TildeNER |  | 0 |
| A light way to collect comparable corpora from the Web |  | 0 |
| BUCEADOR, a multi-language search engine for digital libraries |  | 0 |
| Buildind a Resource of Patterns Using Semantic Types |  | 0 |
| Building a 70 billion word corpus of English from ClueWeb |  | 0 |
| Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine? |  | 0 |
| PEXACC: A Parallel Sentence Mining Algorithm from Comparable Corpora |  | 0 |
| PET: a Tool for Post-editing and Assessing Machine Translation |  | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Transformer Cycle (Rev) | BLEU score | 35.14 |  | Unverified |
| 2 | Noisy back-translation | BLEU score | 35 |  | Unverified |
| 3 | Transformer+Rep(Uni) | BLEU score | 33.89 |  | Unverified |
| 4 | T5-11B | BLEU score | 32.1 |  | Unverified |
| 5 | BiBERT | BLEU score | 31.26 |  | Unverified |
| 6 | Transformer + R-Drop | BLEU score | 30.91 |  | Unverified |
| 7 | Bi-SimCut | BLEU score | 30.78 |  | Unverified |
| 8 | BERT-fused NMT | BLEU score | 30.75 |  | Unverified |
| 9 | Data Diversification - Transformer | BLEU score | 30.7 |  | Unverified |
| 10 | SimCut | BLEU score | 30.56 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Transformer+BT (ADMIN init) | BLEU score | 46.4 |  | Unverified |
| 2 | Noisy back-translation | BLEU score | 45.6 |  | Unverified |
| 3 | mRASP+Fine-Tune | BLEU score | 44.3 |  | Unverified |
| 4 | Transformer + R-Drop | BLEU score | 43.95 |  | Unverified |
| 5 | Transformer (ADMIN init) | BLEU score | 43.8 |  | Unverified |
| 6 | Admin | BLEU score | 43.8 |  | Unverified |
| 7 | BERT-fused NMT | BLEU score | 43.78 |  | Unverified |
| 8 | MUSE (Parallel Multi-scale Attention) | BLEU score | 43.5 |  | Unverified |
| 9 | T5 | BLEU score | 43.4 |  | Unverified |
| 10 | Local Joint Self-attention | BLEU score | 43.3 |  | Unverified |
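
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PiNMT | BLEU score | 40.43 |  | Unverified |
| 2 | BiBERT | BLEU score | 38.61 |  | Unverified |
| 3 | Bi-SimCut | BLEU score | 38.37 |  | Unverified |
| 4 | Cutoff + Relaxed Attention + LM | BLEU score | 37.96 |  | Unverified |
| 5 | DRDA | BLEU score | 37.95 |  | Unverified |
| 6 | Transformer + R-Drop + Cutoff | BLEU score | 37.9 |  | Unverified |
| 7 | SimCut | BLEU score | 37.81 |  | Unverified |
| 8 | Cutoff+Knee | BLEU score | 37.78 |  | Unverified |
| 9 | Cutoff | BLEU score | 37.6 |  | Unverified |
| 10 | CipherDAug | BLEU score | 37.53 |  | Unverified |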
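
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HWTSC-Teacher-Sim | Score | 19.97 |  | Unverified |
| 2 | MS-COMET-22 | Score | 19.89 |  | Unverified |
| 3 | MS-COMET-QE-22 | Score | 19.76 |  | Unverified |
| 4 | KG-BERTScore | Score | 17.28 |  | Unverified |
| 5 | metricx_xl_DA_2019 | Score | 17.17 |  | Unverified |
| 6 | COMET-QE | Score | 16.8 |  | Unverified |
| 7 | COMET-22 | Score | 16.31 |  | Unverified |
| 8 | UniTE-src | Score | 15.68 |  | Unverified |
| 9 | UniTE-ref | Score | 15.38 |  | Unverified |
| 10 | metricx_xxl_DA_2019 | Score | 15.24 |  | Unverified |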