SOTAVerified

Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches to machine translation range from rule-based to statistical to neural. More recently, attention-based encoder-decoder architectures such as the Transformer have brought major improvements in machine translation.

The WMT family of datasets is among the most popular benchmarks for machine translation systems. Commonly used evaluation metrics include BLEU, METEOR, and NIST.
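As a rough illustration of how a metric such as BLEU is computed, the sketch below combines clipped n-gram precisions (orders 1 to 4) with a brevity penalty. It is a minimal sketch, not the scoring script behind any leaderboard on this page: the function names `ngrams` and `corpus_bleu` are hypothetical, and it assumes pre-tokenized input, a single reference per hypothesis, and no smoothing.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hypotheses, references, max_n=4):
    """Corpus-level BLEU in [0, 1]; assumes one reference per hypothesis.

    hypotheses, references: equal-length lists of token lists.
    No smoothing: if any n-gram order has zero matches, the score is 0.
    """
    matches = [0] * max_n  # clipped n-gram matches, per order
    totals = [0] * max_n   # hypothesis n-gram counts, per order
    hyp_len = ref_len = 0

    for hyp, ref in zip(hypotheses, references):
        hyp_len += len(hyp)
        ref_len += len(ref)
        for n in range(1, max_n + 1):
            hyp_counts = ngrams(hyp, n)
            ref_counts = ngrams(ref, n)
            # Counter intersection clips each hypothesis n-gram count
            # at its count in the reference.
            matches[n - 1] += sum((hyp_counts & ref_counts).values())
            totals[n - 1] += sum(hyp_counts.values())

    if hyp_len == 0 or min(matches) == 0:
        return 0.0

    # Geometric mean of the n-gram precisions, computed in log space.
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    # Penalize hypotheses that are shorter than the references overall.
    brevity_penalty = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return brevity_penalty * math.exp(log_precision)
```

Leaderboards usually report BLEU on a 0 to 100 scale, so the value returned here would be multiplied by 100. Published scores also depend on tokenization and reference handling, which is why standardized tools such as sacreBLEU are generally preferred over hand-rolled implementations like this one.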

(Image credit: Google seq2seq)

Papers

Showing 10601-10650 of 10752 papers

| Title | Status | Hype |
| --- | --- | --- |
| Two Phase Evaluation for Selecting Machine Translation Services | | 0 |
| The FLaReNet Strategic Language Resource Agenda | | 0 |
| Text Simplification Tools for Spanish | | 0 |
| Turkish Paraphrase Corpus | | 0 |
| Using Noun Similarity to Adapt an Acceptability Measure for Persian Light Verb Constructions | | 0 |
| The META-SHARE Language Resources Sharing Infrastructure: Principles, Challenges, Solutions | | 0 |
| The META-SHARE Metadata Schema for the Description of Language Resources | | 0 |
| Tajik-Farsi Persian Transliteration Using Statistical Machine Translation | | 0 |
| Tools for plWordNet Development. Presentation and Perspectives | | 0 |
| The ML4HMT Workshop on Optimising the Division of Labour in Hybrid Machine Translation | | 0 |
| Word Alignment for English-Turkish Language Pair | | 0 |
| SUMAT: Data Collection and Parallel Corpus Compilation for Machine Translation of Subtitles | | 0 |
| Using Wikipedia to Validate the Terminology found in a Corpus of Basic Textbooks | | 0 |
| Two Database Resources for Processing Social Media English Text | | 0 |
| The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output | | 0 |
| The Parallel-TUT: a multilingual and multiformat treebank | | 0 |
| The Joy of Parallelism with CzEng 1.0 | | 0 |
| Towards a User-Friendly Platform for Building Language Resources based on Web Services | | 0 |
| Suffix Trees as Language Models | | 0 |
| The IMAGACT Cross-linguistic Ontology of Action. A new infrastructure for natural language disambiguation | | 0 |
| Turk Bootstrap Word Sense Inventory 2.0: A Large-Scale Resource for Lexical Substitution | | 0 |
| Terra: a Collection of Translation Error-Annotated Corpora | | 0 |
| TimeBankPT: A TimeML Annotated Corpus of Portuguese | | 0 |
| Translog-II: a Program for Recording User Activity Data for Empirical Reading and Writing Research | | 0 |
| The Language Library: supporting community effort for collective resource production | | 0 |
| Using semi-experts to derive judgments on word sense alignment: a pilot study | | 0 |
| Holaaa!! writin like u talk is kewl but kinda hard 4 NLP | | 0 |
| Large Scale Lexical Analysis | | 0 |
| DSim, a Danish Parallel Corpus for Text Simplification | | 0 |
| Constructing Large Proposition Databases | | 0 |
| Automatic lexical semantic classification of nouns | | 0 |
| Extracting Directional and Comparable Corpora from a Multilingual Corpus for Translation Studies | | 0 |
| Annotating dropped pronouns in Chinese newswire text | | 0 |
| A Richly Annotated, Multilingual Parallel Corpus for Hybrid Machine Translation | | 0 |
| Identifying Nuggets of Information in GALE Distillation Evaluation | | 0 |
| Evaluating Appropriateness Of System Responses In A Spoken CALL Game | | 0 |
| An English-Portuguese parallel corpus of questions: translation guidelines and application in SMT | | 0 |
| A tool for enhanced search of multilingual digital libraries of e-journals | | 0 |
| DISLOG: A logic-based language for processing discourse structures | | 0 |
| Free/Open Source Shallow-Transfer Based Machine Translation for Spanish and Aragonese | | 0 |
| Design and compilation of a specialized Spanish-German parallel corpus | | 0 |
| A High-Quality Web Corpus of Czech | | 0 |
| A new semantically annotated corpus with syntactic-semantic and cross-lingual senses | | 0 |
| Discourse-level Annotation over Europarl for Machine Translation: Connectives and Pronouns | | 0 |
| Can Statistical Post-Editing with a Small Parallel Corpus Save a Weak MT Engine? | | 0 |
| Correlation between Similarity Measures for Inter-Language Linked Wikipedia Articles | | 0 |
| Source-Language Dictionaries Help Non-Expert Users to Enlarge Target-Language Dictionaries for Machine Translation | | 0 |
| Automatic MT Error Analysis: Hjerson Helping Addicter | | 0 |
| Experiences in Resource Generation for Machine Translation through Crowdsourcing | | 0 |
| On Using Linked Data for Language Resource Sharing in the Long Tail of the Localisation Market | | 0 |

Benchmark Results

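| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Transformer Cycle (Rev) | BLEU score | 35.14 | | Unverified |
| 2 | Noisy back-translation | BLEU score | 35 | | Unverified |
| 3 | Transformer+Rep(Uni) | BLEU score | 33.89 | | Unverified |
| 4 | T5-11B | BLEU score | 32.1 | | Unverified |
| 5 | BiBERT | BLEU score | 31.26 | | Unverified |
| 6 | Transformer + R-Drop | BLEU score | 30.91 | | Unverified |
| 7 | Bi-SimCut | BLEU score | 30.78 | | Unverified |
| 8 | BERT-fused NMT | BLEU score | 30.75 | | Unverified |
| 9 | Data Diversification - Transformer | BLEU score | 30.7 | | Unverified |
| 10 | SimCut | BLEU score | 30.56 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Transformer+BT (ADMIN init) | BLEU score | 46.4 | | Unverified |
| 2 | Noisy back-translation | BLEU score | 45.6 | | Unverified |
| 3 | mRASP+Fine-Tune | BLEU score | 44.3 | | Unverified |
| 4 | Transformer + R-Drop | BLEU score | 43.95 | | Unverified |
| 5 | Transformer (ADMIN init) | BLEU score | 43.8 | | Unverified |
| 6 | Admin | BLEU score | 43.8 | | Unverified |
| 7 | BERT-fused NMT | BLEU score | 43.78 | | Unverified |
| 8 | MUSE (Parallel Multi-scale Attention) | BLEU score | 43.5 | | Unverified |
| 9 | T5 | BLEU score | 43.4 | | Unverified |
| 10 | Local Joint Self-attention | BLEU score | 43.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PiNMT | BLEU score | 40.43 | | Unverified |
| 2 | BiBERT | BLEU score | 38.61 | | Unverified |
| 3 | Bi-SimCut | BLEU score | 38.37 | | Unverified |
| 4 | Cutoff + Relaxed Attention + LM | BLEU score | 37.96 | | Unverified |
| 5 | DRDA | BLEU score | 37.95 | | Unverified |
| 6 | Transformer + R-Drop + Cutoff | BLEU score | 37.9 | | Unverified |
| 7 | SimCut | BLEU score | 37.81 | | Unverified |
| 8 | Cutoff+Knee | BLEU score | 37.78 | | Unverified |
| 9 | Cutoff | BLEU score | 37.6 | | Unverified |
| 10 | CipherDAug | BLEU score | 37.53 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | HWTSC-Teacher-Sim | Score | 19.97 | | Unverified |
| 2 | MS-COMET-22 | Score | 19.89 | | Unverified |
| 3 | MS-COMET-QE-22 | Score | 19.76 | | Unverified |
| 4 | KG-BERTScore | Score | 17.28 | | Unverified |
| 5 | metricx_xl_DA_2019 | Score | 17.17 | | Unverified |
| 6 | COMET-QE | Score | 16.8 | | Unverified |
| 7 | COMET-22 | Score | 16.31 | | Unverified |
| 8 | UniTE-src | Score | 15.68 | | Unverified |
| 9 | UniTE-ref | Score | 15.38 | | Unverified |
| 10 | metricx_xxl_DA_2019 | Score | 15.24 | | Unverified |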