SOTAVerified

Machine Translation

Machine translation is the task of automatically translating text from a source language into a different target language.

Approaches to machine translation range from rule-based and statistical systems to neural models. More recently, attention-based encoder-decoder architectures such as the Transformer have driven major improvements in translation quality.
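The core of the encoder-decoder attention mechanism mentioned above is scaled dot-product attention: each decoder query scores every encoder position and takes a weighted average of the corresponding value vectors. A minimal stdlib-only sketch (plain Python lists rather than a tensor library, so the arithmetic is explicit):

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def scaled_dot_product_attention(queries, keys, values):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V, on plain lists.

    queries, keys: lists of d_k-dimensional vectors; values: one vector per key.
    Returns one output vector per query.
    """
    d_k = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in keys]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        out = [sum(w * v[i] for w, v in zip(weights, values))
               for i in range(len(values[0]))]
        outputs.append(out)
    return outputs

# Toy example: two source positions, one target-side query.
K = [[1.0, 0.0], [0.0, 1.0]]
V = [[10.0, 0.0], [0.0, 10.0]]
Q = [[1.0, 0.0]]  # aligned with the first key
print(scaled_dot_product_attention(Q, K, V))
```

Because the query matches the first key, the output leans toward the first value vector; a real Transformer applies this in parallel over many heads and learned projections.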

One of the most popular benchmark suites for machine translation is the WMT family of datasets. Commonly used evaluation metrics include BLEU, METEOR, and NIST, among others.
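BLEU, the metric reported in the leaderboards below, combines clipped n-gram precisions with a brevity penalty. A sentence-level sketch using only the standard library (real evaluations use corpus-level BLEU with standardized tokenization, e.g. via the sacreBLEU tool; the add-one smoothing here is an illustrative choice, not the official definition):

```python
import math
from collections import Counter

def ngrams(tokens, n):
    """All contiguous n-grams of a token list, as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def bleu(candidate, reference, max_n=4):
    """Sentence-level BLEU with brevity penalty (illustrative sketch)."""
    precisions = []
    for n in range(1, max_n + 1):
        cand = Counter(ngrams(candidate, n))
        ref = Counter(ngrams(reference, n))
        # Clipped matches: each candidate n-gram counts at most as often
        # as it occurs in the reference.
        overlap = sum(min(c, ref[g]) for g, c in cand.items())
        total = max(sum(cand.values()), 1)
        # Add-one smoothing so one zero match does not zero out the score.
        precisions.append((overlap + 1) / (total + 1))
    # Brevity penalty discourages overly short candidates.
    bp = 1.0 if len(candidate) >= len(reference) \
        else math.exp(1 - len(reference) / len(candidate))
    return bp * math.exp(sum(math.log(p) for p in precisions) / max_n)

cand = "the cat sat on the mat".split()
ref = "the cat is on the mat".split()
print(round(bleu(cand, ref), 3))
```

Note that scores from hand-rolled BLEU implementations are not comparable across papers; reporting a standardized sacreBLEU signature is the usual remedy.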

(Image credit: Google seq2seq)

Papers

Showing 1251–1275 of 10752 papers

Title | Status | Hype
A Study in Improving BLEU Reference Coverage with Diverse Automatic Paraphrasing | Code | 0
Fast and Simple Mixture of Softmaxes with BPE and Hybrid-LightRNN for Language Generation | Code | 0
From Gameplay to Symbolic Reasoning: Learning SAT Solver Heuristics in the Style of Alpha(Go) Zero | Code | 0
Evaluating Structural Generalization in Neural Machine Translation | Code | 0
Evaluating Sequence-to-Sequence Models for Handwritten Text Recognition | Code | 0
Evaluating the morphological competence of Machine Translation Systems | Code | 0
Evaluating Pronominal Anaphora in Machine Translation: An Evaluation Measure and a Test Suite | Code | 0
An Empirical Study on the Robustness of Massively Multilingual Neural Machine Translation | Code | 0
Evaluating Rewards for Question Generation Models | Code | 0
Evaluating the Morphosyntactic Well-formedness of Generated Texts | Code | 0
Evaluating Machine Translation Models for English-Hindi Language Pairs: A Comparative Analysis | Code | 0
An Empirical Study of Translation Hypothesis Ensembling with Large Language Models | Code | 0
A Call for Clarity in Reporting BLEU Scores | Code | 0
Evaluating Optimal Reference Translations | Code | 0
Evaluation of Chinese-English Machine Translation of Emotion-Loaded Microblog Texts: A Human Annotated Dataset for the Quality Assessment of Emotion Translation | Code | 0
Estimating post-editing effort: a study on human judgements, task-based and reference-based metrics of MT quality | Code | 0
Evaluating Automatic Metrics with Incremental Machine Translation Systems | Code | 0
Escaping the sentence-level paradigm in machine translation | Code | 0
Evaluating bilingual word embeddings on the long tail | Code | 0
Equalizing Gender Biases in Neural Machine Translation with Word Embeddings Techniques | Code | 0
Error Analysis of Cross-lingual Tagging and Parsing | Code | 0
Entity Projection via Machine Translation for Cross-Lingual NER | Code | 0
Evaluating Gender Bias in German Machine Translation | Code | 0
Evaluation of Google Translate for Mandarin Chinese translation using sentiment and semantic analysis | Code | 0
Advancing Neural Network Performance through Emergence-Promoting Initialization Scheme | Code | 0
Page 51 of 431

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Transformer Cycle (Rev) | BLEU score | 35.14 | | Unverified
2 | Noisy back-translation | BLEU score | 35 | | Unverified
3 | Transformer+Rep(Uni) | BLEU score | 33.89 | | Unverified
4 | T5-11B | BLEU score | 32.1 | | Unverified
5 | BiBERT | BLEU score | 31.26 | | Unverified
6 | Transformer + R-Drop | BLEU score | 30.91 | | Unverified
7 | Bi-SimCut | BLEU score | 30.78 | | Unverified
8 | BERT-fused NMT | BLEU score | 30.75 | | Unverified
9 | Data Diversification - Transformer | BLEU score | 30.7 | | Unverified
10 | SimCut | BLEU score | 30.56 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Transformer+BT (ADMIN init) | BLEU score | 46.4 | | Unverified
2 | Noisy back-translation | BLEU score | 45.6 | | Unverified
3 | mRASP+Fine-Tune | BLEU score | 44.3 | | Unverified
4 | Transformer + R-Drop | BLEU score | 43.95 | | Unverified
5 | Transformer (ADMIN init) | BLEU score | 43.8 | | Unverified
6 | Admin | BLEU score | 43.8 | | Unverified
7 | BERT-fused NMT | BLEU score | 43.78 | | Unverified
8 | MUSE (Parallel Multi-scale Attention) | BLEU score | 43.5 | | Unverified
9 | T5 | BLEU score | 43.4 | | Unverified
10 | Local Joint Self-attention | BLEU score | 43.3 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | PiNMT | BLEU score | 40.43 | | Unverified
2 | BiBERT | BLEU score | 38.61 | | Unverified
3 | Bi-SimCut | BLEU score | 38.37 | | Unverified
4 | Cutoff + Relaxed Attention + LM | BLEU score | 37.96 | | Unverified
5 | DRDA | BLEU score | 37.95 | | Unverified
6 | Transformer + R-Drop + Cutoff | BLEU score | 37.9 | | Unverified
7 | SimCut | BLEU score | 37.81 | | Unverified
8 | Cutoff+Knee | BLEU score | 37.78 | | Unverified
9 | Cutoff | BLEU score | 37.6 | | Unverified
10 | CipherDAug | BLEU score | 37.53 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | HWTSC-Teacher-Sim | Score | 19.97 | | Unverified
2 | MS-COMET-22 | Score | 19.89 | | Unverified
3 | MS-COMET-QE-22 | Score | 19.76 | | Unverified
4 | KG-BERTScore | Score | 17.28 | | Unverified
5 | metricx_xl_DA_2019 | Score | 17.17 | | Unverified
6 | COMET-QE | Score | 16.8 | | Unverified
7 | COMET-22 | Score | 16.31 | | Unverified
8 | UniTE-src | Score | 15.68 | | Unverified
9 | UniTE-ref | Score | 15.38 | | Unverified
10 | metricx_xxl_DA_2019 | Score | 15.24 | | Unverified