Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10501–10550 of 10752 papers

Title	Date	Tasks	Status
We Need to Talk About Classification Evaluation Metrics in NLP	Jan 8, 2024	DiversityMachine Translation	—Unverified
WERd: Using Social Text Spelling Variants for Evaluating Dialectal Speech Recognition	Sep 21, 2017	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Were the clocks striking or surprising? Using WSD to improve MT performance	Apr 1, 2012	Machine TranslationWord Sense Disambiguation	—Unverified
The Impact of Preprocessing on Arabic-English Statistical and Neural Machine Translation	Jun 27, 2019	Machine TranslationTranslation	—Unverified
WeTS: A Benchmark for Translation Suggestion	Nov 16, 2021	Machine TranslationTranslation	—Unverified
What about em? How Commercial Machine Translation Fails to Handle (Neo-)Pronouns	May 25, 2023	Machine TranslationTranslation	—Unverified
On Systematic Style Differences between Unsupervised and Supervised MT and an Application for High-Resource Machine Translation	Jun 30, 2021	Machine TranslationTranslation	—Unverified
YiSi - a Unified Semantic MT Quality Evaluation and Estimation Metric for Languages with Different Levels of Available Resources	Aug 1, 2019	Machine TranslationSemantic Similarity	—Unverified
YNU\_Deep at SemEval-2018 Task 11: An Ensemble of Attention-based BiLSTM Models for Machine Comprehension	Jun 1, 2018	Machine TranslationReading Comprehension	—Unverified
What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects	Feb 19, 2024	Machine Translation	—Unverified
What does Attention in Neural Machine Translation Pay Attention to?	Oct 9, 2017	Machine TranslationSentence	—Unverified
YNU Deep at SemEval-2018 Task 12: A BiLSTM Model with Neural Attention for Argument Reasoning Comprehension	Jun 1, 2018	Constituency ParsingLanguage Modeling	—Unverified
The Impact of Multiword Expression Compositionality on Machine Translation Evaluation	Jun 1, 2015	Document RankingInformation Retrieval	—Unverified
What do RNN Language Models Learn about Filler--Gap Dependencies?	Nov 1, 2018	Language ModelingLanguage Modelling	—Unverified
What Do You Get When You Cross Beam Search with Nucleus Sampling?	Jul 20, 2021	Machine TranslationText Generation	—Unverified
What good are `Nominalkomposita' for `noun compounds': Multilingual Extraction and Structure Analysis of Nominal Compositions using Linguistic Restrictors	Aug 1, 2014	Machine Translation	—Unverified
What is Hidden among Translation Rules	Oct 1, 2013	Machine TranslationTranslation	—Unverified
What is it? Disambiguating the different readings of the pronoun `it'	Sep 1, 2017	coreference-resolutionCoreference Resolution	—Unverified
The Impact of Model Scaling on Seen and Unseen Language Performance	Jan 10, 2025	Machine TranslationMultilingual text classification	—Unverified
What is the Best Way for ChatGPT to Translate Poetry?	Jun 5, 2024	Machine TranslationTranslation	—Unverified
What Level of Quality can Neural Machine Translation Attain on Literary Text?	Jan 15, 2018	Machine TranslationNMT	—Unverified
Could We Have Had Better Multilingual LLMs If English Was Not the Central Language?	Feb 21, 2024	Machine TranslationTranslation	—Unverified
What Makes a Good Story and How Can We Measure It? A Comprehensive Survey of Story Evaluation	Aug 26, 2024	Machine Translation	—Unverified
What Makes Word-level Neural Machine Translation Hard: A Case Study on English-German Translation	Dec 1, 2016	Feature EngineeringMachine Translation	—Unverified
What Matters Most in Morphologically Segmented SMT Models?	Jun 1, 2015	Machine TranslationTransliteration	—Unverified
What Role Does BERT Play in the Neural Machine Translation Encoder?	Jan 16, 2022	Machine TranslationNMT	—Unverified
YODA System for WMT16 Shared Task: Bilingual Document Alignment	Aug 1, 2016	Machine Translation	—Unverified
You Cannot Feed Two Birds with One Score: the Accuracy-Naturalness Tradeoff in Translation	Mar 31, 2025	Machine TranslationTranslation	—Unverified
What's in a Domain? Analyzing Genre and Topic Differences in Statistical Machine Translation	Jul 1, 2015	Domain AdaptationMachine Translation	—Unverified
What's in an Embedding? Analyzing Word Embeddings through Multilingual Evaluation	Sep 1, 2015	Machine TranslationMorphological Analysis	—Unverified
What's the Difference Between Professional Human and Machine Translation? A Blind Multi-language Study on Domain-specific MT	Jun 8, 2020	Machine TranslationTranslation	—Unverified
What’s the Difference Between Professional Human and Machine Translation? A Blind Multi-language Study on Domain-specific MT	Nov 1, 2020	Machine TranslationTranslation	—Unverified
You Need to Pay Better Attention: Rethinking the Mathematics of Attention Mechanism	Mar 3, 2024	Machine TranslationMathematical Reasoning	—Unverified
What we need to learn if we want to do and not just talk	Jun 1, 2018	ChatbotMachine Translation	—Unverified
What Works and Doesn't Work, A Deep Decoder for Neural Machine Translation	Nov 16, 2021	DecoderLanguage Modelling	—Unverified
What Works and Doesn’t Work, A Deep Decoder for Neural Machine Translation	May 1, 2022	DecoderLanguage Modelling	—Unverified
What you can cram into a single \$\&!\#* vector: Probing sentence embeddings for linguistic properties	Jul 1, 2018	General ClassificationMachine Translation	—Unverified
The Impact of Machine Translation Quality on Human Post-Editing	Apr 1, 2014	Machine TranslationTranslation	—Unverified
When and why are log-linear models self-normalizing?	May 1, 2015	Computational EfficiencyGeneralization Bounds	—Unverified
Your Autoregressive Generative Model Can be Better If You Treat It as an Energy-Based One	Jun 26, 2022	Image GenerationLanguage Modeling	—Unverified
``You Sound Just Like Your Father'' Commercial Machine Translation Systems Include Stylistic Biases	Jul 1, 2020	Machine TranslationTranslation	—Unverified
When and Why is Unsupervised Neural Machine Translation Useless?	Apr 22, 2020	Machine TranslationNMT	—Unverified
The Impact of Indirect Machine Translation on Sentiment Classification	Aug 25, 2020	ClassificationGeneral Classification	—Unverified
When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?	Jan 16, 2022	Machine TranslationNMT	—Unverified
When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?	Apr 26, 2022	Machine TranslationNMT	—Unverified
You’ve translated it, now what?	Sep 1, 2022	Machine TranslationOptical Character Recognition (OCR)	—Unverified
When does deep multi-task learning work for loosely related document classification tasks?	Nov 1, 2018	Document ClassificationGeneral Classification	—Unverified
When Does Monolingual Data Help Multilingual Translation: The Role of Domain and Model Scale	May 23, 2023	DenoisingMachine Translation	—Unverified
When does Parameter-Efficient Transfer Learning Work for Machine Translation?	Jan 16, 2022	Machine TranslationTransfer Learning	—Unverified
YSDA Participation in the WMT'16 Quality Estimation Shared Task	Aug 1, 2016	Machine Translation	—Unverified

Show:10 25 50

← PrevPage 211 of 216Next →

All datasets WMT2014 English-German WMT2014 English-French IWSLT2014 German-English ACES WMT2016 English-Romanian WMT2016 Romanian-English WMT2014 German-English IWSLT2015 German-English WMT2016 English-German IWSLT2015 English-Vietnamese IWSLT2015 English-German WMT2016 German-English

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Transformer Cycle (Rev)	BLEU score	35.14	—	Unverified
2	Noisy back-translation	BLEU score	35	—	Unverified
3	Transformer+Rep(Uni)	BLEU score	33.89	—	Unverified
4	T5-11B	BLEU score	32.1	—	Unverified
5	BiBERT	BLEU score	31.26	—	Unverified
6	Transformer + R-Drop	BLEU score	30.91	—	Unverified
7	Bi-SimCut	BLEU score	30.78	—	Unverified
8	BERT-fused NMT	BLEU score	30.75	—	Unverified
9	Data Diversification - Transformer	BLEU score	30.7	—	Unverified
10	SimCut	BLEU score	30.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer+BT (ADMIN init)	BLEU score	46.4	—	Unverified
2	Noisy back-translation	BLEU score	45.6	—	Unverified
3	mRASP+Fine-Tune	BLEU score	44.3	—	Unverified
4	Transformer + R-Drop	BLEU score	43.95	—	Unverified
5	Admin	BLEU score	43.8	—	Unverified
6	Transformer (ADMIN init)	BLEU score	43.8	—	Unverified
7	BERT-fused NMT	BLEU score	43.78	—	Unverified
8	MUSE(Paralllel Multi-scale Attention)	BLEU score	43.5	—	Unverified
9	T5	BLEU score	43.4	—	Unverified
10	Local Joint Self-attention	BLEU score	43.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PiNMT	BLEU score	40.43	—	Unverified
2	BiBERT	BLEU score	38.61	—	Unverified
3	Bi-SimCut	BLEU score	38.37	—	Unverified
4	Cutoff + Relaxed Attention + LM	BLEU score	37.96	—	Unverified
5	DRDA	BLEU score	37.95	—	Unverified
6	Transformer + R-Drop + Cutoff	BLEU score	37.9	—	Unverified
7	SimCut	BLEU score	37.81	—	Unverified
8	Cutoff+Knee	BLEU score	37.78	—	Unverified
9	Cutoff	BLEU score	37.6	—	Unverified
10	CipherDAug	BLEU score	37.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HWTSC-Teacher-Sim	Score	19.97	—	Unverified
2	MS-COMET-22	Score	19.89	—	Unverified
3	MS-COMET-QE-22	Score	19.76	—	Unverified
4	KG-BERTScore	Score	17.28	—	Unverified
5	metricx_xl_DA_2019	Score	17.17	—	Unverified
6	COMET-QE	Score	16.8	—	Unverified
7	COMET-22	Score	16.31	—	Unverified
8	UniTE-src	Score	15.68	—	Unverified
9	UniTE-ref	Score	15.38	—	Unverified
10	metricx_xxl_DA_2019	Score	15.24	—	Unverified