Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 10752 papers

Title	Date	Tasks	Status	Hype
No Language Left Behind: Scaling Human-Centered Machine Translation	Jul 11, 2022	Machine TranslationMixture-of-Experts	CodeCode Available	2
OpenICL: An Open-Source Framework for In-context Learning	Mar 6, 2023	In-Context LearningLanguage Modeling	CodeCode Available	2
GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators	Feb 10, 2024	Machine TranslationSpeech-to-Speech Translation	CodeCode Available	2
OWL: A Large Language Model for IT Operations	Sep 17, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer	Oct 23, 2019	Answer GenerationCommon Sense Reasoning	CodeCode Available	2
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate	May 30, 2023	Arithmetic ReasoningMachine Translation	CodeCode Available	2
Shifts 2.0: Extending The Dataset of Real Distributional Shifts	Jun 30, 2022	Autonomous Drivingimage-classification	CodeCode Available	2
Simple Recurrent Units for Highly Parallelizable Recurrence	Sep 8, 2017	General ClassificationMachine Translation	CodeCode Available	2
Enhancing Taiwanese Hokkien Dual Translation by Exploring and Standardizing of Four Writing Systems	Mar 18, 2024	Machine TranslationTranslation	CodeCode Available	2
TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP	Apr 29, 2020	Adversarial AttackAdversarial Text	CodeCode Available	2
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation	Apr 26, 2018	Machine TranslationTranslation	CodeCode Available	2
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism	Nov 16, 2018	Fine-Grained Image Classificationimage-classification	CodeCode Available	2
Democratizing Neural Machine Translation with OPUS-MT	Dec 4, 2022	Machine TranslationTranslation	CodeCode Available	2
CoNT: Contrastive Neural Text Generation	May 29, 2022	Code Comment GenerationComment Generation	CodeCode Available	2
Binarized Neural Machine Translation	Feb 9, 2023	BinarizationMachine Translation	CodeCode Available	2
BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages	May 29, 2023	Machine TranslationTranslation	CodeCode Available	2
Automated Deep Learning: Neural Architecture Search Is Not the End	Dec 16, 2021	Deep LearningMachine Translation	CodeCode Available	2
CATT: Character-based Arabic Tashkeel Transformer	Jul 3, 2024	Arabic Text DiacritizationDecoder	CodeCode Available	2
Cross-lingual and Multilingual CLIP	Jun 1, 2022	Contrastive LearningImage-text Retrieval	CodeCode Available	2
DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory	Oct 10, 2024	Document TranslationMachine Translation	CodeCode Available	2
Efficient Minimum Bayes Risk Decoding using Low-Rank Matrix Completion Algorithms	Jun 5, 2024	Low-Rank Matrix CompletionMachine Translation	CodeCode Available	2
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation	May 19, 2023	HallucinationMachine Translation	CodeCode Available	2
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model	Aug 2, 2022	Causal Language ModelingCommon Sense Reasoning	CodeCode Available	2
Exploring Human-Like Translation Strategy with Large Language Models	May 6, 2023	HallucinationMachine Translation	CodeCode Available	2
MIND Your Language: A Multilingual Dataset for Cross-lingual News Recommendation	Mar 26, 2024	Cross-Lingual TransferLanguage Modelling	CodeCode Available	2

Show:10 25 50

← PrevPage 3 of 431Next →

All datasets WMT2014 English-German WMT2014 English-French IWSLT2014 German-English ACES WMT2016 English-Romanian WMT2016 Romanian-English WMT2014 German-English IWSLT2015 German-English WMT2016 English-German IWSLT2015 English-Vietnamese IWSLT2015 English-German WMT2016 German-English

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Transformer Cycle (Rev)	BLEU score	35.14	—	Unverified
2	Noisy back-translation	BLEU score	35	—	Unverified
3	Transformer+Rep(Uni)	BLEU score	33.89	—	Unverified
4	T5-11B	BLEU score	32.1	—	Unverified
5	BiBERT	BLEU score	31.26	—	Unverified
6	Transformer + R-Drop	BLEU score	30.91	—	Unverified
7	Bi-SimCut	BLEU score	30.78	—	Unverified
8	BERT-fused NMT	BLEU score	30.75	—	Unverified
9	Data Diversification - Transformer	BLEU score	30.7	—	Unverified
10	SimCut	BLEU score	30.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer+BT (ADMIN init)	BLEU score	46.4	—	Unverified
2	Noisy back-translation	BLEU score	45.6	—	Unverified
3	mRASP+Fine-Tune	BLEU score	44.3	—	Unverified
4	Transformer + R-Drop	BLEU score	43.95	—	Unverified
5	Transformer (ADMIN init)	BLEU score	43.8	—	Unverified
6	Admin	BLEU score	43.8	—	Unverified
7	BERT-fused NMT	BLEU score	43.78	—	Unverified
8	MUSE(Paralllel Multi-scale Attention)	BLEU score	43.5	—	Unverified
9	T5	BLEU score	43.4	—	Unverified
10	Local Joint Self-attention	BLEU score	43.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PiNMT	BLEU score	40.43	—	Unverified
2	BiBERT	BLEU score	38.61	—	Unverified
3	Bi-SimCut	BLEU score	38.37	—	Unverified
4	Cutoff + Relaxed Attention + LM	BLEU score	37.96	—	Unverified
5	DRDA	BLEU score	37.95	—	Unverified
6	Transformer + R-Drop + Cutoff	BLEU score	37.9	—	Unverified
7	SimCut	BLEU score	37.81	—	Unverified
8	Cutoff+Knee	BLEU score	37.78	—	Unverified
9	Cutoff	BLEU score	37.6	—	Unverified
10	CipherDAug	BLEU score	37.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HWTSC-Teacher-Sim	Score	19.97	—	Unverified
2	MS-COMET-22	Score	19.89	—	Unverified
3	MS-COMET-QE-22	Score	19.76	—	Unverified
4	KG-BERTScore	Score	17.28	—	Unverified
5	metricx_xl_DA_2019	Score	17.17	—	Unverified
6	COMET-QE	Score	16.8	—	Unverified
7	COMET-22	Score	16.31	—	Unverified
8	UniTE-src	Score	15.68	—	Unverified
9	UniTE-ref	Score	15.38	—	Unverified
10	metricx_xxl_DA_2019	Score	15.24	—	Unverified