Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 10752 papers

Title	Date	Tasks	Status	Hype
OpenICL: An Open-Source Framework for In-context Learning	Mar 6, 2023	In-Context LearningLanguage Modeling	CodeCode Available	2
Inseq: An Interpretability Toolkit for Sequence Generation Models	Feb 27, 2023	DecoderFeature Importance	CodeCode Available	2
Binarized Neural Machine Translation	Feb 9, 2023	BinarizationMachine Translation	CodeCode Available	2
Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine	Jan 20, 2023	Machine TranslationSentence	CodeCode Available	2
Democratizing Neural Machine Translation with OPUS-MT	Dec 4, 2022	Machine TranslationTranslation	CodeCode Available	2
Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings	Oct 23, 2022	Cross-Lingual NERCross-Lingual Transfer	CodeCode Available	2
Mega: Moving Average Equipped Gated Attention	Sep 21, 2022	Image ClassificationInductive Bias	CodeCode Available	2
AlexaTM 20B: Few-Shot Learning Using a Large-Scale Multilingual Seq2Seq Model	Aug 2, 2022	Causal Language ModelingCommon Sense Reasoning	CodeCode Available	2
No Language Left Behind: Scaling Human-Centered Machine Translation	Jul 11, 2022	Machine TranslationMixture-of-Experts	CodeCode Available	2
Shifts 2.0: Extending The Dataset of Real Distributional Shifts	Jun 30, 2022	Autonomous Drivingimage-classification	CodeCode Available	2
Cross-lingual and Multilingual CLIP	Jun 1, 2022	Contrastive LearningImage-text Retrieval	CodeCode Available	2
CoNT: Contrastive Neural Text Generation	May 29, 2022	Code Comment GenerationComment Generation	CodeCode Available	2
Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation	May 25, 2022	Cross-Lingual TransferMachine Translation	CodeCode Available	2
Automated Deep Learning: Neural Architecture Search Is Not the End	Dec 16, 2021	Deep LearningMachine Translation	CodeCode Available	2
LightSeq2: Accelerated Training for Transformer-based Models on GPUs	Oct 12, 2021	DecoderGPU	CodeCode Available	2
When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute	Feb 24, 2021	GPULanguage Modeling	CodeCode Available	2
LightSeq: A High Performance Inference Library for Transformers	Oct 23, 2020	GPUMachine Translation	CodeCode Available	2
TextAttack: A Framework for Adversarial Attacks, Data Augmentation, and Adversarial Training in NLP	Apr 29, 2020	Adversarial AttackAdversarial Text	CodeCode Available	2
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer	Oct 23, 2019	Answer GenerationCommon Sense Reasoning	CodeCode Available	2
MASS: Masked Sequence to Sequence Pre-training for Language Generation	May 7, 2019	Conversational Response GenerationDecoder	CodeCode Available	2
GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism	Nov 16, 2018	Fine-Grained Image Classificationimage-classification	CodeCode Available	2
Neural Speech Synthesis with Transformer Network	Sep 19, 2018	DecoderMachine Translation	CodeCode Available	2
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation	Sep 4, 2018	Machine TranslationText Generation	CodeCode Available	2
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation	Apr 26, 2018	Machine TranslationTranslation	CodeCode Available	2
Simple Recurrent Units for Highly Parallelizable Recurrence	Sep 8, 2017	General ClassificationMachine Translation	CodeCode Available	2
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer	Jan 23, 2017	Computational EfficiencyGPU	CodeCode Available	2
TACTIC: Translation Agents with Cognitive-Theoretic Interactive Collaboration	Jun 10, 2025	Machine TranslationTranslation	CodeCode Available	1
Universal Reasoner: A Single, Composable Plug-and-Play Reasoner for Frozen LLMs	May 25, 2025	Machine TranslationMathematical Reasoning	CodeCode Available	1
MEDIBENG WHISPER TINY: A FINE-TUNED CODE-SWITCHED BENGALI-ENGLISH TRANSLATOR FOR CLINICAL APPLICATIONS	Apr 25, 2025	Clinical Language TranslationMachine Translation	CodeCode Available	1
Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling	Apr 18, 2025	Machine TranslationTranslation	CodeCode Available	1
Sun-Shine: A Large Language Model for Tibetan Culture	Mar 24, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions	Mar 20, 2025	2D Object DetectionDistributed Computing	CodeCode Available	1
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation	Mar 9, 2025	DecoderMachine Translation	CodeCode Available	1
Automatic Input Rewriting Improves Translation with Large Language Models	Feb 23, 2025	Machine TranslationText Simplification	CodeCode Available	1
Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs	Feb 20, 2025	Cross-Lingual TransferMachine Translation	CodeCode Available	1
Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu	Feb 17, 2025	Data AugmentationIn-Context Learning	CodeCode Available	1
TUMLU: A Unified and Native Language Understanding Benchmark for Turkic Languages	Feb 16, 2025	Machine TranslationMMLU	CodeCode Available	1
How to Select Datapoints for Efficient Human Evaluation of NLG Models?	Jan 30, 2025	HumanEvalMachine Translation	CodeCode Available	1
Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages	Jan 10, 2025	Machine Translation	CodeCode Available	1
Merging Feed-Forward Sublayers for Compressed Transformers	Jan 10, 2025	image-classificationImage Classification	CodeCode Available	1
Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation	Jan 6, 2025	Machine TranslationTranslation	CodeCode Available	1
M-MAD: Multidimensional Multi-Agent Debate Framework for Fine-grained Machine Translation Evaluation	Dec 28, 2024	Machine Translation	CodeCode Available	1
Property Enhanced Instruction Tuning for Multi-task Molecule Generation with Large Language Models	Dec 24, 2024	Machine TranslationMolecular Property Prediction	CodeCode Available	1
MT-LENS: An all-in-one Toolkit for Better Machine Translation Evaluation	Dec 16, 2024	AllBenchmarking	CodeCode Available	1
Retrieval-Augmented Machine Translation with Unstructured Knowledge	Dec 5, 2024	Knowledge GraphsMachine Translation	CodeCode Available	1
Context-Informed Machine Translation of Manga using Multimodal Large Language Models	Nov 4, 2024	Machine TranslationTranslation	CodeCode Available	1
MetaMetrics-MT: Tuning Meta-Metrics for Machine Translation via Human Preference Calibration	Nov 1, 2024	Bayesian OptimizationGaussian Processes	CodeCode Available	1
Fine-Grained and Multi-Dimensional Metrics for Document-Level Machine Translation	Oct 28, 2024	Document Level Machine TranslationMachine Translation	CodeCode Available	1
How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs	Oct 24, 2024	2kMachine Translation	CodeCode Available	1
MQM-APE: Toward High-Quality Error Annotation Predictors with Automatic Post-Editing in LLM Translation Evaluators	Sep 22, 2024	Automatic Post-EditingMachine Translation	CodeCode Available	1

Show:10 25 50

← PrevPage 2 of 216Next →

All datasets WMT2014 English-German WMT2014 English-French IWSLT2014 German-English ACES WMT2016 English-Romanian WMT2016 Romanian-English WMT2014 German-English IWSLT2015 German-English WMT2016 English-German IWSLT2015 English-Vietnamese IWSLT2015 English-German WMT2016 German-English

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Transformer Cycle (Rev)	BLEU score	35.14	—	Unverified
2	Noisy back-translation	BLEU score	35	—	Unverified
3	Transformer+Rep(Uni)	BLEU score	33.89	—	Unverified
4	T5-11B	BLEU score	32.1	—	Unverified
5	BiBERT	BLEU score	31.26	—	Unverified
6	Transformer + R-Drop	BLEU score	30.91	—	Unverified
7	Bi-SimCut	BLEU score	30.78	—	Unverified
8	BERT-fused NMT	BLEU score	30.75	—	Unverified
9	Data Diversification - Transformer	BLEU score	30.7	—	Unverified
10	SimCut	BLEU score	30.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer+BT (ADMIN init)	BLEU score	46.4	—	Unverified
2	Noisy back-translation	BLEU score	45.6	—	Unverified
3	mRASP+Fine-Tune	BLEU score	44.3	—	Unverified
4	Transformer + R-Drop	BLEU score	43.95	—	Unverified
5	Admin	BLEU score	43.8	—	Unverified
6	Transformer (ADMIN init)	BLEU score	43.8	—	Unverified
7	BERT-fused NMT	BLEU score	43.78	—	Unverified
8	MUSE(Paralllel Multi-scale Attention)	BLEU score	43.5	—	Unverified
9	T5	BLEU score	43.4	—	Unverified
10	Local Joint Self-attention	BLEU score	43.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PiNMT	BLEU score	40.43	—	Unverified
2	BiBERT	BLEU score	38.61	—	Unverified
3	Bi-SimCut	BLEU score	38.37	—	Unverified
4	Cutoff + Relaxed Attention + LM	BLEU score	37.96	—	Unverified
5	DRDA	BLEU score	37.95	—	Unverified
6	Transformer + R-Drop + Cutoff	BLEU score	37.9	—	Unverified
7	SimCut	BLEU score	37.81	—	Unverified
8	Cutoff+Knee	BLEU score	37.78	—	Unverified
9	Cutoff	BLEU score	37.6	—	Unverified
10	CipherDAug	BLEU score	37.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HWTSC-Teacher-Sim	Score	19.97	—	Unverified
2	MS-COMET-22	Score	19.89	—	Unverified
3	MS-COMET-QE-22	Score	19.76	—	Unverified
4	KG-BERTScore	Score	17.28	—	Unverified
5	metricx_xl_DA_2019	Score	17.17	—	Unverified
6	COMET-QE	Score	16.8	—	Unverified
7	COMET-22	Score	16.31	—	Unverified
8	UniTE-src	Score	15.68	—	Unverified
9	UniTE-ref	Score	15.38	—	Unverified
10	metricx_xxl_DA_2019	Score	15.24	—	Unverified