Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 10752 papers

Title	Date	Tasks	Status	Hype
Do Language Models Understand Honorific Systems in Javanese?	Feb 28, 2025	Machine TranslationTranslation	—Unverified	0
Arabizi vs LLMs: Can the Genie Understand the Language of Aladdin?	Feb 28, 2025	Machine Translation	—Unverified	0
Connecting the Persian-speaking World through Transliteration	Feb 27, 2025	Machine TranslationTransliteration	—Unverified	0
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning	Feb 27, 2025	Domain AdaptationMachine Translation	—Unverified	0
Alleviating Distribution Shift in Synthetic Data for Machine Translation Quality Estimation	Feb 27, 2025	Machine TranslationSynthetic Data Generation	—Unverified	0
Evaluating Gender Bias in German Machine Translation	Feb 26, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
Enhancing Human Evaluation in Machine Translation with Comparative Judgment	Feb 25, 2025	Machine TranslationTranslation	—Unverified	0
UrduLLaMA 1.0: Dataset Curation, Preprocessing, and Evaluation in Low-Resource Settings	Feb 24, 2025	DiversityInstruction Following	—Unverified	0
Using Machine Learning to Detect Fraudulent SMSs in Chichewa	Feb 24, 2025	Fraud DetectionMachine Translation	—Unverified	0
Automatic Input Rewriting Improves Translation with Large Language Models	Feb 23, 2025	Machine TranslationText Simplification	CodeCode Available	1
LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models	Feb 21, 2025	Machine TranslationMamba	—Unverified	0
Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs	Feb 20, 2025	Cross-Lingual TransferMachine Translation	CodeCode Available	1
Early-Exit and Instant Confidence Translation Quality Estimation	Feb 20, 2025	Machine TranslationReranking	CodeCode Available	0
Effects of Prompt Length on Domain-specific Tasks for Large Language Models	Feb 20, 2025	Machine TranslationPrompt Engineering	—Unverified	0
English Please: Evaluating Machine Translation with Large Language Models for Multilingual Bug Reports	Feb 20, 2025	Domain AdaptationLanguage Identification	CodeCode Available	0
MultiSlav: Using Cross-Lingual Knowledge Transfer to Combat the Curse of Multilinguality	Feb 20, 2025	Machine TranslationNMT	—Unverified	0
Non-Euclidean Hierarchical Representational Learning using Hyperbolic Graph Neural Networks for Environmental Claim Detection	Feb 19, 2025	Claim VerificationDependency Parsing	—Unverified	0
Translation in the Hands of Many:Centering Lay Users in Machine Translation Interactions	Feb 19, 2025	Machine TranslationTranslation	—Unverified	0
WMT24++: Expanding the Language Coverage of WMT24 to 55 Languages & Dialects	Feb 18, 2025	Machine Translation	CodeCode Available	2
How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild	Feb 18, 2025	ArticlesHallucination	CodeCode Available	0
Translate Smart, not Hard: Cascaded Translation Systems with Quality-Aware Deferral	Feb 18, 2025	Machine Translation	—Unverified	0
LMN: A Tool for Generating Machine Enforceable Policies from Natural Language Access Control Rules using LLMs	Feb 18, 2025	AttributeMachine Translation	—Unverified	0
Efficient Machine Translation Corpus Generation: Integrating Human-in-the-Loop Post-Editing with Large Language Models	Feb 18, 2025	Machine TranslationTranslation	CodeCode Available	0
Evaluating o1-Like LLMs: Unlocking Reasoning for Translation through Comprehensive Analysis	Feb 17, 2025	Machine TranslationTranslation	—Unverified	0
Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu	Feb 17, 2025	Data AugmentationIn-Context Learning	CodeCode Available	1
Identifying Gender Stereotypes and Biases in Automated Translation from English to Italian using Similarity Networks	Feb 17, 2025	Machine TranslationTranslation	—Unverified	0
Asymmetric Conflict and Synergy in Post-training for LLM-based Multilingual Machine Translation	Feb 16, 2025	Machine Translation	—Unverified	0
TUMLU: A Unified and Native Language Understanding Benchmark for Turkic Languages	Feb 16, 2025	Machine TranslationMMLU	CodeCode Available	1
ANCHOLIK-NER: A Benchmark Dataset for Bangla Regional Named Entity Recognition	Feb 16, 2025	ArticlesInformation Retrieval	—Unverified	0
Injecting Domain-Specific Knowledge into Large Language Models: A Comprehensive Survey	Feb 15, 2025	Machine TranslationNatural Language Understanding	CodeCode Available	0
Truth Knows No Language: Evaluating Truthfulness Beyond English	Feb 13, 2025	InformativenessMachine Translation	CodeCode Available	0
Unsupervised Translation of Emergent Communication	Feb 11, 2025	DiversityMachine Translation	—Unverified	0
Evaluating Text Style Transfer Evaluation: Are There Any Reliable Metrics?	Feb 7, 2025	Machine TranslationStyle Transfer	—Unverified	0
Uncertainty Quantification for LLMs through Minimum Bayes Risk: Bridging Confidence and Consistency	Feb 7, 2025	Abstractive Text SummarizationMachine Translation	—Unverified	0
BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation	Feb 6, 2025	Machine TranslationSentence	—Unverified	0
Multilingual Non-Autoregressive Machine Translation without Knowledge Distillation	Feb 6, 2025	Knowledge DistillationMachine Translation	CodeCode Available	0
DOLFIN -- Document-Level Financial test set for Machine Translation	Feb 5, 2025	Document Level Machine TranslationMachine Translation	—Unverified	0
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study	Feb 4, 2025	Continual PretrainingMachine Translation	—Unverified	0
A comparison of translation performance between DeepL and Supertext	Feb 4, 2025	BenchmarkingMachine Translation	CodeCode Available	0
Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study	Feb 4, 2025	Machine TranslationTransfer Learning	CodeCode Available	0
When End-to-End is Overkill: Rethinking Cascaded Speech-to-Text Translation	Feb 1, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
An Efficient Approach for Machine Translation on Low-resource Languages: A Case Study in Vietnamese-Chinese	Jan 31, 2025	Language ModelingLanguage Modelling	—Unverified	0
Brain-inspired sparse training enables Transformers and LLMs to perform as fully connected	Jan 31, 2025	GPULanguage Modeling	—Unverified	0
How to Select Datapoints for Efficient Human Evaluation of NLG Models?	Jan 30, 2025	HumanEvalMachine Translation	CodeCode Available	1
Overestimation in LLM Evaluation: A Controlled Large-Scale Study on Data Contamination's Impact on Machine Translation	Jan 30, 2025	Machine Translation	—Unverified	0
Cross-Language Approach for Quranic QA	Jan 29, 2025	Machine TranslationQuestion Answering	—Unverified	0
Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization	Jan 28, 2025	DecoderHallucination	—Unverified	0
Misspellings in Natural Language Processing: A survey	Jan 28, 2025	Data AugmentationMachine Translation	—Unverified	0
Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems	Jan 28, 2025	Ensemble LearningHallucination	—Unverified	0
DialUp! Modeling the Language Continuum by Adapting Models to Dialects and Dialects to Models	Jan 27, 2025	Machine Translation	—Unverified	0

Show:10 25 50

← PrevPage 4 of 216Next →

All datasets WMT2014 English-German WMT2014 English-French IWSLT2014 German-English ACES WMT2016 English-Romanian WMT2016 Romanian-English WMT2014 German-English IWSLT2015 German-English WMT2016 English-German IWSLT2015 English-Vietnamese IWSLT2015 English-German WMT2016 German-English

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Transformer Cycle (Rev)	BLEU score	35.14	—	Unverified
2	Noisy back-translation	BLEU score	35	—	Unverified
3	Transformer+Rep(Uni)	BLEU score	33.89	—	Unverified
4	T5-11B	BLEU score	32.1	—	Unverified
5	BiBERT	BLEU score	31.26	—	Unverified
6	Transformer + R-Drop	BLEU score	30.91	—	Unverified
7	Bi-SimCut	BLEU score	30.78	—	Unverified
8	BERT-fused NMT	BLEU score	30.75	—	Unverified
9	Data Diversification - Transformer	BLEU score	30.7	—	Unverified
10	SimCut	BLEU score	30.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer+BT (ADMIN init)	BLEU score	46.4	—	Unverified
2	Noisy back-translation	BLEU score	45.6	—	Unverified
3	mRASP+Fine-Tune	BLEU score	44.3	—	Unverified
4	Transformer + R-Drop	BLEU score	43.95	—	Unverified
5	Admin	BLEU score	43.8	—	Unverified
6	Transformer (ADMIN init)	BLEU score	43.8	—	Unverified
7	BERT-fused NMT	BLEU score	43.78	—	Unverified
8	MUSE(Paralllel Multi-scale Attention)	BLEU score	43.5	—	Unverified
9	T5	BLEU score	43.4	—	Unverified
10	Local Joint Self-attention	BLEU score	43.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PiNMT	BLEU score	40.43	—	Unverified
2	BiBERT	BLEU score	38.61	—	Unverified
3	Bi-SimCut	BLEU score	38.37	—	Unverified
4	Cutoff + Relaxed Attention + LM	BLEU score	37.96	—	Unverified
5	DRDA	BLEU score	37.95	—	Unverified
6	Transformer + R-Drop + Cutoff	BLEU score	37.9	—	Unverified
7	SimCut	BLEU score	37.81	—	Unverified
8	Cutoff+Knee	BLEU score	37.78	—	Unverified
9	Cutoff	BLEU score	37.6	—	Unverified
10	CipherDAug	BLEU score	37.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HWTSC-Teacher-Sim	Score	19.97	—	Unverified
2	MS-COMET-22	Score	19.89	—	Unverified
3	MS-COMET-QE-22	Score	19.76	—	Unverified
4	KG-BERTScore	Score	17.28	—	Unverified
5	metricx_xl_DA_2019	Score	17.17	—	Unverified
6	COMET-QE	Score	16.8	—	Unverified
7	COMET-22	Score	16.31	—	Unverified
8	UniTE-src	Score	15.68	—	Unverified
9	UniTE-ref	Score	15.38	—	Unverified
10	metricx_xxl_DA_2019	Score	15.24	—	Unverified