Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1051–1100 of 10752 papers

Title	Date	Tasks	Status	Hype
Towards General Error Diagnosis via Behavioral Testing in Machine Translation	Oct 20, 2023	Machine TranslationTranslation	CodeCode Available	0
Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning	Oct 20, 2023	In-Context LearningMachine Translation	—Unverified	0
Simultaneous Machine Translation with Tailored Reference	Oct 20, 2023	Machine TranslationSentence	—Unverified	0
Ask Language Model to Clean Your Noisy Translation Data	Oct 20, 2023	Language ModelingLanguage Modelling	—Unverified	0
A Use Case: Reformulating Query Rewriting as a Statistical Machine Translation Problem	Oct 19, 2023	Machine TranslationTranslation	—Unverified	0
Direct Neural Machine Translation with Task-level Mixture of Experts models	Oct 18, 2023	Direct NMTLarge Language Model	—Unverified	0
GRI: Graph-based Relative Isomorphism of Word Embedding Spaces	Oct 18, 2023	Machine Translation	CodeCode Available	0
A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation	Oct 18, 2023	FairnessFew-Shot Learning	CodeCode Available	0
knn-seq: Efficient, Extensible kNN-MT Framework	Oct 18, 2023	Machine TranslationNMT	CodeCode Available	1
Document-Level Language Models for Machine Translation	Oct 18, 2023	Language ModelingLanguage Modelling	—Unverified	0
Program Translation via Code Distillation	Oct 17, 2023	DiversityMachine Translation	—Unverified	0
IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems	Oct 17, 2023	Machine TranslationTranslation	CodeCode Available	0
An Empirical Study of Translation Hypothesis Ensembling with Large Language Models	Oct 17, 2023	DiversityMachine Translation	CodeCode Available	0
Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation	Oct 17, 2023	DecoderIn-Context Learning	—Unverified	0
Enhancing Neural Machine Translation with Semantic Units	Oct 17, 2023	Machine TranslationNMT	CodeCode Available	0
Long-form Simultaneous Speech Translation: Thesis Proposal	Oct 17, 2023	FormMachine Translation	—Unverified	0
xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection	Oct 16, 2023	Machine TranslationSentence	CodeCode Available	1
Towards a Better Understanding of Variations in Zero-Shot Neural Machine Translation Performance	Oct 16, 2023	Machine TranslationNMT	CodeCode Available	0
UvA-MT's Participation in the WMT23 General Translation Shared Task	Oct 15, 2023	Machine TranslationTranslation	—Unverified	0
MILPaC: A Novel Benchmark for Evaluating Translation of Legal Text to Indian Languages	Oct 15, 2023	Machine TranslationTranslation	CodeCode Available	0
Attentive Multi-Layer Perceptron for Non-autoregressive Generation	Oct 14, 2023	Machine TranslationSpeech Synthesis	CodeCode Available	0
Human-in-the-loop Machine Translation with Large Language Model	Oct 13, 2023	In-Context LearningLanguage Modeling	CodeCode Available	0
Towards Example-Based NMT with Multi-Levenshtein Transformers	Oct 13, 2023	Domain AdaptationImitation Learning	CodeCode Available	0
Political claim identification and categorization in a multilingual setting: First experiments	Oct 13, 2023	Machine TranslationTranslation	—Unverified	0
xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark	Oct 13, 2023	Dialogue EvaluationMachine Translation	CodeCode Available	0
Why bother with geometry? On the relevance of linear decompositions of Transformer embeddings	Oct 10, 2023	Machine TranslationSentence	CodeCode Available	0
Crossing the Threshold: Idiomatic Machine Translation through Retrieval Augmentation and Loss Weighting	Oct 10, 2023	4kMachine Translation	CodeCode Available	0
Quality-Aware Translation Models: Efficient Generation and Quality Estimation in a Single Model	Oct 10, 2023	Machine TranslationNMT	—Unverified	0
In-Context Explainers: Harnessing LLMs for Explaining Black Box Models	Oct 9, 2023	Explainable artificial intelligenceExplainable Artificial Intelligence (XAI)	CodeCode Available	1
Larth: Dataset and Machine Translation for Etruscan	Oct 9, 2023	Machine TranslationTranslation	CodeCode Available	0
Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting	Oct 9, 2023	Language ModelingLanguage Modelling	—Unverified	0
Synslator: An Interactive Machine Translation Tool with Online Learning	Oct 8, 2023	Language ModelingLanguage Modelling	—Unverified	0
CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation	Oct 8, 2023	Code TranslationMachine Translation	CodeCode Available	1
Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with the GeNTE Corpus	Oct 8, 2023	BenchmarkingMachine Translation	CodeCode Available	0
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT	Oct 7, 2023	Audio captioningAutomatic Speech Recognition	CodeCode Available	2
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers	Oct 5, 2023	DecoderLogical Reasoning	CodeCode Available	0
Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns	Oct 3, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness	Oct 3, 2023	GPUMachine Translation	—Unverified	0
Nugget: Neural Agglomerative Embeddings of Text	Oct 3, 2023	Language ModelingLanguage Modelling	CodeCode Available	0
Necessary and Sufficient Watermark for Large Language Models	Oct 2, 2023	ArticlesMachine Translation	—Unverified	0
Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models	Oct 2, 2023	Language ModelingLanguage Modelling	—Unverified	0
Quantifying the Plausibility of Context Reliance in Neural Machine Translation	Oct 2, 2023	Machine TranslationTranslation	CodeCode Available	2
Colloquial Persian POS (CPPOS) Corpus: A Novel Corpus for Colloquial Persian Part of Speech Tagging	Oct 1, 2023	Machine TranslationPart-Of-Speech Tagging	—Unverified	0
Sparse Backpropagation for MoE Training	Oct 1, 2023	Machine Translation	—Unverified	0
Unlikelihood Tuning on Negative Samples Amazingly Improves Zero-Shot Translation	Sep 28, 2023	Machine TranslationNavigate	CodeCode Available	0
A Benchmark for Learning to Translate a New Language from One Grammar Book	Sep 28, 2023	In-Context LearningMachine Translation	CodeCode Available	0
Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization	Sep 27, 2023	Machine TranslationTranslation	—Unverified	0
Enhancing Sharpness-Aware Optimization Through Variance Suppression	Sep 27, 2023	Data Augmentationimage-classification	CodeCode Available	1
Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing	Sep 27, 2023	DecoderMachine Translation	—Unverified	0
Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution	Sep 27, 2023	Machine TranslationManagement	—Unverified	0

Show:10 25 50

← PrevPage 22 of 216Next →

All datasets WMT2014 English-German WMT2014 English-French IWSLT2014 German-English ACES WMT2016 English-Romanian WMT2016 Romanian-English WMT2014 German-English IWSLT2015 German-English WMT2016 English-German IWSLT2015 English-Vietnamese IWSLT2015 English-German WMT2016 German-English

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Transformer Cycle (Rev)	BLEU score	35.14	—	Unverified
2	Noisy back-translation	BLEU score	35	—	Unverified
3	Transformer+Rep(Uni)	BLEU score	33.89	—	Unverified
4	T5-11B	BLEU score	32.1	—	Unverified
5	BiBERT	BLEU score	31.26	—	Unverified
6	Transformer + R-Drop	BLEU score	30.91	—	Unverified
7	Bi-SimCut	BLEU score	30.78	—	Unverified
8	BERT-fused NMT	BLEU score	30.75	—	Unverified
9	Data Diversification - Transformer	BLEU score	30.7	—	Unverified
10	SimCut	BLEU score	30.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer+BT (ADMIN init)	BLEU score	46.4	—	Unverified
2	Noisy back-translation	BLEU score	45.6	—	Unverified
3	mRASP+Fine-Tune	BLEU score	44.3	—	Unverified
4	Transformer + R-Drop	BLEU score	43.95	—	Unverified
5	Transformer (ADMIN init)	BLEU score	43.8	—	Unverified
6	Admin	BLEU score	43.8	—	Unverified
7	BERT-fused NMT	BLEU score	43.78	—	Unverified
8	MUSE(Paralllel Multi-scale Attention)	BLEU score	43.5	—	Unverified
9	T5	BLEU score	43.4	—	Unverified
10	Local Joint Self-attention	BLEU score	43.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PiNMT	BLEU score	40.43	—	Unverified
2	BiBERT	BLEU score	38.61	—	Unverified
3	Bi-SimCut	BLEU score	38.37	—	Unverified
4	Cutoff + Relaxed Attention + LM	BLEU score	37.96	—	Unverified
5	DRDA	BLEU score	37.95	—	Unverified
6	Transformer + R-Drop + Cutoff	BLEU score	37.9	—	Unverified
7	SimCut	BLEU score	37.81	—	Unverified
8	Cutoff+Knee	BLEU score	37.78	—	Unverified
9	Cutoff	BLEU score	37.6	—	Unverified
10	CipherDAug	BLEU score	37.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HWTSC-Teacher-Sim	Score	19.97	—	Unverified
2	MS-COMET-22	Score	19.89	—	Unverified
3	MS-COMET-QE-22	Score	19.76	—	Unverified
4	KG-BERTScore	Score	17.28	—	Unverified
5	metricx_xl_DA_2019	Score	17.17	—	Unverified
6	COMET-QE	Score	16.8	—	Unverified
7	COMET-22	Score	16.31	—	Unverified
8	UniTE-src	Score	15.68	—	Unverified
9	UniTE-ref	Score	15.38	—	Unverified
10	metricx_xxl_DA_2019	Score	15.24	—	Unverified