Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–550 of 10752 papers

Title	Date	Tasks	Status	Hype
Investigating Sparsity in Recurrent Neural Networks	Jul 30, 2024	Machine TranslationNetwork Pruning	CodeCode Available	1
Generating Gender Alternatives in Machine Translation	Jul 29, 2024	Machine TranslationTranslation	—Unverified	0
Teaching LLMs at Charles University: Assignments and Activities	Jul 29, 2024	Machine TranslationTranslation	—Unverified	0
Simply Trainable Nearest Neighbour Machine Translation with GPU Inference	Jul 29, 2024	Domain AdaptationGPU	—Unverified	0
The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs	Jul 26, 2024	Machine TranslationNMT	—Unverified	0
Advancing Neural Network Performance through Emergence-Promoting Initialization Scheme	Jul 26, 2024	Machine Translation	CodeCode Available	0
Granularity is crucial when applying differential privacy to text: An investigation for neural machine translation	Jul 26, 2024	Machine TranslationNMT	CodeCode Available	0
Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation with Ambiguous Attitude Words	Jul 23, 2024	Machine TranslationTranslation	CodeCode Available	0
Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models	Jul 23, 2024	HallucinationMachine Translation	CodeCode Available	0
Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines	Jul 22, 2024	Language ModelingLanguage Modelling	—Unverified	0
Fine-grained Gender Control in Machine Translation with Large Language Models	Jul 21, 2024	Machine TranslationSentence	—Unverified	0
Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data	Jul 20, 2024	Language ModellingMachine Translation	—Unverified	0
CoVoSwitch: Machine Translation of Synthetic Code-Switched Text Based on Intonation Units	Jul 19, 2024	Machine TranslationSpeech-to-Text	CodeCode Available	0
Towards Zero-Shot Multimodal Machine Translation	Jul 18, 2024	Language ModellingMachine Translation	CodeCode Available	0
Translate-and-Revise: Boosting Large Language Models for Constrained Translation	Jul 18, 2024	Machine TranslationNMT	—Unverified	0
Fixed and Adaptive Simultaneous Machine Translation Strategies Using Adapters	Jul 18, 2024	DecoderMachine Translation	CodeCode Available	0
MASIVE: Open-Ended Affective State Identification in English and Spanish	Jul 16, 2024	Emotion RecognitionMachine Translation	CodeCode Available	0
Ancient Korean Archive Translation: Comparison Analysis on Statistical phrase alignment, LLM in-context learning, and inter-methodological approach	Jul 16, 2024	In-Context LearningMachine Translation	—Unverified	0
Scaling Sign Language Translation	Jul 16, 2024	DecoderGloss-free Sign Language Translation	—Unverified	0
LLMs-in-the-loop Part-1: Expert Small AI Models for Bio-Medical Text Translation	Jul 16, 2024	ArticlesMachine Translation	—Unverified	0
AraFinNLP 2024: The First Arabic Financial NLP Shared Task	Jul 13, 2024	Intent DetectionMachine Translation	—Unverified	0
sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting	Jul 13, 2024	Machine TranslationQuestion Answering	—Unverified	0
Towards Chapter-to-Chapter Context-Aware Literary Translation via Large Language Models	Jul 12, 2024	Machine TranslationSentence	—Unverified	0
DAHRS: Divergence-Aware Hallucination-Remediated SRL Projection	Jul 12, 2024	fr-enHallucination	—Unverified	0
Rule-Based, Neural and LLM Back-Translation: Comparative Insights from a Variant of Ladin	Jul 11, 2024	Language ModelingLanguage Modelling	—Unverified	0
Learning Program Behavioral Models from Synthesized Input-Output Pairs	Jul 11, 2024	Machine Translation	CodeCode Available	1
Tamil Language Computing: the Present and the Future	Jul 11, 2024	Language ModellingMachine Translation	—Unverified	0
Arabic Automatic Story Generation with Large Language Models	Jul 10, 2024	Machine TranslationStory Generation	CodeCode Available	0
Segment-Based Interactive Machine Translation for Pre-trained Models	Jul 9, 2024	Machine TranslationNMT	—Unverified	0
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case Study	Jul 9, 2024	Knowledge DistillationLanguage Modeling	CodeCode Available	0
An Automatic Quality Metric for Evaluating Simultaneous Interpretation	Jul 9, 2024	Machine TranslationTranslation	—Unverified	0
Large Language Models for Judicial Entity Extraction: A Comparative Study	Jul 8, 2024	Information RetrievalLanguage Modeling	—Unverified	0
How Effective are State Space Models for Machine Translation?	Jul 7, 2024	Machine TranslationMamba	CodeCode Available	0
Predicting Word Similarity in Context with Referential Translation Machines	Jul 7, 2024	Machine TranslationTranslation	—Unverified	0
Rethinking Targeted Adversarial Attacks For Neural Machine Translation	Jul 7, 2024	Adversarial AttackMachine Translation	CodeCode Available	0
SmurfCat at PAN 2024 TextDetox: Alignment of Multilingual Transformers for Text Detoxification	Jul 7, 2024	Data AugmentationMachine Translation	CodeCode Available	0
Enhancing Language Learning through Technology: Introducing a New English-Azerbaijani (Arabic Script) Parallel Corpus	Jul 6, 2024	ArticlesMachine Translation	—Unverified	0
NADI 2024: The Fifth Nuanced Arabic Dialect Identification Shared Task	Jul 6, 2024	Dialect IdentificationMachine Translation	—Unverified	0
Automatic Prediction of the Performance of Every Parser	Jul 6, 2024	Dimensionality ReductionMachine Translation	—Unverified	0
Identifying Intensity of the Structure and Content in Tweets and the Discriminative Power of Attributes in Context with Referential Translation Machines	Jul 6, 2024	AttributeMachine Translation	—Unverified	0
Toucan: Many-to-Many Translation for 150 African Language Pairs	Jul 5, 2024	Machine TranslationTranslation	CodeCode Available	0
QET: Enhancing Quantized LLM Parameters and KV cache Compression through Element Substitution and Residual Clustering	Jul 4, 2024	Computational EfficiencyEdge-computing	—Unverified	0
Finetuning End-to-End Models for Estonian Conversational Spoken Language Translation	Jul 4, 2024	Machine Translationspeech-recognition	—Unverified	0
Regurgitative Training: The Value of Real Data in Training Large Language Models	Jul 3, 2024	DiversityLarge Language Model	—Unverified	0
Sentence-level Aggregation of Lexical Metrics Correlates Stronger with Human Judgements than Corpus-level Aggregation	Jul 3, 2024	Machine TranslationSentence	—Unverified	0
CATT: Character-based Arabic Tashkeel Transformer	Jul 3, 2024	Arabic Text DiacritizationDecoder	CodeCode Available	2
A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning	Jul 3, 2024	Machine TranslationMulti-Task Learning	—Unverified	0
Evaluating Automatic Metrics with Incremental Machine Translation Systems	Jul 3, 2024	Machine TranslationTranslation	CodeCode Available	0
Translatotron-V(ison): An End-to-End Model for In-Image Machine Translation	Jul 3, 2024	DecoderMachine Translation	CodeCode Available	1
How to Learn in a Noisy World? Self-Correcting the Real-World Data Noise on Machine Translation	Jul 2, 2024	Machine TranslationSemantic Similarity	—Unverified	0

Show:10 25 50

← PrevPage 11 of 216Next →

All datasets WMT2014 English-German WMT2014 English-French IWSLT2014 German-English ACES WMT2016 English-Romanian WMT2016 Romanian-English WMT2014 German-English IWSLT2015 German-English WMT2016 English-German IWSLT2015 English-Vietnamese IWSLT2015 English-German WMT2016 German-English

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Transformer Cycle (Rev)	BLEU score	35.14	—	Unverified
2	Noisy back-translation	BLEU score	35	—	Unverified
3	Transformer+Rep(Uni)	BLEU score	33.89	—	Unverified
4	T5-11B	BLEU score	32.1	—	Unverified
5	BiBERT	BLEU score	31.26	—	Unverified
6	Transformer + R-Drop	BLEU score	30.91	—	Unverified
7	Bi-SimCut	BLEU score	30.78	—	Unverified
8	BERT-fused NMT	BLEU score	30.75	—	Unverified
9	Data Diversification - Transformer	BLEU score	30.7	—	Unverified
10	SimCut	BLEU score	30.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer+BT (ADMIN init)	BLEU score	46.4	—	Unverified
2	Noisy back-translation	BLEU score	45.6	—	Unverified
3	mRASP+Fine-Tune	BLEU score	44.3	—	Unverified
4	Transformer + R-Drop	BLEU score	43.95	—	Unverified
5	Admin	BLEU score	43.8	—	Unverified
6	Transformer (ADMIN init)	BLEU score	43.8	—	Unverified
7	BERT-fused NMT	BLEU score	43.78	—	Unverified
8	MUSE(Paralllel Multi-scale Attention)	BLEU score	43.5	—	Unverified
9	T5	BLEU score	43.4	—	Unverified
10	Local Joint Self-attention	BLEU score	43.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PiNMT	BLEU score	40.43	—	Unverified
2	BiBERT	BLEU score	38.61	—	Unverified
3	Bi-SimCut	BLEU score	38.37	—	Unverified
4	Cutoff + Relaxed Attention + LM	BLEU score	37.96	—	Unverified
5	DRDA	BLEU score	37.95	—	Unverified
6	Transformer + R-Drop + Cutoff	BLEU score	37.9	—	Unverified
7	SimCut	BLEU score	37.81	—	Unverified
8	Cutoff+Knee	BLEU score	37.78	—	Unverified
9	Cutoff	BLEU score	37.6	—	Unverified
10	CipherDAug	BLEU score	37.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HWTSC-Teacher-Sim	Score	19.97	—	Unverified
2	MS-COMET-22	Score	19.89	—	Unverified
3	MS-COMET-QE-22	Score	19.76	—	Unverified
4	KG-BERTScore	Score	17.28	—	Unverified
5	metricx_xl_DA_2019	Score	17.17	—	Unverified
6	COMET-QE	Score	16.8	—	Unverified
7	COMET-22	Score	16.31	—	Unverified
8	UniTE-src	Score	15.68	—	Unverified
9	UniTE-ref	Score	15.38	—	Unverified
10	metricx_xxl_DA_2019	Score	15.24	—	Unverified