SOTAVerified

Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Showing 501550 of 10752 papers

TitleStatusHype
Investigating Sparsity in Recurrent Neural NetworksCode1
Generating Gender Alternatives in Machine Translation0
Teaching LLMs at Charles University: Assignments and Activities0
Simply Trainable Nearest Neighbour Machine Translation with GPU Inference0
The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs0
Advancing Neural Network Performance through Emergence-Promoting Initialization SchemeCode0
Granularity is crucial when applying differential privacy to text: An investigation for neural machine translationCode0
Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation with Ambiguous Attitude WordsCode0
Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language ModelsCode0
Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines0
Fine-grained Gender Control in Machine Translation with Large Language Models0
Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data0
CoVoSwitch: Machine Translation of Synthetic Code-Switched Text Based on Intonation UnitsCode0
Towards Zero-Shot Multimodal Machine TranslationCode0
Translate-and-Revise: Boosting Large Language Models for Constrained Translation0
Fixed and Adaptive Simultaneous Machine Translation Strategies Using AdaptersCode0
MASIVE: Open-Ended Affective State Identification in English and SpanishCode0
Ancient Korean Archive Translation: Comparison Analysis on Statistical phrase alignment, LLM in-context learning, and inter-methodological approach0
Scaling Sign Language Translation0
LLMs-in-the-loop Part-1: Expert Small AI Models for Bio-Medical Text Translation0
AraFinNLP 2024: The First Arabic Financial NLP Shared Task0
sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting0
Towards Chapter-to-Chapter Context-Aware Literary Translation via Large Language Models0
DAHRS: Divergence-Aware Hallucination-Remediated SRL Projection0
Rule-Based, Neural and LLM Back-Translation: Comparative Insights from a Variant of Ladin0
Learning Program Behavioral Models from Synthesized Input-Output PairsCode1
Tamil Language Computing: the Present and the Future0
Arabic Automatic Story Generation with Large Language ModelsCode0
Segment-Based Interactive Machine Translation for Pre-trained Models0
Enhancing Low-Resource NMT with a Multilingual Encoder and Knowledge Distillation: A Case StudyCode0
An Automatic Quality Metric for Evaluating Simultaneous Interpretation0
Large Language Models for Judicial Entity Extraction: A Comparative Study0
How Effective are State Space Models for Machine Translation?Code0
Predicting Word Similarity in Context with Referential Translation Machines0
Rethinking Targeted Adversarial Attacks For Neural Machine TranslationCode0
SmurfCat at PAN 2024 TextDetox: Alignment of Multilingual Transformers for Text DetoxificationCode0
Enhancing Language Learning through Technology: Introducing a New English-Azerbaijani (Arabic Script) Parallel Corpus0
NADI 2024: The Fifth Nuanced Arabic Dialect Identification Shared Task0
Automatic Prediction of the Performance of Every Parser0
Identifying Intensity of the Structure and Content in Tweets and the Discriminative Power of Attributes in Context with Referential Translation Machines0
Toucan: Many-to-Many Translation for 150 African Language PairsCode0
QET: Enhancing Quantized LLM Parameters and KV cache Compression through Element Substitution and Residual Clustering0
Finetuning End-to-End Models for Estonian Conversational Spoken Language Translation0
Regurgitative Training: The Value of Real Data in Training Large Language Models0
Sentence-level Aggregation of Lexical Metrics Correlates Stronger with Human Judgements than Corpus-level Aggregation0
CATT: Character-based Arabic Tashkeel TransformerCode2
A Case Study on Context-Aware Neural Machine Translation with Multi-Task Learning0
Evaluating Automatic Metrics with Incremental Machine Translation SystemsCode0
Translatotron-V(ison): An End-to-End Model for In-Image Machine TranslationCode1
How to Learn in a Noisy World? Self-Correcting the Real-World Data Noise on Machine Translation0
Show:102550
← PrevPage 11 of 216Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Transformer Cycle (Rev)BLEU score35.14Unverified
2Noisy back-translationBLEU score35Unverified
3Transformer+Rep(Uni)BLEU score33.89Unverified
4T5-11BBLEU score32.1Unverified
5BiBERTBLEU score31.26Unverified
6Transformer + R-DropBLEU score30.91Unverified
7Bi-SimCutBLEU score30.78Unverified
8BERT-fused NMTBLEU score30.75Unverified
9Data Diversification - TransformerBLEU score30.7Unverified
10SimCutBLEU score30.56Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer+BT (ADMIN init)BLEU score46.4Unverified
2Noisy back-translationBLEU score45.6Unverified
3mRASP+Fine-TuneBLEU score44.3Unverified
4Transformer + R-DropBLEU score43.95Unverified
5AdminBLEU score43.8Unverified
6Transformer (ADMIN init)BLEU score43.8Unverified
7BERT-fused NMTBLEU score43.78Unverified
8MUSE(Paralllel Multi-scale Attention)BLEU score43.5Unverified
9T5BLEU score43.4Unverified
10Local Joint Self-attentionBLEU score43.3Unverified
#ModelMetricClaimedVerifiedStatus
1PiNMTBLEU score40.43Unverified
2BiBERTBLEU score38.61Unverified
3Bi-SimCutBLEU score38.37Unverified
4Cutoff + Relaxed Attention + LMBLEU score37.96Unverified
5DRDABLEU score37.95Unverified
6Transformer + R-Drop + CutoffBLEU score37.9Unverified
7SimCutBLEU score37.81Unverified
8Cutoff+KneeBLEU score37.78Unverified
9CutoffBLEU score37.6Unverified
10CipherDAugBLEU score37.53Unverified
#ModelMetricClaimedVerifiedStatus
1HWTSC-Teacher-SimScore19.97Unverified
2MS-COMET-22Score19.89Unverified
3MS-COMET-QE-22Score19.76Unverified
4KG-BERTScoreScore17.28Unverified
5metricx_xl_DA_2019Score17.17Unverified
6COMET-QEScore16.8Unverified
7COMET-22Score16.31Unverified
8UniTE-srcScore15.68Unverified
9UniTE-refScore15.38Unverified
10metricx_xxl_DA_2019Score15.24Unverified