SOTAVerified

Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Showing 110 of 10752 papers

TitleStatusHype
Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings0
Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation0
GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation0
TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation0
Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval0
Intrinsic vs. Extrinsic Evaluation of Czech Sentence Embeddings: Semantic Relevance Doesn't Help with MT Evaluation0
Has Machine Translation Evaluation Achieved Human Parity? The Human Reference and the Limits of ProgressCode0
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical DistillationCode0
Semantic similarity estimation for domain specific data using BERT and other techniques0
Sequence-to-Sequence Models with Attention Mechanistically Map to the Architecture of Human Memory Search0
Show:102550
← PrevPage 1 of 1076Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Exploiting Mono at Scale (single)SacreBLEU40.9Unverified
2MADLBLEU score40.68Unverified
3Attentional encoder-decoder + BPEBLEU score34.2Unverified
4Linguistic Input FeaturesBLEU score28.4Unverified
5DeLighTBLEU score28Unverified
6FLAN 137B (zero-shot)BLEU score27Unverified
7TransformerBLEU score26.7Unverified
8FLAN 137B (few-shot, k=11)BLEU score26.1Unverified
9BiRNN + GCN (Syn + Sem)BLEU score24.9Unverified
10SMT + iterative backtranslation (unsupervised)BLEU score18.23Unverified