SOTAVerified

Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Showing 110 of 10752 papers

TitleStatusHype
Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings0
Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation0
GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation0
TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation0
Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval0
Intrinsic vs. Extrinsic Evaluation of Czech Sentence Embeddings: Semantic Relevance Doesn't Help with MT Evaluation0
Has Machine Translation Evaluation Achieved Human Parity? The Human Reference and the Limits of ProgressCode0
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical DistillationCode0
Semantic similarity estimation for domain specific data using BERT and other techniques0
Sequence-to-Sequence Models with Attention Mechanistically Map to the Architecture of Human Memory Search0
Show:102550
← PrevPage 1 of 1076Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Exploiting Mono at Scale (single)SacreBLEU47.5Unverified
2FLAN 137B (few-shot, k=11)BLEU score40.7Unverified
3FLAN 137B (zero-shot)BLEU score38.9Unverified
4Attentional encoder-decoder + BPEBLEU score38.6Unverified
5Linguistic Input FeaturesBLEU score32.9Unverified
6SMT + iterative backtranslation (unsupervised)BLEU score23.05Unverified
7Unsupervised NMT + weight-sharingBLEU score14.62Unverified
8Unsupervised S2S with attentionBLEU score13.33Unverified