SOTAVerified

Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches to machine translation range from rule-based to statistical to neural. More recently, attention-based encoder-decoder architectures such as the Transformer have brought major improvements in translation quality.

The WMT family of datasets is among the most popular benchmarks for machine translation systems. Commonly used evaluation metrics include BLEU, METEOR, and NIST.
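To make the BLEU metric concrete, here is a minimal sketch of a sentence-level, single-reference BLEU computation. It assumes whitespace tokenization, n-grams up to order 4, and no smoothing. The function names `ngrams` and `sentence_bleu` are illustrative and do not come from any particular library. Leaderboard figures are normally corpus-level scores on a 0-100 scale produced by standard tooling, so values from this sketch are not directly comparable to them.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(hypothesis, reference, max_n=4):
    """Unsmoothed single-reference BLEU on whitespace tokens, in [0, 1]."""
    hyp, ref = hypothesis.split(), reference.split()
    if not hyp:
        return 0.0
    log_precisions = []
    for n in range(1, max_n + 1):
        hyp_counts, ref_counts = ngrams(hyp, n), ngrams(ref, n)
        # Clip each hypothesis n-gram count by its count in the reference.
        overlap = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
        total = sum(hyp_counts.values())
        if overlap == 0 or total == 0:
            return 0.0  # no smoothing: any zero precision gives BLEU 0
        log_precisions.append(math.log(overlap / total))
    # Brevity penalty lowers the score of hypotheses shorter than the reference.
    bp = 1.0 if len(hyp) >= len(ref) else math.exp(1 - len(ref) / len(hyp))
    # Geometric mean of the n-gram precisions, scaled by the brevity penalty.
    return bp * math.exp(sum(log_precisions) / max_n)


if __name__ == "__main__":
    print(sentence_bleu("the cat sat on the mat", "the cat sat on the mat"))
    print(sentence_bleu("the cat sat on the", "the cat sat on the mat"))
```

In the second call every n-gram of the hypothesis appears in the reference, so the score is reduced only by the brevity penalty. Without smoothing, any hypothesis that shares no 4-gram with the reference scores 0, which is one reason BLEU is usually reported at corpus level.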

Papers

Showing 1051-1100 of 10752 papers

| Title | Status | Hype |
| --- | --- | --- |
| Towards General Error Diagnosis via Behavioral Testing in Machine Translation | Code | 0 |
| Steering Large Language Models for Machine Translation with Finetuning and In-Context Learning | | 0 |
| Simultaneous Machine Translation with Tailored Reference | | 0 |
| Ask Language Model to Clean Your Noisy Translation Data | | 0 |
| A Use Case: Reformulating Query Rewriting as a Statistical Machine Translation Problem | | 0 |
| Direct Neural Machine Translation with Task-level Mixture of Experts models | | 0 |
| GRI: Graph-based Relative Isomorphism of Word Embedding Spaces | Code | 0 |
| A Tale of Pronouns: Interpretability Informs Gender Bias Mitigation for Fairer Instruction-Tuned Machine Translation | Code | 0 |
| knn-seq: Efficient, Extensible kNN-MT Framework | Code | 1 |
| Document-Level Language Models for Machine Translation | | 0 |
| Program Translation via Code Distillation | | 0 |
| IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems | Code | 0 |
| An Empirical Study of Translation Hypothesis Ensembling with Large Language Models | Code | 0 |
| Exploring Automatic Evaluation Methods based on a Decoder-based LLM for Text Generation | | 0 |
| Enhancing Neural Machine Translation with Semantic Units | Code | 0 |
| Long-form Simultaneous Speech Translation: Thesis Proposal | | 0 |
| xCOMET: Transparent Machine Translation Evaluation through Fine-grained Error Detection | Code | 1 |
| Towards a Better Understanding of Variations in Zero-Shot Neural Machine Translation Performance | Code | 0 |
| UvA-MT's Participation in the WMT23 General Translation Shared Task | | 0 |
| MILPaC: A Novel Benchmark for Evaluating Translation of Legal Text to Indian Languages | Code | 0 |
| Attentive Multi-Layer Perceptron for Non-autoregressive Generation | Code | 0 |
| Human-in-the-loop Machine Translation with Large Language Model | Code | 0 |
| Towards Example-Based NMT with Multi-Levenshtein Transformers | Code | 0 |
| Political claim identification and categorization in a multilingual setting: First experiments | | 0 |
| xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark | Code | 0 |
| Why bother with geometry? On the relevance of linear decompositions of Transformer embeddings | Code | 0 |
| Crossing the Threshold: Idiomatic Machine Translation through Retrieval Augmentation and Loss Weighting | Code | 0 |
| Quality-Aware Translation Models: Efficient Generation and Quality Estimation in a Single Model | | 0 |
| In-Context Explainers: Harnessing LLMs for Explaining Black Box Models | Code | 1 |
| Larth: Dataset and Machine Translation for Etruscan | Code | 0 |
| Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting | | 0 |
| Synslator: An Interactive Machine Translation Tool with Online Learning | | 0 |
| CodeTransOcean: A Comprehensive Multilingual Benchmark for Code Translation | Code | 1 |
| Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with the GeNTE Corpus | Code | 0 |
| LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT | Code | 2 |
| DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers | Code | 0 |
| Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns | Code | 1 |
| Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness | | 0 |
| Nugget: Neural Agglomerative Embeddings of Text | Code | 0 |
| Necessary and Sufficient Watermark for Large Language Models | | 0 |
| Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models | | 0 |
| Quantifying the Plausibility of Context Reliance in Neural Machine Translation | Code | 2 |
| Colloquial Persian POS (CPPOS) Corpus: A Novel Corpus for Colloquial Persian Part of Speech Tagging | | 0 |
| Sparse Backpropagation for MoE Training | | 0 |
| Unlikelihood Tuning on Negative Samples Amazingly Improves Zero-Shot Translation | Code | 0 |
| A Benchmark for Learning to Translate a New Language from One Grammar Book | Code | 0 |
| Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization | | 0 |
| Enhancing Sharpness-Aware Optimization Through Variance Suppression | Code | 1 |
| Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing | | 0 |
| Developing automatic verbatim transcripts for international multilingual meetings: an end-to-end solution | | 0 |

Benchmark Results

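| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Transformer Cycle (Rev) | BLEU score | 35.14 | | Unverified |
| 2 | Noisy back-translation | BLEU score | 35 | | Unverified |
| 3 | Transformer+Rep(Uni) | BLEU score | 33.89 | | Unverified |
| 4 | T5-11B | BLEU score | 32.1 | | Unverified |
| 5 | BiBERT | BLEU score | 31.26 | | Unverified |
| 6 | Transformer + R-Drop | BLEU score | 30.91 | | Unverified |
| 7 | Bi-SimCut | BLEU score | 30.78 | | Unverified |
| 8 | BERT-fused NMT | BLEU score | 30.75 | | Unverified |
| 9 | Data Diversification - Transformer | BLEU score | 30.7 | | Unverified |
| 10 | SimCut | BLEU score | 30.56 | | Unverified |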

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Transformer+BT (ADMIN init) | BLEU score | 46.4 | | Unverified |
| 2 | Noisy back-translation | BLEU score | 45.6 | | Unverified |
| 3 | mRASP+Fine-Tune | BLEU score | 44.3 | | Unverified |
| 4 | Transformer + R-Drop | BLEU score | 43.95 | | Unverified |
| 5 | Transformer (ADMIN init) | BLEU score | 43.8 | | Unverified |
| 6 | Admin | BLEU score | 43.8 | | Unverified |
| 7 | BERT-fused NMT | BLEU score | 43.78 | | Unverified |
| 8 | MUSE (Parallel Multi-scale Attention) | BLEU score | 43.5 | | Unverified |
| 9 | T5 | BLEU score | 43.4 | | Unverified |
| 10 | Local Joint Self-attention | BLEU score | 43.3 | | Unverified |
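
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PiNMT | BLEU score | 40.43 | | Unverified |
| 2 | BiBERT | BLEU score | 38.61 | | Unverified |
| 3 | Bi-SimCut | BLEU score | 38.37 | | Unverified |
| 4 | Cutoff + Relaxed Attention + LM | BLEU score | 37.96 | | Unverified |
| 5 | DRDA | BLEU score | 37.95 | | Unverified |
| 6 | Transformer + R-Drop + Cutoff | BLEU score | 37.9 | | Unverified |
| 7 | SimCut | BLEU score | 37.81 | | Unverified |
| 8 | Cutoff+Knee | BLEU score | 37.78 | | Unverified |
| 9 | Cutoff | BLEU score | 37.6 | | Unverified |
| 10 | CipherDAug | BLEU score | 37.53 | | Unverified |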
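
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | HWTSC-Teacher-Sim | Score | 19.97 | | Unverified |
| 2 | MS-COMET-22 | Score | 19.89 | | Unverified |
| 3 | MS-COMET-QE-22 | Score | 19.76 | | Unverified |
| 4 | KG-BERTScore | Score | 17.28 | | Unverified |
| 5 | metricx_xl_DA_2019 | Score | 17.17 | | Unverified |
| 6 | COMET-QE | Score | 16.8 | | Unverified |
| 7 | COMET-22 | Score | 16.31 | | Unverified |
| 8 | UniTE-src | Score | 15.68 | | Unverified |
| 9 | UniTE-ref | Score | 15.38 | | Unverified |
| 10 | metricx_xxl_DA_2019 | Score | 15.24 | | Unverified |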