SOTAVerified

Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Showing 73017350 of 10752 papers

TitleStatusHype
Simple Compound Splitting for German0
Factoring Ambiguity out of the Prediction of Compositionality for German Multi-Word Expressions0
A Layered Language Model based Hybrid Approach to Automatic Full Diacritization of Arabic0
SHAKKIL: An Automatic Diacritization System for Modern Standard Arabic Texts0
Arabic Textual Entailment with Word Embeddings0
Arabic Dialect Identification Using iVectors and ASR Transcripts0
Adapting a State-of-the-Art Tagger for South Slavic Languages to Non-Standard Text0
Rule-Based Translation of Spanish Verb-Noun Combinations into Basque0
Why Catalan-Spanish Neural Machine Translation? Analysis, comparison and combination with standard Rule and Phrase-based technologies0
Universal Dependencies for Arabic0
A Preliminary Study of Croatian Lexical Substitution0
Word Similarity Datasets for Indian Languages: Annotation and Baseline Systems0
Semantic Similarity of Arabic Sentences with Word Embeddings0
Ethical Considerations in NLP Shared Tasks0
Neural Networks for Multi-Word Expression Detection0
Slavic Forest, Norwegian Wood0
ParaDi: Dictionary of Paraphrases of Czech Complex Predicates with Light Verbs0
Using bilingual word-embeddings for multilingual collocation extraction0
Using Linked Disambiguated Distributional Networks for Word Sense Disambiguation0
Discriminating between Similar Languages Using a Combination of Typed and Untyped Character N-grams and Words0
Discovering Light Verb Constructions and their Translations from Parallel Corpora without Word Alignment0
A Neural Architecture for Dialectal Arabic Segmentation0
Kurdish Interdialect Machine Translation0
Using Coreference Links to Improve Spanish-to-English Machine TranslationCode0
Toward Pan-Slavic NLP: Some Experiments with Language Adaptation0
Automated WordNet Construction Using Word EmbeddingsCode0
Cross-lingual dependency parsing for closely related languages - Helsinki's submission to VarDial 20170
Identifying Effective Translations for Cross-lingual Arabic-to-English User-generated Speech Search0
Human Evaluation of Multi-modal Neural Machine Translation: A Case-Study on E-Commerce Listing Titles0
Parsing and MWE Detection: Fips at the PARSEME Shared Task0
German Dialect Identification in Interview Transcriptions0
Evaluating the Reliability and Interaction of Recursively Used Feature Classes for Terminology Extraction0
The SUMMA Platform Prototype0
Building Web-Interfaces for Vector Semantic Models with the WebVectors Toolkit0
QCRI Live Speech Translation System0
Modelling metaphor with attribute-based semantics0
Machine Translation of Spanish Personal and Possessive Pronouns Using Anaphora ProbabilitiesCode0
Literal or idiomatic? Identifying the reading of single occurrences of German multiword expressions using word embeddings0
Lexicalized Reordering for Left-to-Right Hierarchical Phrase-based Translation0
Using Images to Improve Machine-Translating E-Commerce Product Listings.0
Autobank: a semi-automatic annotation tool for developing deep Minimalist Grammar treebanks0
Co-reference Resolution of Elided Subjects and Possessive Pronouns in Spanish-English Statistical Machine Translation0
Using Word Embedding for Cross-Language Plagiarism Detection0
Continuous multilinguality with language vectors0
Alto: Rapid Prototyping for Parsing and Translation0
Common Round: Application of Language Technologies to Large-Scale Web Debates0
A Rich Morphological Tagger for English: Exploring the Cross-Linguistic Tradeoff Between Morphology and Syntax0
Online Automatic Post-editing for MT in a Multi-Domain Translation Environment0
Building Lexical Vector Representations from Concept Definitions0
Neural vs. Phrase-Based Machine Translation in a Multi-Domain Scenario0
Show:102550
← PrevPage 147 of 216Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Transformer Cycle (Rev)BLEU score35.14Unverified
2Noisy back-translationBLEU score35Unverified
3Transformer+Rep(Uni)BLEU score33.89Unverified
4T5-11BBLEU score32.1Unverified
5BiBERTBLEU score31.26Unverified
6Transformer + R-DropBLEU score30.91Unverified
7Bi-SimCutBLEU score30.78Unverified
8BERT-fused NMTBLEU score30.75Unverified
9Data Diversification - TransformerBLEU score30.7Unverified
10SimCutBLEU score30.56Unverified
#ModelMetricClaimedVerifiedStatus
1Transformer+BT (ADMIN init)BLEU score46.4Unverified
2Noisy back-translationBLEU score45.6Unverified
3mRASP+Fine-TuneBLEU score44.3Unverified
4Transformer + R-DropBLEU score43.95Unverified
5Transformer (ADMIN init)BLEU score43.8Unverified
6AdminBLEU score43.8Unverified
7BERT-fused NMTBLEU score43.78Unverified
8MUSE(Paralllel Multi-scale Attention)BLEU score43.5Unverified
9T5BLEU score43.4Unverified
10Local Joint Self-attentionBLEU score43.3Unverified
#ModelMetricClaimedVerifiedStatus
1PiNMTBLEU score40.43Unverified
2BiBERTBLEU score38.61Unverified
3Bi-SimCutBLEU score38.37Unverified
4Cutoff + Relaxed Attention + LMBLEU score37.96Unverified
5DRDABLEU score37.95Unverified
6Transformer + R-Drop + CutoffBLEU score37.9Unverified
7SimCutBLEU score37.81Unverified
8Cutoff+KneeBLEU score37.78Unverified
9CutoffBLEU score37.6Unverified
10CipherDAugBLEU score37.53Unverified
#ModelMetricClaimedVerifiedStatus
1HWTSC-Teacher-SimScore19.97Unverified
2MS-COMET-22Score19.89Unverified
3MS-COMET-QE-22Score19.76Unverified
4KG-BERTScoreScore17.28Unverified
5metricx_xl_DA_2019Score17.17Unverified
6COMET-QEScore16.8Unverified
7COMET-22Score16.31Unverified
8UniTE-srcScore15.68Unverified
9UniTE-refScore15.38Unverified
10metricx_xxl_DA_2019Score15.24Unverified