Machine Translation

Machine translation is the task of translating a sentence in a source language to a different target language.

Approaches for machine translation can range from rule-based to statistical to neural-based. More recently, encoder-decoder attention-based architectures like BERT have attained major improvements in machine translation.

One of the most popular datasets used to benchmark machine translation systems is the WMT family of datasets. Some of the most commonly used evaluation metrics for machine translation systems include BLEU, METEOR, NIST, and others.

( Image credit: Google seq2seq )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10551–10600 of 10752 papers

Title	Date	Tasks	Status
Error profiling for evaluation of machine-translated text: a Polish-English case study	May 1, 2012	Machine TranslationTranslation	—Unverified
Assessing the Comparability of News Texts	May 1, 2012	Machine TranslationNatural Language Inference	—Unverified
Joint Segmentation and POS Tagging for Arabic Using a CRF-based Classifier	May 1, 2012	ArticlesBIG-bench Machine Learning	—Unverified
Latvian and Lithuanian Named Entity Recognition with TildeNER	May 1, 2012	Machine Translationnamed-entity-recognition	—Unverified
Chinese Whispers: Cooperative Paraphrase Acquisition	May 1, 2012	Machine TranslationNatural Language Inference	—Unverified
PET: a Tool for Post-editing and Assessing Machine Translation	May 1, 2012	Machine TranslationSentence	—Unverified
Linguistic Analysis Processing Line for Bulgarian	May 1, 2012	Language ModellingLemmatization	—Unverified
A Holistic Approach to Bilingual Sentence Fragment Extraction from Comparable Corpora	May 1, 2012	Boundary DetectionMachine Translation	—Unverified
Automatic Translation of Scholarly Terms into Patent Terms Using Synonym Extraction Techniques	May 1, 2012	Information RetrievalMachine Translation	—Unverified
CLCM - A Linguistic Resource for Effective Simplification of Instructions in the Crisis Management Domain and its Evaluations	May 1, 2012	Machine TranslationManagement	—Unverified
Le Petit Prince in UNL	May 1, 2012	Information RetrievalMachine Translation	—Unverified
LDC Language Resource Database: Building a Bibliographic Database	May 1, 2012	Information RetrievalMachine Translation	—Unverified
BUCEADOR, a multi-language search engine for digital libraries	May 1, 2012	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.	May 1, 2012	ArticlesMachine Translation	—Unverified
New language resources for the Pashto language	May 1, 2012	Machine TranslationTranslation	—Unverified
Expanding Parallel Resources for Medium-Density Languages for Free	May 1, 2012	Machine TranslationMorphological Analysis	—Unverified
Integrating NLP Tools in a Distributed Environment: A Case Study Chaining a Tagger with a Dependency Parser	May 1, 2012	Machine TranslationPOS	—Unverified
Effort of Genre Variation and Prediction of System Performance	May 1, 2012	Domain AdaptationLanguage Modelling	—Unverified
Annotated Corpora for Word Alignment between Japanese and English and its Evaluation with MAP-based Word Aligner	May 1, 2012	Machine TranslationSentence	—Unverified
Evaluation of Classification Algorithms and Features for Collocation Extraction in Croatian	May 1, 2012	General ClassificationKeyword Extraction	—Unverified
A Repository of Data and Evaluation Resources for Natural Language Generation	May 1, 2012	Data-to-Text GenerationMachine Translation	—Unverified
A Study of Word-Classing for MT Reordering	May 1, 2012	Dependency ParsingLanguage Modelling	—Unverified
Large aligned treebanks for syntax-based machine translation	May 1, 2012	Language ModellingMachine Translation	—Unverified
A GUI to Detect and Correct Errors in Hindi Dependency Treebank	May 1, 2012	Machine Translation	—Unverified
Service Composition Scenarios for Task-Oriented Translation	May 1, 2012	Domain AdaptationLanguage Modelling	—Unverified
Semi-Automatic Sign Language Corpora Annotation using Lexical Representations of Signs	May 1, 2012	Hand SegmentationMachine Translation	—Unverified
Same domain different discourse style - A case study on Language Resources for data-driven Machine Translation	May 1, 2012	Information RetrievalMachine Translation	—Unverified
On the practice of error analysis for machine translation evaluation	May 1, 2012	Machine TranslationTranslation	—Unverified
Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation	May 1, 2012	ChunkingMachine Translation	—Unverified
An Analytical Model of Language Resource Sustainability	May 1, 2012	DescriptiveInformation Retrieval	—Unverified
Building a 70 billion word corpus of English from ClueWeb	May 1, 2012	Machine TranslationManagement	—Unverified
Collecting and Using Comparable Corpora for Statistical Machine Translation	May 1, 2012	Machine TranslationTranslation	—Unverified
Customization of the Europarl Corpus for Translation Studies	May 1, 2012	Machine TranslationRelation	—Unverified
An implementation of a Latvian resource grammar in Grammatical Framework	May 1, 2012	Machine TranslationSemantic Parsing	—Unverified
Alignment-based reordering for SMT	May 1, 2012	Machine TranslationPart-Of-Speech Tagging	—Unverified
Identifying bilingual Multi-Word Expressions for Statistical Machine Translation	May 1, 2012	Machine TranslationTranslation	—Unverified
Romanian TimeBank: An Annotated Parallel Corpus for Temporal Information	May 1, 2012	Information RetrievalMachine Translation	—Unverified
Constraint Based Description of Polish Multiword Expressions	May 1, 2012	Machine TranslationMorphological Analysis	—Unverified
Linguagrid: a network of Linguistic and Semantic Services for the Italian Language.	May 1, 2012	ClusteringDependency Parsing	—Unverified
A finite-state morphological transducer for Kyrgyz	May 1, 2012	Machine TranslationMorphological Analysis	CodeCode Available
Assessing Divergence Measures for Automated Document Routing in an Adaptive MT System	May 1, 2012	Document ClassificationMachine Translation	—Unverified
Parsing Any Domain English text to CoNLL dependencies	May 1, 2012	BenchmarkingDependency Parsing	—Unverified
Development and Application of a Cross-language Document Comparability Metric	May 1, 2012	Machine TranslationTranslation	—Unverified
Measuring the Divergence of Dependency Structures Cross-Linguistically to Improve Syntactic Projection Algorithms	May 1, 2012	Machine TranslationTranslation	—Unverified
English to Indonesian Transliteration to Support English Pronunciation Practice	May 1, 2012	Information RetrievalLanguage Modeling	—Unverified
Diversifiable Bootstrapping for Acquiring High-Coverage Paraphrase Resource	May 1, 2012	Information RetrievalMachine Translation	—Unverified
Buildind a Resource of Patterns Using Semantic Types	May 1, 2012	Machine TranslationNatural Language Inference	—Unverified
RWTH-PHOENIX-Weather: A Large Vocabulary Sign Language Recognition and Translation Corpus	May 1, 2012	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
Dealing with unknown words in statistical machine translation	May 1, 2012	Machine TranslationTranslation	—Unverified
The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation	May 1, 2012	Machine Translationspeech-recognition	—Unverified

Show:10 25 50

← PrevPage 212 of 216Next →

All datasets WMT2014 English-German WMT2014 English-French IWSLT2014 German-English ACES WMT2016 English-Romanian WMT2016 Romanian-English WMT2014 German-English IWSLT2015 German-English WMT2016 English-German IWSLT2015 English-Vietnamese IWSLT2015 English-German WMT2016 German-English

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Transformer Cycle (Rev)	BLEU score	35.14	—	Unverified
2	Noisy back-translation	BLEU score	35	—	Unverified
3	Transformer+Rep(Uni)	BLEU score	33.89	—	Unverified
4	T5-11B	BLEU score	32.1	—	Unverified
5	BiBERT	BLEU score	31.26	—	Unverified
6	Transformer + R-Drop	BLEU score	30.91	—	Unverified
7	Bi-SimCut	BLEU score	30.78	—	Unverified
8	BERT-fused NMT	BLEU score	30.75	—	Unverified
9	Data Diversification - Transformer	BLEU score	30.7	—	Unverified
10	SimCut	BLEU score	30.56	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Transformer+BT (ADMIN init)	BLEU score	46.4	—	Unverified
2	Noisy back-translation	BLEU score	45.6	—	Unverified
3	mRASP+Fine-Tune	BLEU score	44.3	—	Unverified
4	Transformer + R-Drop	BLEU score	43.95	—	Unverified
5	Transformer (ADMIN init)	BLEU score	43.8	—	Unverified
6	Admin	BLEU score	43.8	—	Unverified
7	BERT-fused NMT	BLEU score	43.78	—	Unverified
8	MUSE(Paralllel Multi-scale Attention)	BLEU score	43.5	—	Unverified
9	T5	BLEU score	43.4	—	Unverified
10	Local Joint Self-attention	BLEU score	43.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PiNMT	BLEU score	40.43	—	Unverified
2	BiBERT	BLEU score	38.61	—	Unverified
3	Bi-SimCut	BLEU score	38.37	—	Unverified
4	Cutoff + Relaxed Attention + LM	BLEU score	37.96	—	Unverified
5	DRDA	BLEU score	37.95	—	Unverified
6	Transformer + R-Drop + Cutoff	BLEU score	37.9	—	Unverified
7	SimCut	BLEU score	37.81	—	Unverified
8	Cutoff+Knee	BLEU score	37.78	—	Unverified
9	Cutoff	BLEU score	37.6	—	Unverified
10	CipherDAug	BLEU score	37.53	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	HWTSC-Teacher-Sim	Score	19.97	—	Unverified
2	MS-COMET-22	Score	19.89	—	Unverified
3	MS-COMET-QE-22	Score	19.76	—	Unverified
4	KG-BERTScore	Score	17.28	—	Unverified
5	metricx_xl_DA_2019	Score	17.17	—	Unverified
6	COMET-QE	Score	16.8	—	Unverified
7	COMET-22	Score	16.31	—	Unverified
8	UniTE-src	Score	15.68	—	Unverified
9	UniTE-ref	Score	15.38	—	Unverified
10	metricx_xxl_DA_2019	Score	15.24	—	Unverified