SOTAVerified

Multimodal Machine Translation

Multimodal machine translation is the task of doing machine translation with multiple data sources - for example, translating "a bird is flying over water" + an image of a bird over water to German text.

( Image credit: Findings of the Third Shared Task on Multimodal Machine Translation )

Papers

Showing 91100 of 108 papers

TitleStatusHype
Ensemble Sequence Level Training for Multimodal MT: OSU-Baidu WMT18 Multimodal Machine Translation System Report0
ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation0
Experiences of Adapting Multimodal Machine Translation Techniques for Hindi0
Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets0
Findings of the 2016 Conference on Machine Translation0
Findings of the 2017 Conference on Machine Translation (WMT17)0
Findings of the 2018 Conference on Machine Translation (WMT18)0
Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description0
Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models0
Generalization algorithm of multimodal pre-training model based on graph-text self-supervised training0
Show:102550
← PrevPage 10 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1delMeteor (EN-FR)74.6Unverified
2ERNIE-UniX2BLEU (EN-DE)49.3Unverified
3IKD-MMTBLEU (EN-DE)41.28Unverified
4DCCNBLEU (EN-DE)39.7Unverified
5CaglayanBLEU (EN-DE)39.4Unverified
6Gumbel-Attention MMTBLEU (EN-DE)39.2Unverified
7Multimodal TransformerBLEU (EN-DE)38.7Unverified
8ImagiTBLEU (EN-DE)38.4Unverified
9del+objBLEU (EN-DE)38Unverified
10VMMTFBLEU (EN-DE)37.6Unverified
#ModelMetricClaimedVerifiedStatus
1ViTABLEU (EN-HI)51.6Unverified
#ModelMetricClaimedVerifiedStatus
1ViTABLEU (EN-HI)44.6Unverified