SOTAVerified

Multimodal Machine Translation

Multimodal machine translation is the task of translating text with the help of additional data sources, typically images. For example, a system receives the English sentence "a bird is flying over water" together with an image of a bird over water and produces the German translation.

(Image credit: Findings of the Third Shared Task on Multimodal Machine Translation)
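As a rough illustration of the task definition above, the sketch below shows how one multimodal translation instance might be represented. The class name `MultimodalExample`, its fields, the helper `describe_inputs`, and the image file name are all hypothetical and not taken from any of the listed papers or datasets.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class MultimodalExample:
    """One translation instance: source text plus an optional image (hypothetical schema)."""
    source_text: str
    source_lang: str
    target_lang: str
    image_path: Optional[str] = None  # None means a text-only instance


def describe_inputs(example: MultimodalExample) -> List[str]:
    """Return the modalities a model would receive for this example."""
    modalities = ["text"]
    if example.image_path is not None:
        modalities.append("image")
    return modalities


# The example from the task description: English sentence plus image, German target.
example = MultimodalExample(
    source_text="a bird is flying over water",
    source_lang="en",
    target_lang="de",
    image_path="bird_over_water.jpg",  # placeholder file name, not a real asset
)
print(describe_inputs(example))
```

A text-only baseline can be expressed with the same structure by leaving `image_path` unset, which makes it straightforward to compare systems with and without the visual input.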

Papers

Showing 76-100 of 108 papers

| Title | Status | Hype |
| --- | --- | --- |
| A Visually-Grounded Parallel Corpus with Phrase-to-Region Linking | | 0 |
| CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation | | 0 |
| CUNI System for the WMT18 Multimodal Translation Task | | 0 |
| CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks | | 0 |
| DCU-UvA Multimodal MT System Report | | 0 |
| Debiasing Word Embeddings Improves Multimodal Machine Translation | | 0 |
| Detecting Concrete Visual Tokens for Multimodal Machine Translation | | 0 |
| Doubly-Attentive Decoder for Multi-modal Neural Machine Translation | | 0 |
| Doubly Attentive Transformer Machine Translation | | 0 |
| Adaptive Fusion Techniques for Multimodal Data | | 0 |
| Efficient Object-Level Visual Context Modeling for Multimodal Machine Translation: Masking Irrelevant Objects Helps Grounding | | 0 |
| EMMeTT: Efficient Multimodal Machine Translation Training | | 0 |
| Ensemble Sequence Level Training for Multimodal MT: OSU-Baidu WMT18 Multimodal Machine Translation System Report | | 0 |
| ERNIE-UniX2: A Unified Cross-lingual Cross-modal Framework for Understanding and Generation | | 0 |
| Experiences of Adapting Multimodal Machine Translation Techniques for Hindi | | 0 |
| Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets | | 0 |
| Findings of the 2016 Conference on Machine Translation | | 0 |
| Findings of the 2017 Conference on Machine Translation (WMT17) | | 0 |
| Findings of the 2018 Conference on Machine Translation (WMT18) | | 0 |
| Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description | | 0 |
| Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models | | 0 |
| Generalization algorithm of multimodal pre-training model based on graph-text self-supervised training | | 0 |
| Generating Image Descriptions using Multilingual Data | | 0 |
| Generative Imagination Elevates Machine Translation | | 0 |
| Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation | | 0 |

Benchmark Results

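| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | del | Meteor (EN-FR) | 74.6 | | Unverified |
| 2 | ERNIE-UniX2 | BLEU (EN-DE) | 49.3 | | Unverified |
| 3 | IKD-MMT | BLEU (EN-DE) | 41.28 | | Unverified |
| 4 | DCCN | BLEU (EN-DE) | 39.7 | | Unverified |
| 5 | Caglayan | BLEU (EN-DE) | 39.4 | | Unverified |
| 6 | Gumbel-Attention MMT | BLEU (EN-DE) | 39.2 | | Unverified |
| 7 | Multimodal Transformer | BLEU (EN-DE) | 38.7 | | Unverified |
| 8 | ImagiT | BLEU (EN-DE) | 38.4 | | Unverified |
| 9 | del+obj | BLEU (EN-DE) | 38 | | Unverified |
| 10 | VMMTF | BLEU (EN-DE) | 37.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ViTA | BLEU (EN-HI) | 51.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ViTA | BLEU (EN-HI) | 44.6 | | Unverified |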