SOTAVerified

Multimodal Machine Translation

Multimodal machine translation is the task of doing machine translation with multiple data sources - for example, translating "a bird is flying over water" + an image of a bird over water to German text.

( Image credit: Findings of the Third Shared Task on Multimodal Machine Translation )

Papers

Showing 5175 of 108 papers

TitleStatusHype
ViTA: Visual-Linguistic Translation by Aligning Object TagsCode0
Good for Misconceived Reasons: An Empirical Revisiting on the Need for Visual Context in Multimodal Machine Translation0
Gumbel-Attention for Multi-modal Machine Translation0
Good for Misconceived Reasons: Revisiting Neural Multimodal Machine Translation0
Efficient Object-Level Visual Context Modeling for Multimodal Machine Translation: Masking Irrelevant Objects Helps Grounding0
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish0
TMU Japanese-English Multimodal Machine Translation System for WAT 20200
Generative Imagination Elevates Machine Translation0
A Visually-Grounded Parallel Corpus with Phrase-to-Region Linking0
Investigating the Decoders of Maximum Likelihood Sequence Models: A Look-ahead Approach0
Multimodal Machine Translation through Visuals and Speech0
Adaptive Fusion Techniques for Multimodal Data0
Understanding the Effect of Textual Adversaries in Multimodal Machine Translation0
Transformer-based Cascaded Multimodal Speech Translation0
On Leveraging the Visual Modality for Neural Machine Translation0
Probing Representations Learned by Multimodal Recurrent and Transformer Models0
Multilingual Multimodal Machine Translation for Dravidian Languages utilizing Phonetic Transcription0
Hindi Visual Genome: A Dataset for Multimodal English-to-Hindi Machine Translation0
Distilling Translations with Visual AwarenessCode0
Grounded Word Sense Translation0
Debiasing Word Embeddings Improves Multimodal Machine Translation0
Multimodal Machine Translation with Embedding PredictionCode0
Probing the Need for Visual Context in Multimodal Machine Translation0
Latent Variable Model for Multi-modal TranslationCode0
UMONS Submission for WMT18 Multimodal Translation TaskCode0
Show:102550
← PrevPage 3 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1delMeteor (EN-FR)74.6Unverified
2ERNIE-UniX2BLEU (EN-DE)49.3Unverified
3IKD-MMTBLEU (EN-DE)41.28Unverified
4DCCNBLEU (EN-DE)39.7Unverified
5CaglayanBLEU (EN-DE)39.4Unverified
6Gumbel-Attention MMTBLEU (EN-DE)39.2Unverified
7Multimodal TransformerBLEU (EN-DE)38.7Unverified
8ImagiTBLEU (EN-DE)38.4Unverified
9del+objBLEU (EN-DE)38Unverified
10VMMTFBLEU (EN-DE)37.6Unverified
#ModelMetricClaimedVerifiedStatus
1ViTABLEU (EN-HI)51.6Unverified
#ModelMetricClaimedVerifiedStatus
1ViTABLEU (EN-HI)44.6Unverified