SOTAVerified

Multimodal Machine Translation

Multimodal machine translation is the task of doing machine translation with multiple data sources - for example, translating "a bird is flying over water" + an image of a bird over water to German text.

( Image credit: Findings of the Third Shared Task on Multimodal Machine Translation )

Papers

Showing 51100 of 108 papers

TitleStatusHype
A Visually-Grounded Parallel Corpus with Phrase-to-Region Linking0
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation0
CUNI System for the WMT18 Multimodal Translation Task0
CUNI System for WMT16 Automatic Post-Editing and Multimodal Translation Tasks0
DCU-UvA Multimodal MT System Report0
Debiasing Word Embeddings Improves Multimodal Machine Translation0
Detecting Concrete Visual Tokens for Multimodal Machine Translation0
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation0
Doubly Attentive Transformer Machine Translation0
Adaptive Fusion Techniques for Multimodal Data0
Efficient Object-Level Visual Context Modeling for Multimodal Machine Translation: Masking Irrelevant Objects Helps Grounding0
EMMeTT: Efficient Multimodal Machine Translation Training0
Ensemble Sequence Level Training for Multimodal MT: OSU-Baidu WMT18 Multimodal Machine Translation System Report0
Multimodal Neural Machine Translation System for English to Bengali0
MultiNews: A Web collection of an Aligned Multimodal and Multilingual Corpus0
NICT-NAIST System for WMT17 Multimodal Translation Task0
On Leveraging the Visual Modality for Neural Machine Translation0
On Vision Features in Multimodal Machine Translation0
OSU Multimodal Machine Translation System Report0
Probing Representations Learned by Multimodal Recurrent and Transformer Models0
Probing the Need for Visual Context in Multimodal Machine Translation0
Rakuten’s Participation in WAT 2021: Examining the Effectiveness of Pre-trained Models for Multilingual and Multimodal Machine Translation0
Sheffield MultiMT: Using Object Posterior Predictions for Multimodal Machine Translation0
Sheffield Submissions for WMT18 Multimodal Translation Shared Task0
SHEF-Multimodal: Grounding Machine Translation on Images0
Supervised Visual Attention for Simultaneous Multimodal Machine Translation0
The AFRL-Ohio State WMT18 Multimodal System: Combining Visual with Traditional0
The AFRL-OSU WMT17 Multimodal Translation System: An Image Processing Approach0
The Case for Evaluating Multimodal Translation Models on Text Datasets0
The MeMAD Submission to the WMT18 Multimodal Translation Task0
TMU Japanese-English Multimodal Machine Translation System for WAT 20200
Understanding the Effect of Textual Adversaries in Multimodal Machine Translation0
Transformer-based Cascaded Multimodal Speech Translation0
MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish0
Multilingual Multimodal Machine Translation for Dravidian Languages utilizing Phonetic Transcription0
Multimodal Machine Translation through Visuals and Speech0
Multimodal Machine Translation with Reinforcement Learning0
Multimodal Machine Translation with Visual Scene Graph Pruning0
Latent Variable Model for Multi-modal TranslationCode0
Video-Helpful Multimodal Machine TranslationCode0
Multi30K: Multilingual English-German Image DescriptionsCode0
A Visual Attention Grounding Neural Model for Multimodal Machine TranslationCode0
Multimodal Lexical TranslationCode0
Does Multimodality Help Human and Machine for Translation and Image Captioning?Code0
Distilling Translations with Visual AwarenessCode0
Multimodal Machine Translation with Embedding PredictionCode0
Cultural and Geographical Influences on Image Translatability of Words across LanguagesCode0
Vision Matters When It Should: Sanity Checking Multimodal Machine Translation ModelsCode0
Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering PairsCode0
TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for DocumentariesCode0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1delMeteor (EN-FR)74.6Unverified
2ERNIE-UniX2BLEU (EN-DE)49.3Unverified
3IKD-MMTBLEU (EN-DE)41.28Unverified
4DCCNBLEU (EN-DE)39.7Unverified
5CaglayanBLEU (EN-DE)39.4Unverified
6Gumbel-Attention MMTBLEU (EN-DE)39.2Unverified
7Multimodal TransformerBLEU (EN-DE)38.7Unverified
8ImagiTBLEU (EN-DE)38.4Unverified
9del+objBLEU (EN-DE)38Unverified
10VMMTFBLEU (EN-DE)37.6Unverified
#ModelMetricClaimedVerifiedStatus
1ViTABLEU (EN-HI)51.6Unverified
#ModelMetricClaimedVerifiedStatus
1ViTABLEU (EN-HI)44.6Unverified