Seamless: Multilingual Expressive and Streaming Speech Translation | Dec 8, 2023 | automatic-speech-translation Machine Translation | Code Available 65
Attention Is All You Need | Jun 12, 2017 | Abstractive Text Summarization All | Code Available 35
Self-Knowledge Distillation with Progressive Refinement of Targets | Jun 22, 2020 | image-classification Image Classification | Code Available 15
Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination | May 20, 2023 | Hallucination Machine Translation | Code Available 15
VALHALLA: Visual Hallucination for Machine Translation | May 31, 2022 | Hallucination Machine Translation | Code Available 15
BERTGEN: Multi-task Generation through BERT | Jun 7, 2021 | Decoder Image Captioning | Code Available 15
M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training | Jun 4, 2020 | Image Captioning Image Retrieval | Code Available 15
MSCTD: A Multimodal Sentiment Chat Translation Dataset | Feb 28, 2022 | Machine Translation Multimodal Machine Translation | Code Available 15
CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation | Aug 29, 2023 | Image Captioning Machine Translation | Code Available 15
Dynamic Context-guided Capsule Network for Multimodal Machine Translation | Sep 4, 2020 | Decoder Machine Translation | Code Available 15
BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation | May 23, 2023 | Contrastive Learning Machine Translation | Code Available 15
Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation | Dec 20, 2022 | Machine Translation Multimodal Machine Translation | Code Available 15
On Vision Features in Multimodal Machine Translation | Mar 17, 2022 | Image Captioning Machine Translation | Code Available 15
VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation | Jan 20, 2022 | Machine Translation Multimodal Machine Translation | Code Available 15
Cross-lingual Visual Pre-training for Multimodal Machine Translation | Jan 25, 2021 | Language Modelling Machine Translation | Code Available 15
Neural Machine Translation with Phrase-Level Universal Visual Representations | Mar 19, 2022 | Machine Translation Multimodal Machine Translation | Code Available 15
Multimodal Transformer for Multimodal Machine Translation | Jul 1, 2020 | Machine Translation Multimodal Machine Translation | Code Available 15
3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset | Apr 29, 2024 | Machine Translation Multimodal Machine Translation | Code Available 15
Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation | Oct 10, 2022 | Knowledge Distillation Machine Translation | Code Available 15
Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation | Oct 20, 2023 | Decoder Image Generation | Code Available 05
UMONS Submission for WMT18 Multimodal Translation Task | Oct 15, 2018 | Image Captioning Machine Translation | Code Available 05
Does Multimodality Help Human and Machine for Translation and Image Captioning? | May 30, 2016 | Image Captioning Image Description | Code Available 05
HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language | May 28, 2023 | Machine Translation Multimodal Machine Translation | Code Available 05
Distilling Translations with Visual Awareness | Jun 18, 2019 | Decoder Machine Translation | Code Available 05
Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation | Dec 20, 2022 | Machine Translation Multimodal Machine Translation | Code Available 05
Video-Helpful Multimodal Machine Translation | Oct 31, 2023 | Machine Translation Multimodal Machine Translation | Code Available 05
TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for Documentaries | May 9, 2025 | Domain Adaptation Machine Translation | Code Available 05
Towards Zero-Shot Multimodal Machine Translation | Jul 18, 2024 | Language Modelling Machine Translation | Code Available 05
Findings of the Third Shared Task on Multimodal Machine Translation | Oct 1, 2018 | Machine Translation Multimodal Machine Translation | Code Available 05
NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems | Jun 1, 2017 | Machine Translation Multimodal Machine Translation | Code Available 05
Vision Matters When It Should: Sanity Checking Multimodal Machine Translation Models | Sep 8, 2021 | Image Captioning Machine Translation | Code Available 05
Multimodal Machine Translation with Embedding Prediction | Apr 1, 2019 | Machine Translation Multimodal Machine Translation | Code Available 05
A Visual Attention Grounding Neural Model for Multimodal Machine Translation | Aug 24, 2018 | Machine Translation Multimodal Machine Translation | Code Available 05
Multi30K: Multilingual English-German Image Descriptions | May 2, 2016 | Image Description Machine Translation | Code Available 05
Latent Variable Model for Multi-modal Translation | Nov 1, 2018 | Decoder Machine Translation | Code Available 05
Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs | Oct 26, 2023 | Attribute Machine Translation | Code Available 05
Cultural and Geographical Influences on Image Translatability of Words across Languages | Jun 1, 2021 | Cultural Vocal Bursts Intensity Prediction Low Resource Neural Machine Translation | Code Available 05
Multimodal Lexical Translation | May 1, 2018 | Machine Translation Multimodal Lexical Translation | Code Available 05
ViTA: Visual-Linguistic Translation by Aligning Object Tags | Jun 1, 2021 | Machine Translation Multimodal Machine Translation | Code Available 05
Efficient Object-Level Visual Context Modeling for Multimodal Machine Translation: Masking Irrelevant Objects Helps Grounding | Dec 18, 2020 | Machine Translation Multimodal Machine Translation | Unverified 00
Adaptive Fusion Techniques for Multimodal Data | Nov 10, 2019 | Emotion Recognition Machine Translation | Unverified 00
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation | May 30, 2025 | Benchmarking Machine Translation | Unverified 00
Doubly Attentive Transformer Machine Translation | Jul 30, 2018 | Decoder Image Captioning | Unverified 00
A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation | Jun 12, 2023 | Image Captioning Machine Translation | Unverified 00
Doubly-Attentive Decoder for Multi-modal Neural Machine Translation | Feb 4, 2017 | Decoder Image Description | Unverified 00
Gumbel-Attention for Multi-modal Machine Translation | Mar 16, 2021 | Machine Translation Multimodal Machine Translation | Unverified 00
Grounded Word Sense Translation | Jun 1, 2019 | Grounded language learning Machine Translation | Unverified 00
A Shared Task on Multimodal Machine Translation and Crosslingual Image Description | Aug 1, 2016 | Image Description Image Retrieval | Unverified 00
A Dataset and Reranking Method for Multimodal MT of User-Generated Image Captions | Mar 1, 2018 | Image Captioning Machine Translation | Unverified 00
Good for Misconceived Reasons: Revisiting Neural Multimodal Machine Translation | Jan 1, 2021 | Machine Translation Multimodal Machine Translation | Unverified 00