Seamless: Multilingual Expressive and Streaming Speech Translation Dec 8, 2023 automatic-speech-translation Machine Translation
Code Code Available 6Attention Is All You Need Jun 12, 2017 Abstractive Text Summarization All
Code Code Available 3On Vision Features in Multimodal Machine Translation Mar 17, 2022 Image Captioning Machine Translation
Code Code Available 1Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation Dec 20, 2022 Machine Translation Multimodal Machine Translation
Code Code Available 1VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation Jan 20, 2022 Machine Translation Multimodal Machine Translation
Code Code Available 1BERTGEN: Multi-task Generation through BERT Jun 7, 2021 Decoder Image Captioning
Code Code Available 1MSCTD: A Multimodal Sentiment Chat Translation Dataset Feb 28, 2022 Machine Translation Multimodal Machine Translation
Code Code Available 1CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation Aug 29, 2023 Image Captioning Machine Translation
Code Code Available 1Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination May 20, 2023 Hallucination Machine Translation
Code Code Available 1Self-Knowledge Distillation with Progressive Refinement of Targets Jun 22, 2020 image-classification Image Classification
Code Code Available 1Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation Oct 10, 2022 Knowledge Distillation Machine Translation
Code Code Available 1BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation May 23, 2023 Contrastive Learning Machine Translation
Code Code Available 1Dynamic Context-guided Capsule Network for Multimodal Machine Translation Sep 4, 2020 Decoder Machine Translation
Code Code Available 1Multimodal Transformer for Multimodal Machine Translation Jul 1, 2020 Machine Translation Multimodal Machine Translation
Code Code Available 1VALHALLA: Visual Hallucination for Machine Translation May 31, 2022 Hallucination Machine Translation
Code Code Available 1Cross-lingual Visual Pre-training for Multimodal Machine Translation Jan 25, 2021 Language Modelling Machine Translation
Code Code Available 13AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset Apr 29, 2024 Machine Translation Multimodal Machine Translation
Code Code Available 1M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training Jun 4, 2020 Image Captioning Image Retrieval
Code Code Available 1Neural Machine Translation with Phrase-Level Universal Visual Representations Mar 19, 2022 Machine Translation Multimodal Machine Translation
Code Code Available 1CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation May 30, 2025 Benchmarking Machine Translation
— Unverified 0A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation Jun 12, 2023 Image Captioning Machine Translation
— Unverified 0A Shared Task on Multimodal Machine Translation and Crosslingual Image Description Aug 1, 2016 Image Description Image Retrieval
— Unverified 0A Dataset and Reranking Method for Multimodal MT of User-Generated Image Captions Mar 1, 2018 Image Captioning Machine Translation
— Unverified 0Doubly Attentive Transformer Machine Translation Jul 30, 2018 Decoder Image Captioning
— Unverified 0Adaptive Fusion Techniques for Multimodal Data Nov 10, 2019 Emotion Recognition Machine Translation
— Unverified 0