Seamless: Multilingual Expressive and Streaming Speech Translation Dec 8, 2023 automatic-speech-translation Machine Translation
Code Code Available 6Attention Is All You Need Jun 12, 2017 Abstractive Text Summarization All
Code Code Available 33AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset Apr 29, 2024 Machine Translation Multimodal Machine Translation
Code Code Available 1CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation Aug 29, 2023 Image Captioning Machine Translation
Code Code Available 1BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation May 23, 2023 Contrastive Learning Machine Translation
Code Code Available 1Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination May 20, 2023 Hallucination Machine Translation
Code Code Available 1Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation Dec 20, 2022 Machine Translation Multimodal Machine Translation
Code Code Available 1Distill the Image to Nowhere: Inversion Knowledge Distillation for Multimodal Machine Translation Oct 10, 2022 Knowledge Distillation Machine Translation
Code Code Available 1VALHALLA: Visual Hallucination for Machine Translation May 31, 2022 Hallucination Machine Translation
Code Code Available 1Neural Machine Translation with Phrase-Level Universal Visual Representations Mar 19, 2022 Machine Translation Multimodal Machine Translation
Code Code Available 1On Vision Features in Multimodal Machine Translation Mar 17, 2022 Image Captioning Machine Translation
Code Code Available 1MSCTD: A Multimodal Sentiment Chat Translation Dataset Feb 28, 2022 Machine Translation Multimodal Machine Translation
Code Code Available 1VISA: An Ambiguous Subtitles Dataset for Visual Scene-Aware Machine Translation Jan 20, 2022 Machine Translation Multimodal Machine Translation
Code Code Available 1BERTGEN: Multi-task Generation through BERT Jun 7, 2021 Decoder Image Captioning
Code Code Available 1Cross-lingual Visual Pre-training for Multimodal Machine Translation Jan 25, 2021 Language Modelling Machine Translation
Code Code Available 1Dynamic Context-guided Capsule Network for Multimodal Machine Translation Sep 4, 2020 Decoder Machine Translation
Code Code Available 1Multimodal Transformer for Multimodal Machine Translation Jul 1, 2020 Machine Translation Multimodal Machine Translation
Code Code Available 1Self-Knowledge Distillation with Progressive Refinement of Targets Jun 22, 2020 image-classification Image Classification
Code Code Available 1M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training Jun 4, 2020 Image Captioning Image Retrieval
Code Code Available 1CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation May 30, 2025 Benchmarking Machine Translation
— Unverified 0Multimodal Machine Translation with Visual Scene Graph Pruning May 26, 2025 Machine Translation Multimodal Machine Translation
— Unverified 0TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for Documentaries May 9, 2025 Domain Adaptation Machine Translation
Code Code Available 0Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models Mar 12, 2025 Cross-Lingual Transfer Image Captioning
— Unverified 0Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation Dec 17, 2024 Language Modeling Language Modelling
— Unverified 0EMMeTT: Efficient Multimodal Machine Translation Training Sep 20, 2024 automatic-speech-translation Decoder
— Unverified 0