CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation May 30, 2025 Benchmarking Machine Translation
— Unverified 0Multimodal Machine Translation with Visual Scene Graph Pruning May 26, 2025 Machine Translation Multimodal Machine Translation
— Unverified 0TopicVD: A Topic-Based Dataset of Video-Guided Multimodal Machine Translation for Documentaries May 9, 2025 Domain Adaptation Machine Translation
Code Code Available 0Florenz: Scaling Laws for Systematic Generalization in Vision-Language Models Mar 12, 2025 Cross-Lingual Transfer Image Captioning
— Unverified 0Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation Dec 17, 2024 Language Modeling Language Modelling
— Unverified 0EMMeTT: Efficient Multimodal Machine Translation Training Sep 20, 2024 automatic-speech-translation Decoder
— Unverified 0Towards Zero-Shot Multimodal Machine Translation Jul 18, 2024 Language Modelling Machine Translation
Code Code Available 03AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset Apr 29, 2024 Machine Translation Multimodal Machine Translation
Code Code Available 1Exploring the Necessity of Visual Modality in Multimodal Machine Translation using Authentic Datasets Apr 9, 2024 Machine Translation Multimodal Machine Translation
— Unverified 0Detecting Concrete Visual Tokens for Multimodal Machine Translation Mar 5, 2024 Machine Translation Multimodal Machine Translation
— Unverified 0Adding Multimodal Capabilities to a Text-only Translation Model Mar 5, 2024 Machine Translation Multimodal Machine Translation
— Unverified 0The Case for Evaluating Multimodal Translation Models on Text Datasets Mar 5, 2024 Descriptive Image Captioning
— Unverified 0Seamless: Multilingual Expressive and Streaming Speech Translation Dec 8, 2023 automatic-speech-translation Machine Translation
Code Code Available 6Video-Helpful Multimodal Machine Translation Oct 31, 2023 Machine Translation Multimodal Machine Translation
Code Code Available 0Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs Oct 26, 2023 Attribute Machine Translation
Code Code Available 0Bridging the Gap between Synthetic and Authentic Images for Multimodal Machine Translation Oct 20, 2023 Decoder Image Generation
Code Code Available 0CLIPTrans: Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation Aug 29, 2023 Image Captioning Machine Translation
Code Code Available 1A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation Jun 12, 2023 Image Captioning Machine Translation
— Unverified 0HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language May 28, 2023 Machine Translation Multimodal Machine Translation
Code Code Available 0BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation May 23, 2023 Contrastive Learning Machine Translation
Code Code Available 1Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination May 20, 2023 Hallucination Machine Translation
Code Code Available 1Iterative Adversarial Attack on Image-guided Story Ending Generation May 16, 2023 Adversarial Attack Adversarial Robustness
— Unverified 0Generalization algorithm of multimodal pre-training model based on graph-text self-supervised training Feb 16, 2023 Machine Translation Multimodal Machine Translation
— Unverified 0Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation Dec 20, 2022 Machine Translation Multimodal Machine Translation
Code Code Available 0Tackling Ambiguity with Images: Improved Multimodal Machine Translation and Contrastive Evaluation Dec 20, 2022 Machine Translation Multimodal Machine Translation
Code Code Available 1