| New Benchmark Dataset and Fine-Grained Cross-Modal Fusion Framework for Vietnamese Multimodal Aspect-Category Sentiment Analysis | May 1, 2024 | Aspect Category Sentiment AnalysisMultimodal Sentiment Analysis | CodeCode Available | 1 |
| UIT-OpenViIC: A Novel Benchmark for Evaluating Image Captioning in Vietnamese | May 7, 2023 | Image CaptioningVietnamese Image Captioning | —Unverified | 0 |
| EVJVQA Challenge: Multilingual Visual Question Answering | Feb 23, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OpenViVQA: Task, Dataset, and Multimodal Fusion Models for Visual Question Answering in Vietnamese | May 7, 2023 | Information RetrievalQuestion Answering | CodeCode Available | 0 |
| ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images | Apr 16, 2024 | Multimodal Deep LearningOptical Character Recognition (OCR) | CodeCode Available | 0 |