| VL-BEiT: Generative Vision-Language Pretraining | Jun 2, 2022 | image-classificationImage Classification | —Unverified | 0 |
| VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation | Feb 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VU-BERT: A Unified framework for Visual Dialog | Feb 22, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Weighted Sampling for Masked Language Modeling | Feb 28, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Winner Team Mia at TextVQA Challenge 2021: Vision-and-Language Representation Learning with Pre-trained Sequence-to-Sequence Model | Jun 24, 2021 | DecoderLanguage Modeling | —Unverified | 0 |
| WordAlchemy: A transformer-based Reverse Dictionary | Apr 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| XGPT: Cross-modal Generative Pre-Training for Image Captioning | Mar 3, 2020 | Data AugmentationDenoising | —Unverified | 0 |
| XHate-999: Analyzing and Detecting Abusive Language Across Domains and Languages | Dec 1, 2020 | Abusive LanguageDisentanglement | —Unverified | 0 |
| Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters | Jul 1, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 |
| Vision-Language Pre-Training for Boosting Scene Text Detectors | Apr 29, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 |