| ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding | Oct 23, 2020 | cross-modal alignmentLanguage Modeling | —Unverified | 0 | 0 |
| ViLTA: Enhancing Vision-Language Pre-training through Textual Augmentation | Aug 31, 2023 | Image-text matchingLanguage Modeling | —Unverified | 0 | 0 |
| Bi-Granularity Contrastive Learning for Post-Training in Few-Shot Scene | Jun 4, 2021 | Contrastive LearningData Augmentation | —Unverified | 0 | 0 |
| Adversarial Generation and Encoding of Nested Texts | Jun 1, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| STT: Soft Template Tuning for Few-Shot Learning | Jan 16, 2022 | Few-Shot LearningLanguage Modeling | —Unverified | 0 | 0 |
| STT: Soft Template Tuning for Few-Shot Adaptation | Jul 18, 2022 | Few-Shot LearningLanguage Modeling | —Unverified | 0 | 0 |
| VL-BEiT: Generative Vision-Language Pretraining | Jun 2, 2022 | image-classificationImage Classification | —Unverified | 0 | 0 |
| Bidirectional Language Models Are Also Few-shot Learners | Sep 29, 2022 | DenoisingLanguage Modeling | —Unverified | 0 | 0 |
| Bias Vector: Mitigating Biases in Language Models with Task Arithmetic Approach | Dec 16, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation | Feb 5, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |