| How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval? | Jul 10, 2024 | Contrastive Learning, Image-text Retrieval | Unverified | 0 |
| Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning | Jun 26, 2024 | Contrastive Learning, Cross-Modal Retrieval | Code Available | 0 |
| Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval | Jun 9, 2024 | Image-text Retrieval, Person Retrieval | Unverified | 0 |
| Knowledge-grounded Adaptation Strategy for Vision-language Models: Building Unique Case-set for Screening Mammograms for Residents Training | May 30, 2024 | Image-text Retrieval, Language Modeling | Unverified | 0 |
| Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships | May 29, 2024 | Adversarial Defense, Adversarial Robustness | Unverified | 0 |
| Active Learning for Finely-Categorized Image-Text Retrieval by Selecting Hard Negative Unpaired Samples | May 25, 2024 | Active Learning, Image-text Retrieval | Unverified | 0 |
| Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval | May 14, 2024 | Cross-Modal Retrieval, Cross-Modal Retrieval on RSITMD | Unverified | 0 |
| UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation | Apr 22, 2024 | Diversity, Domain Adaptation | Unverified | 0 |
| Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement | Apr 6, 2024 | Image-text Retrieval, object-detection | Unverified | 0 |
| LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival | Mar 16, 2024 | Caption Generation, Image-text Retrieval | Unverified | 0 |