| Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval | Dec 26, 2024 | Image-text Retrieval, Information Retrieval | Code Available | 0 | 5 |
| Semantic-Preserving Augmentation for Robust Image-Text Retrieval | Mar 10, 2023 | Image-text Retrieval, Retrieval | Code Available | 0 | 5 |
| HADA: A Graph-based Amalgamation Framework in Image-text Retrieval | Jan 11, 2023 | Graph Neural Network, Image Retrieval | Code Available | 0 | 5 |
| GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning | Oct 20, 2024 | Image Retrieval, Image-text Retrieval | Code Available | 0 | 5 |
| Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval | Nov 5, 2021 | Image-text Retrieval, Retrieval | Code Available | 0 | 5 |
| A Vision-Language Foundation Model for Leaf Disease Identification | May 11, 2025 | Contrastive Learning, image-classification | Code Available | 0 | 5 |
| Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval | Jul 17, 2024 | Image-text Retrieval, Object | Code Available | 0 | 5 |
| From Unimodal to Multimodal: Scaling up Projectors to Align Modalities | Sep 28, 2024 | Image-text Retrieval, Semantic Similarity | Code Available | 0 | 5 |
| Adding simple structure at inference improves Vision-Language Compositionality | Jun 11, 2025 | Attribute, Image-text Retrieval | Code Available | 0 | 5 |
| NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings | Jan 7, 2023 | Cross-Modal Retrieval, Image-text Retrieval | Code Available | 0 | 5 |