Rethinking Benchmarks for Cross-modal Image-text Retrieval Apr 21, 2023 Cross-Modal Retrieval Image-text Retrieval
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval Mar 22, 2021 Cross-Modal Retrieval Retrieval
Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification Aug 4, 2022 Cross-Modal Retrieval Person Re-Identification
Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval Oct 19, 2022 Cross-Modal Retrieval Image Retrieval
More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval Mar 25, 2021 All Cross-Modal Retrieval
IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval Mar 8, 2020 Cross-Modal Retrieval Image-text Retrieval
IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents Dec 10, 2024 Cross-Modal Retrieval Image Classification
Integrating multi-label contrastive learning with dual adversarial graph neural networks for cross-modal retrieval Jul 5, 2022 Contrastive Learning Cross-Modal Retrieval
CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval May 29, 2024 Cross-Modal Retrieval Image Retrieval
Similarity Reasoning and Filtration for Image-Text Matching Jan 5, 2021 Cross-Modal Retrieval Image Retrieval
Graph Structured Network for Image-Text Matching Apr 1, 2020 Attribute Cross-Modal Retrieval
Fusion and Orthogonal Projection for Improved Face-Voice Association Dec 20, 2021 Cross-Modal Retrieval Triplet
Cross-modal Retrieval for Knowledge-based Visual Question Answering Jan 11, 2024 Cross-Modal Retrieval Question Answering
Cross-Modal Retrieval for Motion and Text via DopTriple Loss May 7, 2023 Cross-Modal Retrieval Retrieval
M3-Jepa: Multimodal Alignment via Multi-directional MoE based on the JEPA framework Sep 9, 2024 Computational Efficiency Cross-Modal Retrieval
Stacked Cross Attention for Image-Text Matching Mar 21, 2018 Cross-Modal Retrieval Image Retrieval
Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective Dec 8, 2023 Cross-Modal Retrieval Data Augmentation
Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval Dec 8, 2022 Cross-Modal Retrieval Food Recognition
IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages Jan 27, 2022 Cross-Modal Retrieval Few-Shot Learning
Cross-Modal Retrieval with Partially Mismatched Pairs Feb 22, 2023 Contrastive Learning Cross-Modal Retrieval
Cross Modal Retrieval with Querybank Normalisation Dec 23, 2021 Cross-Modal Retrieval Metric Learning
Text-Based Person Search with Limited Data Oct 20, 2021 Benchmarking Contrastive Learning
Knowledge-enhanced Visual-Language Pretraining for Computational Pathology Apr 15, 2024 Cross-Modal Retrieval Language Modeling
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning Mar 1, 2020 Cross-Modal Retrieval Retrieval
UGNCL: Uncertainty-Guided Noisy Correspondence Learning for Efficient Cross-Modal Matching Jul 11, 2024 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
UniVSE: Robust Visual Semantic Embeddings via Structured Semantic Representations Apr 11, 2019 Contrastive Learning Cross-Modal Retrieval
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks Mar 4, 2023 Cross-Modal Retrieval Image Captioning
An Empirical Study of CLIP for Text-based Person Search Aug 19, 2023 Cross-Modal Retrieval Data Augmentation
FashionBERT: Text and Image Matching with Adaptive Loss for Cross-modal Retrieval May 20, 2020 Cross-Modal Retrieval Retrieval
Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders Aug 12, 2020 Cross-Modal Information Retrieval Cross-Modal Retrieval
End-to-end Knowledge Retrieval with Multi-modal Queries Jun 1, 2023 Benchmarking Cross-Modal Retrieval
CLIP-KD: An Empirical Study of CLIP Model Distillation Jul 24, 2023 Contrastive Learning Cross-Modal Retrieval
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models May 31, 2023 Cross-Modal Retrieval Question Answering
FedCMR: Federated Cross-Modal Retrieval Jul 1, 2021 Cross-Modal Retrieval Federated Learning
An Empirical Study of Training End-to-End Vision-and-Language Transformers Nov 3, 2021 Cross-Modal Retrieval Decoder
Deep Evidential Learning with Noisy Correspondence for Cross-Modal Retrieval Oct 10, 2022 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
Fuzzy Multimodal Learning for Trusted Cross-modal Retrieval Jan 1, 2025 Cross-Modal Retrieval Retrieval
GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis Feb 13, 2025 Cross-Modal Retrieval Image Captioning
A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval Jan 8, 2022 Cross-Modal Retrieval Information Retrieval
Image-text Retrieval via Preserving Main Semantics of Vision Apr 20, 2023 Cross-Modal Retrieval Image-text Retrieval
COBRA: Contrastive Bi-Modal Representation Algorithm May 7, 2020 Cross-Modal Retrieval Image Captioning
Improving Cross-Modal Retrieval with Set of Diverse Embeddings Nov 30, 2022 Cross-Modal Retrieval Retrieval
Dynamic Modality Interaction Modeling for Image-Text Retrieval Jul 11, 2021 cross-modal alignment Cross-Modal Retrieval
Florence: A New Foundation Model for Computer Vision Nov 22, 2021 Action Classification Action Recognition
CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code Matching Dec 1, 2020 Computer Security Cross-Modal Retrieval
Learning Cross-Modal Retrieval With Noisy Labels Jun 19, 2021 Cross-Modal Retrieval Retrieval
Learning Relation Alignment for Calibrated Cross-modal Retrieval May 28, 2021 Cross-Modal Retrieval Image-text Retrieval
Learning Semantic Relationship Among Instances for Image-Text Matching Jan 1, 2023 Cross-Modal Retrieval Image Retrieval
Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval Mar 8, 2024 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts Nov 16, 2021 Cross-Modal Retrieval Image Captioning