Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval Feb 24, 2020 Cross-Modal Retrieval Image Retrieval
Code Code Available 15 Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval Aug 24, 2023 Cross-Modal Retrieval Image-text matching
Code Code Available 15 PaLI-3 Vision Language Models: Smaller, Faster, Stronger Oct 13, 2023 Chart Question Answering Cross-Modal Retrieval
Code Code Available 15 Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval Oct 19, 2022 Cross-Modal Retrieval Image Retrieval
Code Code Available 15 BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping Oct 29, 2023 Contrastive Learning Cross-Modal Retrieval
Code Code Available 15 Learning Dual Semantic Relations with Graph Attention for Image-Text Matching Oct 22, 2020 Cross-Modal Retrieval Graph Attention
Code Code Available 15 Adaptive label-aware graph convolutional networks for cross-modal retrieval Aug 6, 2021 Cross-Modal Retrieval Representation Learning
Code Code Available 15 Learning Semantic Relationship Among Instances for Image-Text Matching Jan 1, 2023 Cross-Modal Retrieval Image Retrieval
Code Code Available 15 CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval May 29, 2024 Cross-Modal Retrieval Image Retrieval
Code Code Available 15 Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval Mar 8, 2024 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
Code Code Available 15 Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query Mar 2, 2021 Cross-Modal Retrieval Image Retrieval
Code Code Available 15 A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval Dec 6, 2022 Cross-Modal Retrieval Image-text matching
Code Code Available 15 Cross-modal Retrieval for Knowledge-based Visual Question Answering Jan 11, 2024 Cross-Modal Retrieval Question Answering
Code Code Available 15 Cross-Modal Retrieval for Motion and Text via DopTriple Loss May 7, 2023 Cross-Modal Retrieval Retrieval
Code Code Available 15 M3-Jepa: Multimodal Alignment via Multi-directional MoE based on the JEPA framework Sep 9, 2024 Computational Efficiency Cross-Modal Retrieval
Code Code Available 15 Learning with Noisy Correspondence for Cross-modal Matching Dec 1, 2021 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
Code Code Available 15 Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective Dec 8, 2023 Cross-Modal Retrieval Data Augmentation
Code Code Available 15 Normalized Contrastive Learning for Text-Video Retrieval Nov 30, 2022 Contrastive Learning Cross-Modal Retrieval
Code Code Available 15 Noisy Correspondence Learning with Meta Similarity Correction Apr 13, 2023 Binary Classification Cross-Modal Retrieval
Code Code Available 15 Cross-Modal Retrieval with Partially Mismatched Pairs Feb 22, 2023 Contrastive Learning Cross-Modal Retrieval
Code Code Available 15 On Metric Learning for Audio-Text Cross-Modal Retrieval Mar 29, 2022 AudioCaps Cross-Modal Retrieval
Code Code Available 15 mCLIP: Multilingual CLIP via Cross-lingual Transfer Jul 10, 2023 Contrastive Learning Cross-Lingual Transfer
Code Code Available 15 Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention Nov 21, 2022 Cross-Modal Retrieval Language Modeling
Code Code Available 15 Cross-modal transformers for infrared and visible image fusion Jun 26, 2023 Cross-Modal Retrieval Depth Estimation
Code Code Available 15 MusCaps: Generating Captions for Music Audio Apr 24, 2021 Audio captioning Classification
Code Code Available 15 UniVSE: Robust Visual Semantic Embeddings via Structured Semantic Representations Apr 11, 2019 Contrastive Learning Cross-Modal Retrieval
Code Code Available 15 Multimodal Foundation Models For Echocardiogram Interpretation Aug 29, 2023 Cross-Modal Retrieval Diagnostic
Code Code Available 15 An Empirical Study of CLIP for Text-based Person Search Aug 19, 2023 Cross-Modal Retrieval Data Augmentation
Code Code Available 15 Multimodal Metric Learning for Tag-based Music Retrieval Oct 30, 2020 Cross-Modal Retrieval Metric Learning
Code Code Available 15 Nearest Neighbor Normalization Improves Multimodal Retrieval Oct 31, 2024 Cross-Modal Retrieval Image Captioning
Code Code Available 15 Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models May 31, 2023 Cross-Modal Retrieval Question Answering
Code Code Available 15 CLIP-KD: An Empirical Study of CLIP Model Distillation Jul 24, 2023 Contrastive Learning Cross-Modal Retrieval
Code Code Available 15 Disentangling and Generating Modalities for Recommendation in Missing Modality Scenarios Apr 23, 2025 Cross-Modal Retrieval Recommendation Systems
Code Code Available 15 Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning Feb 21, 2024 Cross-Modal Retrieval Image Captioning
Code Code Available 15 An Empirical Study of Training End-to-End Vision-and-Language Transformers Nov 3, 2021 Cross-Modal Retrieval Decoder
Code Code Available 15 Deep Evidential Learning with Noisy Correspondence for Cross-Modal Retrieval Oct 10, 2022 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
Code Code Available 15 Dual adversarial graph neural networks for multi-label cross-modal retrieval May 18, 2021 Cross-Modal Retrieval Retrieval
Code Code Available 15 Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval Jun 22, 2021 Cross-Modal Retrieval Diversity
Code Code Available 15 Order-Embeddings of Images and Language Nov 19, 2015 Cross-Modal Retrieval Image Captioning
Code Code Available 15 A Comprehensive Empirical Study of Vision-Language Pre-trained Model for Supervised Cross-Modal Retrieval Jan 8, 2022 Cross-Modal Retrieval Information Retrieval
Code Code Available 15 COBRA: Contrastive Bi-Modal Representation Algorithm May 7, 2020 Cross-Modal Retrieval Image Captioning
Code Code Available 15 Dynamic Modality Interaction Modeling for Image-Text Retrieval Jul 11, 2021 cross-modal alignment Cross-Modal Retrieval
Code Code Available 15 FedCMR: Federated Cross-Modal Retrieval Jul 1, 2021 Cross-Modal Retrieval Federated Learning
Code Code Available 15 Multi-Label Cross-Modal Retrieval Dec 1, 2015 Cross-Modal Retrieval Retrieval
Code Code Available 15 CodeCMR: Cross-Modal Retrieval For Function-Level Binary Source Code Matching Dec 1, 2020 Computer Security Cross-Modal Retrieval
Code Code Available 15 Emotion Embedding Spaces for Matching Music to Stories Nov 26, 2021 Cross-Modal Retrieval Metric Learning
Code Code Available 15 Neural Methods for Point-wise Dependency Estimation Jun 9, 2020 Cross-Modal Retrieval Representation Learning
Code Code Available 15 Probabilistic Embeddings for Cross-Modal Retrieval Jan 13, 2021 Cross-Modal Retrieval Retrieval
Code Code Available 15 Plug-and-Play Regulators for Image-Text Matching Mar 23, 2023 Cross-Modal Retrieval Image Retrieval
Code Code Available 15 Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information Apr 21, 2022 Cross-Modal Retrieval Image Retrieval
Code Code Available 15