Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval Mar 8, 2024 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
Code Code Available 1Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning Feb 21, 2024 Cross-Modal Retrieval Image Captioning
Code Code Available 1Cross-modal Retrieval for Knowledge-based Visual Question Answering Jan 11, 2024 Cross-Modal Retrieval Question Answering
Code Code Available 1TF-CLIP: Learning Text-free CLIP for Video-based Person Re-Identification Dec 15, 2023 Cross-Modal Retrieval Person Re-Identification
Code Code Available 1Enhancing Recipe Retrieval with Foundation Models: A Data Augmentation Perspective Dec 8, 2023 Cross-Modal Retrieval Data Augmentation
Code Code Available 1Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models Nov 27, 2023 Cross-Modal Retrieval Image Generation
Code Code Available 1Weakly supervised cross-modal learning in high-content screening Nov 8, 2023 Cross-Modal Retrieval Drug Discovery
Code Code Available 1BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping Oct 29, 2023 Contrastive Learning Cross-Modal Retrieval
Code Code Available 1A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval Oct 27, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 1PaLI-3 Vision Language Models: Smaller, Faster, Stronger Oct 13, 2023 Chart Question Answering Cross-Modal Retrieval
Code Code Available 1BioBridge: Bridging Biomedical Foundation Models via Knowledge Graphs Oct 5, 2023 Cross-Modal Retrieval Domain Generalization
Code Code Available 1Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval Sep 29, 2023 Cross-Modal Retrieval Image-text matching
Code Code Available 1Learning Tri-modal Embeddings for Zero-Shot Soundscape Mapping Sep 19, 2023 Cross-Modal Retrieval
Code Code Available 1A Survey on Interpretable Cross-modal Reasoning Sep 5, 2023 Cross-Modal Retrieval Decision Making
Code Code Available 1Multimodal Foundation Models For Echocardiogram Interpretation Aug 29, 2023 Cross-Modal Retrieval Diagnostic
Code Code Available 1Cross-Modal Retrieval: A Systematic Review of Methods and Future Directions Aug 28, 2023 Cross-Modal Retrieval Retrieval
Code Code Available 1Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval Aug 24, 2023 Cross-Modal Retrieval Image-text matching
Code Code Available 1An Empirical Study of CLIP for Text-based Person Search Aug 19, 2023 Cross-Modal Retrieval Data Augmentation
Code Code Available 1Unifying Two-Stream Encoders with Transformers for Cross-Modal Retrieval Aug 8, 2023 Cross-Modal Retrieval Image Retrieval
Code Code Available 1CLIP-KD: An Empirical Study of CLIP Model Distillation Jul 24, 2023 Contrastive Learning Cross-Modal Retrieval
Code Code Available 1mCLIP: Multilingual CLIP via Cross-lingual Transfer Jul 10, 2023 Contrastive Learning Cross-Lingual Transfer
Code Code Available 1Cross-modal transformers for infrared and visible image fusion Jun 26, 2023 Cross-Modal Retrieval Depth Estimation
Code Code Available 1Quilt-1M: One Million Image-Text Pairs for Histopathology Jun 20, 2023 Automatic Speech Recognition Cross-Modal Retrieval
Code Code Available 1Reducing Semantic Confusion: Scene-aware Aggregation Network for Remote Sensing Cross-modal Retrieval Jun 12, 2023 Cross-Modal Retrieval Retrieval
Code Code Available 1End-to-end Knowledge Retrieval with Multi-modal Queries Jun 1, 2023 Benchmarking Cross-Modal Retrieval
Code Code Available 1Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models May 31, 2023 Cross-Modal Retrieval Question Answering
Code Code Available 1Cross-Modal Retrieval for Motion and Text via DopTriple Loss May 7, 2023 Cross-Modal Retrieval Retrieval
Code Code Available 1Rethinking Benchmarks for Cross-modal Image-text Retrieval Apr 21, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 1Image-text Retrieval via Preserving Main Semantics of Vision Apr 20, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 1Noisy Correspondence Learning with Meta Similarity Correction Apr 13, 2023 Binary Classification Cross-Modal Retrieval
Code Code Available 1Plug-and-Play Regulators for Image-Text Matching Mar 23, 2023 Cross-Modal Retrieval Image Retrieval
Code Code Available 1Single-branch Network for Multimodal Training Mar 10, 2023 Cross-Modal Retrieval Retrieval
Code Code Available 1FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks Mar 4, 2023 Cross-Modal Retrieval Image Captioning
Code Code Available 1Cross-Modal Retrieval with Partially Mismatched Pairs Feb 22, 2023 Contrastive Learning Cross-Modal Retrieval
Code Code Available 1RONO: Robust Discriminative Learning With Noisy Labels for 2D-3D Cross-Modal Retrieval Jan 1, 2023 Cross-Modal Retrieval Learning with noisy labels
Code Code Available 1Learning Semantic Relationship Among Instances for Image-Text Matching Jan 1, 2023 Cross-Modal Retrieval Image Retrieval
Code Code Available 1Position-guided Text Prompt for Vision-Language Pre-training Dec 19, 2022 Cross-Modal Retrieval Image Captioning
Code Code Available 1Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval Dec 8, 2022 Cross-Modal Retrieval Food Recognition
Code Code Available 1A Differentiable Semantic Metric Approximation in Probabilistic Embedding for Cross-Modal Retrieval Dec 6, 2022 Cross-Modal Retrieval Image-text matching
Code Code Available 1Improving Cross-Modal Retrieval with Set of Diverse Embeddings Nov 30, 2022 Cross-Modal Retrieval Retrieval
Code Code Available 1Normalized Contrastive Learning for Text-Video Retrieval Nov 30, 2022 Contrastive Learning Cross-Modal Retrieval
Code Code Available 1VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval Nov 23, 2022 Cross-Modal Retrieval Retrieval
Code Code Available 1Perceiver-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention Nov 21, 2022 Cross-Modal Retrieval Language Modeling
Code Code Available 1Cross-Modal Fusion Distillation for Fine-Grained Sketch-Based Image Retrieval Oct 19, 2022 Cross-Modal Retrieval Image Retrieval
Code Code Available 1Deep Evidential Learning with Noisy Correspondence for Cross-Modal Retrieval Oct 10, 2022 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
Code Code Available 1Learning to Evaluate Performance of Multi-modal Semantic Localization Sep 14, 2022 Cross-Modal Retrieval Referring Expression
Code Code Available 1A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language Sep 12, 2022 Contrastive Learning Cross-Modal Retrieval
Code Code Available 1Cross-Lingual Cross-Modal Retrieval with Noise-Robust Learning Aug 26, 2022 Cross-Modal Retrieval Machine Translation
Code Code Available 1Learning Modal-Invariant and Temporal-Memory for Video-based Visible-Infrared Person Re-Identification Aug 4, 2022 Cross-Modal Retrieval Person Re-Identification
Code Code Available 1Integrating multi-label contrastive learning with dual adversarial graph neural networks for cross-modal retrieval Jul 5, 2022 Contrastive Learning Cross-Modal Retrieval
Code Code Available 1