More Robust Dense Retrieval with Contrastive Dual Learning Jul 16, 2021 Contrastive Learning Information Retrieval
Code Code Available 1Dynamic Modality Interaction Modeling for Image-Text Retrieval Jul 11, 2021 cross-modal alignment Cross-Modal Retrieval
Code Code Available 1CLIP2Video: Mastering Video-Text Retrieval via Image CLIP Jun 21, 2021 Language Modeling Language Modelling
Code Code Available 1CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback Jun 19, 2021 Image Retrieval Image-text Retrieval
Code Code Available 1A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval Jun 4, 2021 Graph Matching Image Retrieval
Code Code Available 1Learning Relation Alignment for Calibrated Cross-modal Retrieval May 28, 2021 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 1CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval Apr 18, 2021 Retrieval Text Retrieval
Code Code Available 1Condenser: a Pre-training Architecture for Dense Retrieval Apr 16, 2021 Language Modelling Retrieval
Code Code Available 1Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling Apr 14, 2021 GPU Re-Ranking
Code Code Available 1Understanding Hard Negatives in Noise Contrastive Estimation Apr 13, 2021 Entity Linking Retrieval
Code Code Available 1Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning Apr 7, 2021 Representation Learning Retrieval
Code Code Available 1Scene Text Retrieval via Joint Text Detection and Similarity Learning Apr 4, 2021 Retrieval Scene Text Detection
Code Code Available 1Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval Apr 1, 2021 Retrieval Text Retrieval
Code Code Available 1Kaleido-BERT: Vision-Language Pre-training on Fashion Domain Mar 30, 2021 Image Retrieval Retrieval
Code Code Available 1A Comprehensive Review of the Video-to-Text Problem Mar 27, 2021 Question Answering Retrieval
Code Code Available 1VLGrammar: Grounded Grammar Induction of Vision and Language Mar 24, 2021 Clustering Contrastive Learning
Code Code Available 1LightningDOT: Pre-training Visual-Semantic Embeddings for Real-Time Image-Text Retrieval Mar 16, 2021 Image-text Retrieval Re-Ranking
Code Code Available 1A Data-Centric Framework for Composable NLP Workflows Mar 2, 2021 Retrieval Text Retrieval
Code Code Available 1Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits Feb 12, 2021 CPU Document Ranking
Code Code Available 1Rethink Training of BERT Rerankers in Multi-Stage Retrieval Pipeline Jan 21, 2021 Retrieval Text Retrieval
Code Code Available 1GLoRIA: A Multimodal Global-Local Representation Learning Framework for Label-Efficient Medical Image Recognition Jan 1, 2021 Image-text Retrieval Medical Image Analysis
Code Code Available 1Learning the Best Pooling Strategy for Visual Semantic Embedding Nov 9, 2020 Cross-Modal Information Retrieval Image-text Retrieval
Code Code Available 1A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports Sep 3, 2020 Image-text Retrieval Medical Visual Question Answering
Code Code Available 1Consensus-Aware Visual-Semantic Embedding for Image-Text Matching Jul 17, 2020 Image Captioning Image-text matching
Code Code Available 1Language-agnostic BERT Sentence Embedding Jul 3, 2020 Language Modeling Language Modelling
Code Code Available 1Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval Jul 1, 2020 Contrastive Learning Passage Retrieval
Code Code Available 1Graph Optimal Transport for Cross-Domain Alignment Jun 26, 2020 Graph Matching Image Captioning
Code Code Available 1Large-Scale Adversarial Training for Vision-and-Language Representation Learning Jun 11, 2020 Image-text Retrieval Question Answering
Code Code Available 1Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers Apr 2, 2020 Image-text matching Image-text Retrieval
Code Code Available 1IMRAM: Iterative Matching with Recurrent Attention Memory for Cross-Modal Image-Text Retrieval Mar 8, 2020 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 1Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning Mar 1, 2020 Cross-Modal Retrieval Retrieval
Code Code Available 1Knowledge Guided Text Retrieval and Reading for Open Domain Question Answering Nov 10, 2019 Natural Questions Open-Domain Question Answering
Code Code Available 1Cross-modal Scene Graph Matching for Relationship-aware Image-Text Retrieval Oct 11, 2019 Graph Matching Image-text Retrieval
Code Code Available 1UNITER: UNiversal Image-TExt Representation Learning Sep 25, 2019 Image-text matching Image-text Retrieval
Code Code Available 1XQA: A Cross-lingual Open-domain Question Answering Dataset Jul 1, 2019 Machine Translation Open-Domain Question Answering
Code Code Available 1Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval Jun 11, 2019 Cross-Modal Retrieval Multiple Instance Learning
Code Code Available 1Learning a Text-Video Embedding from Incomplete and Heterogeneous Data Apr 7, 2018 Retrieval Text Retrieval
Code Code Available 1Stacked Cross Attention for Image-Text Matching Mar 21, 2018 Cross-Modal Retrieval Image Retrieval
Code Code Available 1Maximal Matching Matters: Preventing Representation Collapse for Robust Cross-Modal Retrieval Jun 26, 2025 Cross-Modal Retrieval Image-text Retrieval
— Unverified 0Tree-Based Text Retrieval via Hierarchical Clustering in RAGFrameworks: Application on Taiwanese Regulations Jun 16, 2025 RAG Retrieval
Code Code Available 0MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling Jun 12, 2025 16k Retrieval
Code Code Available 0Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration Jun 12, 2025 cross-modal alignment Image to text
— Unverified 0Adding simple structure at inference improves Vision-Language Compositionality Jun 11, 2025 Attribute Image-text Retrieval
Code Code Available 0Beyond Cropped Regions: New Benchmark and Corresponding Baseline for Chinese Scene Text Retrieval in Diverse Layouts Jun 5, 2025 Retrieval Text Retrieval
— Unverified 0Attacking Attention of Foundation Models Disrupts Downstream Tasks Jun 3, 2025 Depth Estimation Image-text Retrieval
Code Code Available 0ERU-KG: Efficient Reference-aligned Unsupervised Keyphrase Generation May 30, 2025 Informativeness Keyphrase Generation
Code Code Available 0MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval May 26, 2025 Image Retrieval Large Language Model
— Unverified 0Distill CLIP (DCLIP): Enhancing Image-Text Retrieval via Cross-Modal Transformer Distillation May 25, 2025 Contrastive Learning Image-text Retrieval
— Unverified 0EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models May 24, 2025 Image-text Retrieval Language Modeling
— Unverified 0Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval May 22, 2025 cross-modal alignment Image-text Retrieval
— Unverified 0