NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings Jan 7, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning Jan 30, 2024 Diversity Image-text Retrieval
Code Code Available 0Reproducibility, Replicability, and Insights into Visual Document Retrieval with Late Interaction May 12, 2025 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision Apr 26, 2019 Image-text Retrieval Object
Code Code Available 0MultiWay-Adapater: Adapting large-scale multi-modal models for scalable image-text retrieval Sep 4, 2023 Image-text Retrieval Retrieval
Code Code Available 0VL-Taboo: An Analysis of Attribute-based Zero-shot Capabilities of Vision-Language Models Sep 12, 2022 Attribute Image-text Retrieval
Code Code Available 0Multi-stage Pre-training over Simplified Multimodal Pre-training Models Jul 22, 2021 Image-text Retrieval Retrieval
Code Code Available 0Retrieval Augmentation for Deep Neural Networks Feb 25, 2021 Image Captioning Retrieval
Code Code Available 0Diving Deep into the Motion Representation of Video-Text Models Jun 7, 2024 Retrieval Text Retrieval
Code Code Available 0Dissecting Deep Metric Learning Losses for Image-Text Retrieval Oct 21, 2022 Cross-Modal Retrieval Image-text matching
Code Code Available 0Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering Dec 19, 2024 Contrastive Learning Language Modeling
Code Code Available 0Reversed in Time: A Novel Temporal-Emphasized Benchmark for Cross-Modal Video-Text Retrieval Dec 26, 2024 Image-text Retrieval Information Retrieval
Code Code Available 0Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis Feb 11, 2023 Image-text Retrieval Knowledge Graphs
Code Code Available 0Multilingual Vision-Language Pre-training for the Remote Sensing Domain Oct 30, 2024 Cross-Modal Retrieval image-classification
Code Code Available 0MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian Jun 20, 2023 Cross-Lingual Transfer Retrieval
Code Code Available 0MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling Jun 12, 2025 16k Retrieval
Code Code Available 0AugTriever: Unsupervised Dense Retrieval and Domain Adaptation by Scalable Data Augmentation Dec 17, 2022 Data Augmentation Domain Adaptation
Code Code Available 0Exposing and Mitigating Spurious Correlations for Cross-Modal Retrieval Apr 6, 2023 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0Towards a text-based quantitative and explainable histopathology image analysis Jul 10, 2024 image-classification Image Classification
Code Code Available 0MODOC: A Modular Interface for Flexible Interlinking of Text Retrieval and Text Generation Functions Aug 26, 2024 Information Retrieval Retrieval
Code Code Available 0Design of the topology for contrastive visual-textual alignment Sep 5, 2022 Contrastive Learning Image-to-Text Retrieval
Code Code Available 0Rudder: A Cross Lingual Video and Text Retrieval Dataset Mar 9, 2021 Natural Language Queries Retrieval
Code Code Available 0Modelling Stopping Criteria for Search Results using Poisson Processes Sep 13, 2019 Retrieval Text Retrieval
Code Code Available 0Exploiting Positional Bias for Query-Agnostic Generative Content in Search May 1, 2024 Position Text Retrieval
Code Code Available 0Mistral-SPLADE: LLMs for better Learned Sparse Retrieval Aug 20, 2024 Decoder Language Modeling
Code Code Available 0An Unsupervised Cross-Modal Hashing Method Robust to Noisy Training Image-Text Correspondences in Remote Sensing Feb 26, 2022 Image-text Retrieval Meta-Learning
Code Code Available 0Towards Robust Text Retrieval with Progressive Learning Nov 20, 2023 Machine Reading Comprehension Question Answering
Code Code Available 0MHSAN: Multi-Head Self-Attention Network for Visual Semantic Embedding Jan 11, 2020 Image Captioning Image-text Retrieval
Code Code Available 0USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval Jan 17, 2023 Contrastive Learning Image-text Retrieval
Code Code Available 0MeTA: A Unified Toolkit for Text Retrieval and Analysis Aug 1, 2016 Document Classification Information Retrieval
Code Code Available 0Explaining Text Similarity in Transformer Models May 10, 2024 Information Retrieval Retrieval
Code Code Available 0Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval Jun 11, 2018 Image-text Retrieval Retrieval
Code Code Available 0Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes Jan 23, 2025 Emotion Classification Image Captioning
Code Code Available 0Expertized Caption Auto-Enhancement for Video-Text Retrieval Feb 5, 2025 Caption Generation Retrieval
Code Code Available 0A Binary Variational Autoencoder for Hashing Oct 22, 2019 Quantization Retrieval
Code Code Available 0Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark Feb 14, 2022 Benchmarking Contrastive Learning
Code Code Available 0Semantic-Preserving Augmentation for Robust Image-Text Retrieval Mar 10, 2023 Image-text Retrieval Retrieval
Code Code Available 0Adding simple structure at inference improves Vision-Language Compositionality Jun 11, 2025 Attribute Image-text Retrieval
Code Code Available 0Variational Deep Semantic Hashing for Text Documents Aug 11, 2017 Image Retrieval Information Retrieval
Code Code Available 0It Takes Two to Tango: Combining Visual and Textual Information for Detecting Duplicate Video-Based Bug Reports Jan 22, 2021 Optical Character Recognition Optical Character Recognition (OCR)
Code Code Available 0Shallow Cross-Encoders for Low-Latency Retrieval Mar 29, 2024 CPU GPU
Code Code Available 0Tree-Based Text Retrieval via Hierarchical Clustering in RAGFrameworks: Application on Taiwanese Regulations Jun 16, 2025 RAG Retrieval
Code Code Available 0A Bi-metric Framework for Fast Similarity Search Jun 5, 2024 MTEB Benchmark Re-Ranking
Code Code Available 0Intra-Modal Constraint Loss For Image-Text Retrieval Jul 11, 2022 Cross-Modal Retrieval Image-text Retrieval
Code Code Available 0Denoising Table-Text Retrieval for Open-Domain Question Answering Mar 26, 2024 Denoising Open-Domain Question Answering
Code Code Available 0DeepTileBars: Visualizing Term Distribution for Neural Information Retrieval Nov 1, 2018 Ad-Hoc Information Retrieval Document Ranking
Code Code Available 0Single Shot Scene Text Retrieval Aug 27, 2018 Image Retrieval Retrieval
Code Code Available 0Single-Stream Multi-Level Alignment for Vision-Language Pretraining Mar 27, 2022 Image-text Retrieval Question Answering
Code Code Available 0Video-Text Retrieval by Supervised Sparse Multi-Grained Learning Feb 19, 2023 Representation Learning Retrieval
Code Code Available 0Socratic Models: Composing Zero-Shot Multimodal Reasoning with Language Apr 1, 2022 Diversity Image Captioning
Code Code Available 0