| Towards Better Multi-modal Keyphrase Generation via Visual Entity Enhancement and Multi-granularity Image Noise Filtering | Sep 9, 2023 | Image CaptioningImage-text matching | CodeCode Available | 0 | 5 |
| FactGenius: Combining Zero-Shot Prompting and Fuzzy Relation Mining to Improve Fact Verification with Knowledge Graphs | Jun 3, 2024 | Fact CheckingFact Verification | CodeCode Available | 0 | 5 |
| A Study of MatchPyramid Models on Ad-hoc Retrieval | Jun 15, 2016 | Machine TranslationParaphrase Identification | CodeCode Available | 0 | 5 |
| TIGEr: Text-to-Image Grounding for Image Caption Evaluation | Sep 4, 2019 | Image CaptioningText Matching | CodeCode Available | 0 | 5 |
| Evaluating Attribute Comprehension in Large Vision-Language Models | Aug 25, 2024 | AttributeImage-text matching | CodeCode Available | 0 | 5 |
| Supervised Contrastive Learning for Interpretable Long-Form Document Matching | Aug 20, 2021 | ArticlesContrastive Learning | CodeCode Available | 0 | 5 |
| Enhancing Image-Text Matching with Adaptive Feature Aggregation | Jan 18, 2024 | Image-text matchingImage-text Retrieval | CodeCode Available | 0 | 5 |
| SimVTP: Simple Video Text Pre-training with Masked Autoencoders | Dec 7, 2022 | Contrastive Learningcross-modal alignment | CodeCode Available | 0 | 5 |
| Simple and Effective Text Matching with Richer Alignment Features | Aug 1, 2019 | Answer SelectionNatural Language Inference | CodeCode Available | 0 | 5 |
| Robust Interaction-Based Relevance Modeling for Online e-Commerce Search | Jun 4, 2024 | RetrievalSentence | CodeCode Available | 0 | 5 |
| Efficient and Long-Tailed Generalization for Pre-trained Vision-Language Model | Jun 18, 2024 | Image-text matchingLanguage Modeling | CodeCode Available | 0 | 5 |
| RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models | Apr 21, 2023 | Cross-Modal RetrievalImage-text matching | CodeCode Available | 0 | 5 |
| Co-Driven Recognition of Semantic Consistency via the Fusion of Transformer and HowNet Sememes Knowledge | Feb 21, 2023 | Paraphrase IdentificationSentence | CodeCode Available | 0 | 5 |
| Delving into the Openness of CLIP | Jun 4, 2022 | image-classificationImage Classification | CodeCode Available | 0 | 5 |
| Dual Attention Networks for Multimodal Reasoning and Matching | Nov 2, 2016 | Collaborative InferenceImage-text matching | CodeCode Available | 0 | 5 |
| Response Ranking with Deep Matching Networks and External Knowledge in Information-seeking Conversation Systems | May 1, 2018 | Knowledge DistillationRetrieval | CodeCode Available | 0 | 5 |
| CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs | Jan 5, 2024 | Image ComprehensionImage to text | CodeCode Available | 0 | 5 |
| Compositional Image-Text Matching and Retrieval by Grounding Entities | May 4, 2025 | Image CaptioningImage-text matching | CodeCode Available | 0 | 5 |
| Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies? | Oct 21, 2022 | Image-text matchingLanguage Modeling | CodeCode Available | 0 | 5 |
| Query-bag Matching with Mutual Coverage for Information-seeking Conversations in E-commerce | Nov 7, 2019 | Text Matching | CodeCode Available | 0 | 5 |
| Random Text Perturbations Work, but not Always | Sep 2, 2022 | ClassificationData Augmentation | CodeCode Available | 0 | 5 |
| Constructing Interpretive Spatio-Temporal Features for Multi-Turn Responses Selection | Jul 1, 2019 | Text Matching | CodeCode Available | 0 | 5 |
| Document Similarity for Texts of Varying Lengths via Hidden Topics | Mar 26, 2019 | Text Matching | CodeCode Available | 0 | 5 |
| Dissecting Deep Metric Learning Losses for Image-Text Retrieval | Oct 21, 2022 | Cross-Modal RetrievalImage-text matching | CodeCode Available | 0 | 5 |
| Position Focused Attention Network for Image-Text Matching | Jul 23, 2019 | Image-text matchingPosition | CodeCode Available | 0 | 5 |
| Prompt-based Effective Input Reformulation for Legal Case Retrieval | Sep 6, 2023 | RetrievalText Matching | CodeCode Available | 0 | 5 |
| Semantic Matching from Different Perspectives | Feb 14, 2022 | SentenceText Matching | CodeCode Available | 0 | 5 |
| DISCO: A Hierarchical Disentangled Cognitive Diagnosis Framework for Interpretable Job Recommendation | Oct 10, 2024 | cognitive diagnosisContrastive Learning | CodeCode Available | 0 | 5 |
| A Modern Perspective on Query Likelihood with Deep Generative Retrieval Models | Jun 25, 2021 | Passage Re-RankingPassage Retrieval | CodeCode Available | 0 | 5 |
| Modeling Selective Feature Attention for Representation-based Siamese Text Matching | Apr 25, 2024 | Text Matching | CodeCode Available | 0 | 5 |
| Deep Cross-Modal Projection Learning for Image-Text Matching | Sep 1, 2018 | Cross-Modal RetrievalImage-text matching | CodeCode Available | 0 | 5 |
| MatchZoo: A Toolkit for Deep Text Matching | Jul 23, 2017 | Ad-Hoc Information RetrievalInformation Retrieval | CodeCode Available | 0 | 5 |
| Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking | Aug 12, 2019 | Binary ClassificationGeneral Classification | CodeCode Available | 0 | 5 |
| ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval | Jul 29, 2022 | Cross-Modal RetrievalImage-text matching | CodeCode Available | 0 | 5 |
| Beyond Image-Text Matching: Verb Understanding in Multimodal Transformers Using Guided Masking | Jan 29, 2024 | Image-text matchingText Matching | CodeCode Available | 0 | 5 |
| Matching Article Pairs with Graphical Decomposition and Convolutions | Feb 21, 2018 | Articlesdocument understanding | CodeCode Available | 0 | 5 |
| MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets | Mar 5, 2024 | DiversityImage Description | CodeCode Available | 0 | 5 |
| Luandri: a Clean Lua Interface to the Indri Search Engine | Feb 16, 2017 | Information RetrievalRetrieval | CodeCode Available | 0 | 5 |
| MALM: Mask Augmentation based Local Matching for Food-Recipe Retrieval | May 18, 2023 | Image-text matchingRetrieval | CodeCode Available | 0 | 5 |
| LLMs and Memorization: On Quality and Specificity of Copyright Compliance | May 28, 2024 | HallucinationMemorization | CodeCode Available | 0 | 5 |
| Learning Two-Branch Neural Networks for Image-Text Matching Tasks | Apr 11, 2017 | Image-text matchingRetrieval | CodeCode Available | 0 | 5 |
| Matching with Text Data: An Experimental Evaluation of Methods for Matching Documents and of Measuring Match Quality | Jan 2, 2018 | ArticlesCausal Inference | CodeCode Available | 0 | 5 |
| Hierarchical Similarity Learning for Language-based Product Image Retrieval | Feb 18, 2021 | Image RetrievalRetrieval | CodeCode Available | 0 | 5 |
| Backdoor Attack on Unpaired Medical Image-Text Foundation Models: A Pilot Study on MedCLIP | Jan 1, 2024 | Backdoor AttackContrastive Learning | CodeCode Available | 0 | 5 |
| Law Article-Enhanced Legal Case Matching: a Causal Learning Approach | Oct 20, 2022 | ArticlesSemantic Text Matching | CodeCode Available | 0 | 5 |
| Learning Fine-grained Fact-Article Correspondence in Legal Cases | Apr 21, 2021 | ArticlesText Matching | CodeCode Available | 0 | 5 |
| COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-Training for Vision-Language Representation | Jan 1, 2021 | Contrastive LearningCross-Modal Retrieval | CodeCode Available | 0 | 5 |
| Beyond 512 Tokens: Siamese Multi-depth Transformer-based Hierarchical Encoder for Long-Form Document Matching | Apr 26, 2020 | ClusteringForm | CodeCode Available | 0 | 5 |
| GR-GAN: Gradual Refinement Text-to-image Generation | May 23, 2022 | Generative Adversarial NetworkImage Generation | CodeCode Available | 0 | 5 |
| Learning fragment self-attention embeddings for image-text matching | Oct 1, 2019 | Image-text matchingSentence | CodeCode Available | 0 | 5 |