| RETSim: Resilient and Efficient Text Similarity | Nov 28, 2023 | Adversarial TextClustering | CodeCode Available | 4 |
| MeaCap: Memory-Augmented Zero-shot Image Captioning | Mar 6, 2024 | Caption GenerationImage Captioning | CodeCode Available | 2 |
| FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization | Jan 17, 2025 | Anomaly DetectionImage-text matching | CodeCode Available | 2 |
| Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval | Mar 22, 2023 | Image-text matchingLanguage Modeling | CodeCode Available | 2 |
| Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Jun 25, 2024 | cross-modal alignmentImage Classification | CodeCode Available | 2 |
| CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation | Mar 21, 2023 | Image SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification | Feb 27, 2024 | ClassificationDiagnostic | CodeCode Available | 2 |
| A Unified Evaluation of Textual Backdoor Learning: Frameworks and Benchmarks | Jun 17, 2022 | text similarity | CodeCode Available | 2 |
| Fine-grained Image Captioning with CLIP Reward | May 26, 2022 | Caption GenerationDescriptive | CodeCode Available | 2 |
| SemEval-2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding | Apr 21, 2022 | Binary ClassificationSentence | CodeCode Available | 1 |
| Text and Code Embeddings by Contrastive Pre-Training | Jan 24, 2022 | Code SearchLinear-Probe Classification | CodeCode Available | 1 |
| Sentence Bottleneck Autoencoders from Transformer Language Models | Aug 31, 2021 | DecoderDenoising | CodeCode Available | 1 |
| Video Text Tracking With a Spatio-Temporal Complementary Model | Nov 9, 2021 | text similarity | CodeCode Available | 1 |
| Smoothed Contrastive Learning for Unsupervised Sentence Embedding | Sep 9, 2021 | Contrastive LearningSentence | CodeCode Available | 1 |
| Stacked Cross Attention for Image-Text Matching | Mar 21, 2018 | Cross-Modal RetrievalImage Retrieval | CodeCode Available | 1 |
| TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification | Mar 9, 2025 | Robot NavigationSTS | CodeCode Available | 1 |
| Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models? | Sep 4, 2024 | Information RetrievalRetrieval | CodeCode Available | 1 |
| InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings | Oct 8, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| HHH: An Online Medical Chatbot System based on Knowledge Graph and Hierarchical Bi-Directional Attention | Feb 8, 2020 | ChatbotMedical question pair similarity computation | CodeCode Available | 1 |
| Holistic Features are almost Sufficient for Text-to-Video Retrieval | Jan 1, 2024 | Retrievaltext similarity | CodeCode Available | 1 |
| Learning Vector-Quantized Item Representation for Transferable Sequential Recommenders | Oct 22, 2022 | Language ModellingRecommendation Systems | CodeCode Available | 1 |
| Parrot Captions Teach CLIP to Spot Text | Dec 21, 2023 | Representation Learningtext similarity | CodeCode Available | 1 |
| Learning Semantic Relationship Among Instances for Image-Text Matching | Jan 1, 2023 | Cross-Modal RetrievalImage Retrieval | CodeCode Available | 1 |
| SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval | Dec 16, 2024 | FormInformation Retrieval | CodeCode Available | 1 |
| Starbucks: Improved Training for 2D Matryoshka Embeddings | Oct 17, 2024 | Language Modellingtext similarity | CodeCode Available | 1 |
| SynWMD: Syntax-aware Word Mover's Distance for Sentence Similarity Evaluation | Jun 20, 2022 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 1 |
| TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection | Apr 19, 2023 | ArticlesFake News Detection | CodeCode Available | 1 |
| Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation | Jul 2, 2024 | AnatomyClinical Knowledge | CodeCode Available | 1 |
| Context-Aware Legal Citation Recommendation using Deep Learning | Jun 20, 2021 | Citation RecommendationCollaborative Filtering | CodeCode Available | 1 |
| Wikiformer: Pre-training with Structured Information of Wikipedia for Ad-hoc Retrieval | Dec 17, 2023 | ArticlesInformation Retrieval | CodeCode Available | 1 |
| Extensive Self-Contrast Enables Feedback-Free Language Model Alignment | Mar 31, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Exploring the Impact of Negative Samples of Contrastive Learning: A Case Study of Sentence Embedding | Feb 26, 2022 | Contrastive LearningSentence | CodeCode Available | 1 |
| Exploring Visual Interpretability for Contrastive Language-Image Pre-training | Sep 15, 2022 | Retrievaltext similarity | CodeCode Available | 1 |
| Fine-Grained Image-Text Matching by Cross-Modal Hard Aligning Network | Jan 1, 2023 | Image-text matchingRetrieval | CodeCode Available | 1 |
| Dataflow-Guided Retrieval Augmentation for Repository-Level Code Completion | May 30, 2024 | Code CompletionRetrieval | CodeCode Available | 1 |
| Automating the search for a patent's prior art with a full text similarity search | Jan 10, 2019 | text similarity | CodeCode Available | 1 |
| Equivariant Similarity for Vision-Language Foundation Models | Mar 25, 2023 | Image-text RetrievalRetrieval | CodeCode Available | 1 |
| GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing | May 16, 2025 | Instruction FollowingMultiple-choice | CodeCode Available | 1 |
| CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification | Jul 31, 2023 | Classificationimage-classification | CodeCode Available | 1 |
| ESimCSE: Enhanced Sample Building Method for Contrastive Learning of Unsupervised Sentence Embedding | Sep 9, 2021 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| An Empirical Study of Multi-Task Learning on BERT for Biomedical Text Mining | May 6, 2020 | Multi-Task Learningnamed-entity-recognition | CodeCode Available | 1 |
| Hyper-CL: Conditioning Sentence Representations with Hypernetworks | Mar 14, 2024 | Computational EfficiencyContrastive Learning | CodeCode Available | 1 |
| HANet: Hierarchical Alignment Networks for Video-Text Retrieval | Jul 26, 2021 | RetrievalText Matching | CodeCode Available | 1 |
| RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs | May 15, 2023 | reinforcement-learningRetrieval | CodeCode Available | 1 |
| Negative-Aware Attention Framework for Image-Text Matching | Jan 1, 2022 | Image-text matchingText Matching | CodeCode Available | 1 |
| NMTScore: A Multilingual Analysis of Translation-based Text Similarity Measures | Apr 28, 2022 | Data-to-Text GenerationMachine Translation | CodeCode Available | 1 |
| Assisting the Human Fact-Checkers: Detecting All Previously Fact-Checked Claims in a Document | Jan 16, 2022 | AllFact Checking | —Unverified | 0 |
| A Knowledge Graph Based Solution for Entity Discovery and Linking in Open-Domain Questions | Dec 5, 2018 | Entity LinkingLearning-To-Rank | —Unverified | 0 |
| Assisting humans in complex comparisons: automated information comparison at scale | Apr 5, 2024 | Abstractive Text SummarizationRetrieval | —Unverified | 0 |
| Assessing Privacy Risks in Language Models: A Case Study on Summarization Tasks | Oct 20, 2023 | text similarity | —Unverified | 0 |