| Cross-Thought for Sentence Encoder Pre-training | Oct 7, 2020 | Information RetrievalLanguage Modeling | CodeCode Available | 1 | 5 |
| CSFCube -- A Test Collection of Computer Science Research Articles for Faceted Query by Example | Mar 24, 2021 | ArticlesInformation Retrieval | CodeCode Available | 1 | 5 |
| C-STS: Conditional Semantic Textual Similarity | May 24, 2023 | Information RetrievalLanguage Model Evaluation | CodeCode Available | 1 | 5 |
| NaturalProofs: Mathematical Theorem Proving in Natural Language | Mar 24, 2021 | Automated Theorem ProvingDomain Generalization | CodeCode Available | 1 | 5 |
| Hengam: An Adversarially Trained Transformer for Persian Temporal Tagging | Nov 20, 2022 | Information RetrievalNamed Entity Recognition (NER) | CodeCode Available | 1 | 5 |
| A Comparison of Supervised Learning to Match Methods for Product Search | Jul 20, 2020 | ARCAttribute | CodeCode Available | 1 | 5 |
| Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators | Oct 11, 2023 | Information RetrievalInformativeness | CodeCode Available | 1 | 5 |
| Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding | Feb 28, 2024 | document understandingInformation Retrieval | CodeCode Available | 1 | 5 |
| Hyperbolic Relevance Matching for Neural Keyphrase Extraction | May 4, 2022 | Information RetrievalKeyphrase Extraction | CodeCode Available | 1 | 5 |
| Keyphrase Extraction from Scientific Articles via Extractive Summarization | Jun 1, 2021 | ArticlesExtractive Summarization | CodeCode Available | 1 | 5 |
| Back to the Basics: A Quantitative Analysis of Statistical and Graph-Based Term Weighting Schemes for Keyword Extraction | Apr 16, 2021 | Information RetrievalKeyword Extraction | CodeCode Available | 1 | 5 |
| G-RAG: Knowledge Expansion in Material Science | Nov 21, 2024 | Information RetrievalRAG | CodeCode Available | 1 | 5 |
| Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale Adversarial Pre-training | Jul 11, 2024 | Information RetrievalMusic Information Retrieval | CodeCode Available | 1 | 5 |
| GPU-based Private Information Retrieval for On-Device Machine Learning Inference | Jan 26, 2023 | CPUGPU | CodeCode Available | 1 | 5 |
| Grep-BiasIR: A Dataset for Investigating Gender Representation-Bias in Information Retrieval Results | Jan 19, 2022 | Information RetrievalRetrieval | CodeCode Available | 1 | 5 |
| GFTE: Graph-based Financial Table Extraction | Mar 17, 2020 | Information RetrievalPosition | CodeCode Available | 1 | 5 |
| Automatic Jailbreaking of the Text-to-Image Generative AI Systems | May 26, 2024 | Image GenerationInformation Retrieval | CodeCode Available | 1 | 5 |
| One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks | Sep 20, 2024 | AllDependency Parsing | CodeCode Available | 1 | 5 |
| GitTables: A Large-Scale Corpus of Relational Tables | Jun 14, 2021 | Information RetrievalTable annotation | CodeCode Available | 1 | 5 |
| Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural Networks | Jan 30, 2023 | Ad-Hoc Information RetrievalArticles | CodeCode Available | 1 | 5 |
| Automatic Generation of Topic Labels | May 29, 2020 | DescriptiveInformation Retrieval | CodeCode Available | 1 | 5 |
| Few-Shot Generative Conversational Query Rewriting | Jun 9, 2020 | Information RetrievalRetrieval | CodeCode Available | 1 | 5 |
| Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders | Aug 12, 2020 | Cross-Modal Information RetrievalCross-Modal Retrieval | CodeCode Available | 1 | 5 |
| FairDiverse: A Comprehensive Toolkit for Fair and Diverse Information Retrieval Algorithms | Feb 17, 2025 | DiversityFairness | CodeCode Available | 1 | 5 |
| Extending Context Window of Large Language Models via Semantic Compression | Dec 15, 2023 | Few-Shot LearningInformation Retrieval | CodeCode Available | 1 | 5 |
| Fast k-NN Graph Construction by GPU based NN-Descent | Oct 30, 2021 | CPUGPU | CodeCode Available | 1 | 5 |
| Exploring Dual Encoder Architectures for Question Answering | Apr 14, 2022 | Information RetrievalQuestion Answering | CodeCode Available | 1 | 5 |
| Exploring Classic and Neural Lexical Translation Models for Information Retrieval: Interpretability, Effectiveness, and Efficiency Benefits | Feb 12, 2021 | CPUDocument Ranking | CodeCode Available | 1 | 5 |
| Exploring _0 Sparsification for Inference-free Sparse Retrievers | Apr 21, 2025 | Computational EfficiencyInformation Retrieval | CodeCode Available | 1 | 5 |
| Fast Passage Re-ranking with Contextualized Exact Term Matching and Efficient Passage Expansion | Aug 19, 2021 | CPUInformation Retrieval | CodeCode Available | 1 | 5 |
| GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration | Jun 2, 2023 | Information RetrievalRetrieval | CodeCode Available | 1 | 5 |
| GuP: Fast Subgraph Matching by Guard-based Pruning | Jun 11, 2023 | Information RetrievalRetrieval | CodeCode Available | 1 | 5 |
| An Adversarial Imitation Click Model for Information Retrieval | Apr 13, 2021 | Imitation LearningInformation Retrieval | CodeCode Available | 1 | 5 |
| A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers | May 17, 2024 | Information RetrievalSurvey | CodeCode Available | 1 | 5 |
| Enhancing Complex Question Answering over Knowledge Graphs through Evidence Pattern Retrieval | Feb 3, 2024 | Information RetrievalKnowledge Graphs | CodeCode Available | 1 | 5 |
| Enhancing Cross-Sectional Currency Strategies by Context-Aware Learning to Rank with Self-Attention | May 20, 2021 | Information RetrievalLearning-To-Rank | CodeCode Available | 1 | 5 |
| ESPN: Memory-Efficient Multi-Vector Information Retrieval | Dec 9, 2023 | Information RetrievalRe-Ranking | CodeCode Available | 1 | 5 |
| ExaRanker-Open: Synthetic Explanation for IR using Open-Source LLMs | Feb 9, 2024 | Data AugmentationInformation Retrieval | CodeCode Available | 1 | 5 |
| AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative Reasoning | Oct 16, 2024 | Decision MakingInformation Retrieval | CodeCode Available | 1 | 5 |
| ATR4S: Toolkit with State-of-the-art Automatic Terms Recognition Methods in Scala | Nov 23, 2016 | Information RetrievalMachine Translation | CodeCode Available | 1 | 5 |
| audioLIME: Listenable Explanations Using Source Separation | Aug 2, 2020 | Information RetrievalMusic Information Retrieval | CodeCode Available | 1 | 5 |
| Attention Lens: A Tool for Mechanistically Interpreting the Attention Head Information Retrieval Mechanism | Oct 25, 2023 | Information RetrievalRetrieval | CodeCode Available | 1 | 5 |
| Audio Embeddings as Teachers for Music Classification | Jun 30, 2023 | ClassificationInformation Retrieval | CodeCode Available | 1 | 5 |
| Efficiently predicting high resolution mass spectra with graph neural networks | Jan 26, 2023 | Graph ClassificationInformation Retrieval | CodeCode Available | 1 | 5 |
| A Statutory Article Retrieval Dataset in French | Aug 26, 2021 | ArticlesInformation Retrieval | CodeCode Available | 1 | 5 |
| Augmenting Document Representations for Dense Retrieval with Interpolation and Perturbation | Mar 15, 2022 | Data AugmentationInformation Retrieval | CodeCode Available | 1 | 5 |
| Embed2Detect: Temporally Clustered Embedded Words for Event Detection in Social Media | Jun 10, 2020 | ClusteringEvent Detection | CodeCode Available | 1 | 5 |
| AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant | Nov 11, 2024 | Decision MakingHallucination | CodeCode Available | 1 | 5 |
| A multi-task semi-supervised framework for Text2Graph & Graph2Text | Feb 12, 2022 | Information RetrievalRetrieval | CodeCode Available | 1 | 5 |
| Efficient fine-tuning methodology of text embedding models for information retrieval: contrastive learning penalty (clp) | Dec 23, 2024 | Contrastive LearningInformation Retrieval | CodeCode Available | 1 | 5 |