| COIL: Revisit Exact Lexical Match in Information Retrieval with Contextualized Inverted List | Apr 15, 2021 | Information Retrieval, Retrieval | Code Available | 1 | 5 |
| Grep-BiasIR: A Dataset for Investigating Gender Representation-Bias in Information Retrieval Results | Jan 19, 2022 | Information Retrieval, Retrieval | Code Available | 1 | 5 |
| HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution | Jul 31, 2023 | Information Retrieval, Informativeness | Code Available | 1 | 5 |
| History-Aware Conversational Dense Retrieval | Jan 30, 2024 | Conversational Search, Information Retrieval | Code Available | 1 | 5 |
| ColBERT-XM: A Modular Multi-Vector Representation Model for Zero-Shot Multilingual Information Retrieval | Feb 23, 2024 | Cross-Lingual Transfer, Information Retrieval | Code Available | 1 | 5 |
| Mixing-Specific Data Augmentation Techniques for Improved Blind Violin/Piano Source Separation | Aug 6, 2020 | Data Augmentation, Information Retrieval | Code Available | 1 | 5 |
| mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset | Aug 31, 2021 | Information Retrieval, Machine Translation | Code Available | 1 | 5 |
| Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval | May 26, 2025 | Contrastive Learning, cross-modal alignment | Code Available | 1 | 5 |
| ComplexTempQA: A Large-Scale Dataset for Complex Temporal Question Answering | Jun 7, 2024 | Information Retrieval, Question Answering | Code Available | 1 | 5 |
| Complex Knowledge Base Question Answering: A Survey | Aug 15, 2021 | Information Retrieval, Knowledge Base Question Answering | Code Available | 1 | 5 |
| GFTE: Graph-based Financial Table Extraction | Mar 17, 2020 | Information Retrieval, Position | Code Available | 1 | 5 |
| ATR4S: Toolkit with State-of-the-art Automatic Terms Recognition Methods in Scala | Nov 23, 2016 | Information Retrieval, Machine Translation | Code Available | 1 | 5 |
| A Deep Recurrent Survival Model for Unbiased Ranking | Apr 30, 2020 | Information Retrieval, model | Code Available | 1 | 5 |
| GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration | Jun 2, 2023 | Information Retrieval, Retrieval | Code Available | 1 | 5 |
| GitTables: A Large-Scale Corpus of Relational Tables | Jun 14, 2021 | Information Retrieval, Table annotation | Code Available | 1 | 5 |
| Multilingual Music Genre Embeddings for Effective Cross-Lingual Music Item Annotation | Sep 16, 2020 | Information Retrieval, Music Recommendation | Code Available | 1 | 5 |
| A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers | May 17, 2024 | Information Retrieval, Survey | Code Available | 1 | 5 |
| CREPE: A Convolutional Representation for Pitch Estimation | Feb 17, 2018 | Information Retrieval, Music Information Retrieval | Code Available | 1 | 5 |
| Contrastive Audio-Language Learning for Music | Aug 25, 2022 | Audio to Text Retrieval, Descriptive | Code Available | 1 | 5 |
| AT-RAG: An Adaptive RAG Model Enhancing Query Efficiency with Topic Filtering and Iterative Reasoning | Oct 16, 2024 | Decision Making, Information Retrieval | Code Available | 1 | 5 |
| Conversational Document Prediction to Assist Customer Care Agents | Oct 5, 2020 | Information Retrieval, Prediction | Code Available | 1 | 5 |
| Conversational Question Answering over Passages by Leveraging Word Proximity Networks | Apr 27, 2020 | Conversational Question Answering, Information Retrieval | Code Available | 1 | 5 |
| GPU-based Private Information Retrieval for On-Device Machine Learning Inference | Jan 26, 2023 | CPU, GPU | Code Available | 1 | 5 |
| Conversational Entity Linking: Problem Definition and Datasets | May 11, 2021 | Entity Linking, Information Retrieval | Code Available | 1 | 5 |
| HPI-DHC at TREC 2018 Precision Medicine Track | Nov 14, 2018 | Articles, Document Classification | Code Available | 1 | 5 |
| CoRT: Complementary Rankings from Transformers | Oct 20, 2020 | Information Retrieval, Passage Retrieval | Code Available | 1 | 5 |
| Corpus-Steered Query Expansion with Large Language Models | Feb 28, 2024 | Information Retrieval, Retrieval | Code Available | 1 | 5 |
| Neural Code Search Revisited: Enhancing Code Snippet Retrieval through Natural Language Intent | Aug 27, 2020 | Annotated Code Search, Code Search | Code Available | 1 | 5 |
| AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant | Nov 11, 2024 | Decision Making, Hallucination | Code Available | 1 | 5 |
| ASPIRE: Assistive System for Performance Evaluation in IR | Dec 20, 2024 | Information Retrieval, Retrieval | Code Available | 1 | 5 |
| Finding the Law: Enhancing Statutory Article Retrieval via Graph Neural Networks | Jan 30, 2023 | Ad-Hoc Information Retrieval, Articles | Code Available | 1 | 5 |
| OntoChatGPT Information System: Ontology-Driven Structured Prompts for ChatGPT Meta-Learning | Jul 11, 2023 | Chatbot, Information Retrieval | Code Available | 1 | 5 |
| A Statutory Article Retrieval Dataset in French | Aug 26, 2021 | Articles, Information Retrieval | Code Available | 1 | 5 |
| Discovering Mathematical Objects of Interest -- A Study of Mathematical Notations | Feb 7, 2020 | Information Retrieval, Math | Code Available | 1 | 5 |
| A Deep Generative Framework for Paraphrase Generation | Sep 15, 2017 | Decoder, Information Retrieval | Code Available | 1 | 5 |
| Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders | Aug 12, 2020 | Cross-Modal Information Retrieval, Cross-Modal Retrieval | Code Available | 1 | 5 |
| PairDistill: Pairwise Relevance Distillation for Dense Retrieval | Oct 2, 2024 | Information Retrieval, Knowledge Distillation | Code Available | 1 | 5 |
| Cross-Thought for Sentence Encoder Pre-training | Oct 7, 2020 | Information Retrieval, Language Modeling | Code Available | 1 | 5 |
| C-STS: Conditional Semantic Textual Similarity | May 24, 2023 | Information Retrieval, Language Model Evaluation | Code Available | 1 | 5 |
| CSFCube -- A Test Collection of Computer Science Research Articles for Faceted Query by Example | Mar 24, 2021 | Articles, Information Retrieval | Code Available | 1 | 5 |
| DebateSum: A large-scale argument mining and summarization dataset | Nov 14, 2020 | Abstractive Text Summarization, Argument Mining | Code Available | 1 | 5 |
| Persian Keyphrase Generation Using Sequence-to-Sequence Models | Sep 25, 2020 | Articles, Information Retrieval | Code Available | 1 | 5 |
| Are We There Yet? Evaluating State-of-the-Art Neural Network based Geoparsers Using EUPEG as a Benchmarking Platform | Jul 15, 2020 | Articles, Benchmarking | Code Available | 1 | 5 |
| DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions | May 26, 2023 | Information Retrieval, Retrieval | Code Available | 1 | 5 |
| Fast Passage Re-ranking with Contextualized Exact Term Matching and Efficient Passage Expansion | Aug 19, 2021 | CPU, Information Retrieval | Code Available | 1 | 5 |
| Few-Shot Generative Conversational Query Rewriting | Jun 9, 2020 | Information Retrieval, Retrieval | Code Available | 1 | 5 |
| Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous Decoding | Apr 22, 2024 | Information Retrieval, Retrieval | Code Available | 1 | 5 |
| PlenoptiCam v1.0: A light-field imaging framework | Oct 14, 2020 | Camera Calibration, Information Retrieval | Code Available | 1 | 5 |
| Declarative Experimentation in Information Retrieval using PyTerrier | Jul 28, 2020 | GPU, Information Retrieval | Code Available | 1 | 5 |
| Are "Undocumented Workers" the Same as "Illegal Aliens"? Disentangling Denotation and Connotation in Vector Spaces | Oct 6, 2020 | Diversity, Information Retrieval | Code Available | 1 | 5 |