BM25S: Orders of magnitude faster lexical search via eager sparse scoring Jul 4, 2024 Passage Retrieval Retrieval
Code Code Available 55 SGPT: GPT Sentence Embeddings for Semantic Search Feb 17, 2022 Argument Retrieval Biomedical Information Retrieval
Code Code Available 25 Multi-CPR: A Multi Domain Chinese Dataset for Passage Retrieval Mar 7, 2022 Information Retrieval Passage Retrieval
Code Code Available 25 BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models Apr 17, 2021 Argument Retrieval Benchmarking
Code Code Available 25 ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction Dec 2, 2021 Information Retrieval Open-Domain Question Answering
Code Code Available 25 Retrieval Augmented Visual Question Answering with Outside Knowledge Oct 7, 2022 Answer Generation Diagnostic
Code Code Available 25 Retrieval Oriented Masking Pre-training Language Model for Dense Passage Retrieval Oct 27, 2022 Language Modeling Language Modelling
Code Code Available 25 CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation Oct 30, 2024 Benchmarking Passage Retrieval
Code Code Available 25 QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs May 25, 2022 Answer Generation Natural Questions
Code Code Available 25 Never Lost in the Middle: Mastering Long-Context Question Answering with Position-Agnostic Decompositional Training Nov 15, 2023 Passage Retrieval Position
Code Code Available 25 Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question Answering Sep 29, 2023 Image to text Passage Retrieval
Code Code Available 25 MiniLLM: Knowledge Distillation of Large Language Models Jun 14, 2023 Instruction Following Knowledge Distillation
Code Code Available 25 Embedding-based Zero-shot Retrieval through Query Generation Sep 22, 2020 Passage Retrieval Retrieval
Code Code Available 15 End-to-End Query Term Weighting Aug 6, 2023 Passage Retrieval Retrieval
Code Code Available 15 One Question Answering Model for Many Languages with Cross-lingual Dense Passage Retrieval Jul 26, 2021 Answer Generation Passage Retrieval
Code Code Available 15 MFAQ: a Multilingual FAQ Dataset Sep 27, 2021 Passage Retrieval Retrieval
Code Code Available 15 LePaRD: A Large-Scale Dataset of Judges Citing Precedents Nov 15, 2023 Passage Retrieval Prediction
Code Code Available 15 Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering Jul 2, 2020 Natural Questions Open-Domain Question Answering
Code Code Available 15 On Single and Multiple Representations in Dense Passage Retrieval Aug 13, 2021 Passage Retrieval Re-Ranking
Code Code Available 15 Improving Passage Retrieval with Zero-Shot Question Generation Apr 15, 2022 Language Modeling Language Modelling
Code Code Available 15 Densifying Sparse Representations for Passage Retrieval by Representational Slicing Dec 9, 2021 Passage Retrieval Retrieval
Code Code Available 15 Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering Mar 14, 2022 Open-Domain Question Answering Passage Retrieval
Code Code Available 15 Efficient Passage Retrieval with Hashing for Open-domain Question Answering Jun 2, 2021 Natural Questions Open-Domain Question Answering
Code Code Available 15 Drop your Decoder: Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval Jan 20, 2024 Decoder Passage Retrieval
Code Code Available 15 LexMAE: Lexicon-Bottlenecked Pretraining for Large-Scale Retrieval Aug 31, 2022 CPU Decoder
Code Code Available 15 Back-Training excels Self-Training at Unsupervised Domain Adaptation of Question Generation and Passage Retrieval Apr 18, 2021 BIG-bench Machine Learning Domain Adaptation
Code Code Available 15 Asyncval: A Toolkit for Asynchronously Validating Dense Retriever Checkpoints during Training Feb 25, 2022 GPU Natural Questions
Code Code Available 15 Fine-Tuning LLaMA for Multi-Stage Text Retrieval Oct 12, 2023 Passage Retrieval Retrieval
Code Code Available 15 Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval Jul 1, 2020 Contrastive Learning Passage Retrieval
Code Code Available 15 Generation-Augmented Retrieval for Open-domain Question Answering Sep 17, 2020 Natural Questions Open-Domain Question Answering
Code Code Available 15 Large Dual Encoders Are Generalizable Retrievers Dec 15, 2021 Domain Generalization Passage Retrieval
Code Code Available 15 Dealing with Typos for BERT-based Passage Retrieval and Ranking Aug 27, 2021 Information Retrieval Language Modeling
Code Code Available 15 Cross-document Event Coreference Search: Task, Dataset and Modeling Oct 23, 2022 Coreference Resolution Cross Document Coreference Resolution
Code Code Available 15 CoRT: Complementary Rankings from Transformers Oct 20, 2020 Information Retrieval Passage Retrieval
Code Code Available 15 CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market Sep 8, 2023 Articles Passage Retrieval
Code Code Available 15 FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection Aug 12, 2024 Answer Generation Decoder
Code Code Available 15 Curriculum Learning for Dense Retrieval Distillation Apr 28, 2022 Knowledge Distillation Passage Retrieval
Code Code Available 15 DAPR: A Benchmark on Document-Aware Passage Retrieval May 23, 2023 Articles Passage Retrieval
Code Code Available 15 Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval Jul 31, 2022 Knowledge Distillation Language Modeling
Code Code Available 15 CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos Apr 1, 2022 Passage Retrieval Retrieval
Code Code Available 15 Clickbait Spoiling via Question Answering and Passage Retrieval Mar 19, 2022 Passage Retrieval Question Answering
Code Code Available 15 Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation Jun 21, 2022 Information Retrieval Passage Retrieval
Code Code Available 15 Do the Findings of Document and Passage Retrieval Generalize to the Retrieval of Responses for Dialogues? Jan 13, 2023 Conversational Search Passage Retrieval
Code Code Available 15 GuRE:Generative Query REwriter for Legal Passage Retrieval May 19, 2025 Passage Retrieval Retrieval
Code Code Available 15 Augmenting Document Representations for Dense Retrieval with Interpolation and Perturbation Mar 15, 2022 Data Augmentation Information Retrieval
Code Code Available 15 I^3 Retriever: Incorporating Implicit Interaction in Pre-trained Language Models for Passage Retrieval Jun 4, 2023 Knowledge Distillation Passage Retrieval
Code Code Available 15 Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence May 27, 2021 Articles Document Ranking
Code Code Available 15 Exemplars-guided Empathetic Response Generation Controlled by the Elements of Human Communication Jun 22, 2021 Empathetic Response Generation Passage Retrieval
Code Code Available 15 ConTextual Masked Auto-Encoder for Dense Passage Retrieval Aug 16, 2022 Decoder Passage Retrieval
Code Code Available 15 ArabicaQA: A Comprehensive Dataset for Arabic Question Answering Mar 26, 2024 Benchmarking Machine Reading Comprehension
Code Code Available 15