| WebWalker: Benchmarking LLMs in Web Traversal | Jan 13, 2025 | BenchmarkingOpen-Domain Question Answering | CodeCode Available | 11 |
| What Makes Good In-Context Examples for GPT-3? | Jan 17, 2021 | Few-Shot LearningNatural Language Understanding | CodeCode Available | 4 |
| ERNIE 2.0: A Continual Pre-training Framework for Language Understanding | Jul 29, 2019 | Chinese Named Entity RecognitionChinese Reading Comprehension | CodeCode Available | 3 |
| Generating Long Sequences with Sparse Transformers | Apr 23, 2019 | DiversityImage Generation | CodeCode Available | 3 |
| TableRAG: A Retrieval Augmented Generation Framework for Heterogeneous Document Reasoning | Jun 12, 2025 | Answer GenerationChunking | CodeCode Available | 2 |
| Talk Structurally, Act Hierarchically: A Collaborative Framework for LLM Multi-Agent Systems | Feb 16, 2025 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 2 |
| Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering | Oct 21, 2024 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 2 |
| Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation | Oct 11, 2024 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 2 |
| Blended RAG: Improving RAG (Retriever-Augmented Generation) Accuracy with Semantic Search and Hybrid Query-Based Retrievers | Mar 22, 2024 | Information Retrieval | CodeCode Available | 2 |
| RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering | Feb 26, 2024 | FormOpen-Domain Question Answering | CodeCode Available | 2 |
| PEDANTS: Cheap but Effective and Interpretable Answer Equivalence | Feb 17, 2024 | BenchmarkingForm | CodeCode Available | 2 |
| ITINERA: Integrating Spatial Optimization with Large Language Models for Open-domain Urban Itinerary Planning | Feb 11, 2024 | LLM real-life tasksOpen-Domain Question Answering | CodeCode Available | 2 |
| Can AI Assistants Know What They Don't Know? | Jan 24, 2024 | MathOpen-Domain Question Answering | CodeCode Available | 2 |
| Learning to Filter Context for Retrieval-Augmented Generation | Nov 14, 2023 | Extractive Question-AnsweringFact Verification | CodeCode Available | 2 |
| Knowledge Graph Prompting for Multi-Document Question Answering | Aug 22, 2023 | graph constructionOpen-Domain Question Answering | CodeCode Available | 2 |
| RetroMAE-2: Duplex Masked Auto-Encoder For Pre-Training Retrieval-Oriented Language Models | May 4, 2023 | Information RetrievalOpen-Domain Question Answering | CodeCode Available | 2 |
| Generate rather than Retrieve: Large Language Models are Strong Context Generators | Sep 21, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering | Sep 20, 2022 | Multimodal Deep LearningMultimodal Reasoning | CodeCode Available | 2 |
| Atlas: Few-shot Learning with Retrieval Augmented Language Models | Aug 5, 2022 | Fact CheckingFew-Shot Learning | CodeCode Available | 2 |
| QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs | May 25, 2022 | Answer GenerationNatural Questions | CodeCode Available | 2 |
| RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | May 24, 2022 | DecoderInformation Retrieval | CodeCode Available | 2 |
| ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction | Dec 2, 2021 | Information RetrievalOpen-Domain Question Answering | CodeCode Available | 2 |
| A Replication Study of Dense Passage Retriever | Apr 12, 2021 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 2 |
| Learning Dense Representations of Phrases at Scale | Dec 23, 2020 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 2 |
| What Disease does this Patient Have? A Large-scale Open Domain Question Answering Dataset from Medical Exams | Sep 28, 2020 | MedQAMultiple-choice | CodeCode Available | 2 |
| Relevance-guided Supervision for OpenQA with ColBERT | Jul 1, 2020 | Natural QuestionsOpen-Domain Question Answering | CodeCode Available | 2 |
| ktrain: A Low-Code Library for Augmented Machine Learning | Apr 19, 2020 | BIG-bench Machine LearningClassification | CodeCode Available | 2 |
| Reformer: The Efficient Transformer | Jan 13, 2020 | D4RLImage Generation | CodeCode Available | 2 |
| ECoRAG: Evidentiality-guided Compression for Long Context RAG | Jun 5, 2025 | Answer GenerationOpen-Domain Question Answering | CodeCode Available | 1 |
| NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning | May 21, 2025 | General Reinforcement LearningLogical Reasoning | CodeCode Available | 1 |
| Context Awareness Gate For Retrieval Augmented Generation | Nov 25, 2024 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 1 |
| BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression | Oct 20, 2024 | In-Context LearningLong-Context Understanding | CodeCode Available | 1 |
| Exploring Hint Generation Approaches in Open-Domain Question Answering | Sep 24, 2024 | Hint GenerationOpen-Domain Question Answering | CodeCode Available | 1 |
| W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering | Aug 15, 2024 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 1 |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Aug 12, 2024 | Answer GenerationDecoder | CodeCode Available | 1 |
| TANQ: An open domain dataset of table answered questions | May 13, 2024 | MathOpen-Domain Question Answering | CodeCode Available | 1 |
| Enhancing Contextual Understanding in Large Language Models through Contrastive Decoding | May 4, 2024 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 1 |
| Spiral of Silence: How is Large Language Model Killing Information Retrieval? -- A Case Study on Open Domain Question Answering | Apr 16, 2024 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |
| Multi-Granularity Guided Fusion-in-Decoder | Apr 3, 2024 | DecoderMulti-Task Learning | CodeCode Available | 1 |
| ArabicaQA: A Comprehensive Dataset for Arabic Question Answering | Mar 26, 2024 | BenchmarkingMachine Reading Comprehension | CodeCode Available | 1 |
| Beyond Memorization: The Challenge of Random Memory Access in Language Models | Mar 12, 2024 | MemorizationOpen-Domain Question Answering | CodeCode Available | 1 |
| Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering | Mar 8, 2024 | Answer GenerationOpen-Domain Question Answering | CodeCode Available | 1 |
| To Generate or to Retrieve? On the Effectiveness of Artificial Contexts for Medical Open-Domain Question Answering | Mar 4, 2024 | MedQAMMLU | CodeCode Available | 1 |
| REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering | Feb 27, 2024 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 1 |
| Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization | Dec 30, 2023 | Answer GenerationContrastive Learning | CodeCode Available | 1 |
| Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models | Oct 23, 2023 | Open-Domain Question AnsweringQuestion Answering | CodeCode Available | 1 |
| Merging Generated and Retrieved Knowledge for Open-Domain QA | Oct 22, 2023 | DecoderOpen-Domain Question Answering | CodeCode Available | 1 |
| MoqaGPT : Zero-Shot Multi-modal Open-domain Question Answering with Large Language Model | Oct 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning | Oct 20, 2023 | In-Context LearningMulti-hop Question Answering | CodeCode Available | 1 |
| Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators | Oct 11, 2023 | Information RetrievalInformativeness | CodeCode Available | 1 |