| Constructing and Evaluating Declarative RAG Pipelines in PyTerrier | Jun 12, 2025 | Natural QuestionsRAG | CodeCode Available | 1 |
| Retrieval-Augmented Generation as Noisy In-Context Learning: A Unified Theory and Risk Bounds | Jun 3, 2025 | In-Context LearningNatural Questions | —Unverified | 0 |
| CRAFT: Training-Free Cascaded Retrieval for Tabular QA | May 21, 2025 | Natural Language QueriesNatural Questions | —Unverified | 0 |
| PoisonArena: Uncovering Competing Poisoning Attacks in Retrieval-Augmented Generation | May 18, 2025 | MisinformationNatural Questions | CodeCode Available | 0 |
| Pre-training vs. Fine-tuning: A Reproducibility Study on Dense Retrieval Knowledge Acquisition | May 12, 2025 | Contrastive LearningDecoder | CodeCode Available | 0 |
| GRADA: Graph-based Reranker against Adversarial Documents Attack | May 12, 2025 | Natural QuestionsRAG | CodeCode Available | 0 |
| Chain-of-Defensive-Thought: Structured Reasoning Elicits Robustness in Large Language Models against Reference Corruption | Apr 29, 2025 | Natural Questions | —Unverified | 0 |
| S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models | Apr 14, 2025 | Natural Questions | CodeCode Available | 0 |
| Lightweight and Direct Document Relevance Optimization for Generative Information Retrieval | Apr 7, 2025 | Information RetrievalNatural Questions | CodeCode Available | 1 |
| Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception | Feb 17, 2025 | MMLUNatural Questions | —Unverified | 0 |
| CacheFocus: Dynamic Cache Re-Positioning for Efficient Retrieval-Augmented Generation | Feb 16, 2025 | Natural QuestionsRetrieval | —Unverified | 0 |
| Optimizing Knowledge Integration in Retrieval-Augmented Generation with Self-Selection | Feb 10, 2025 | Natural QuestionsRAG | —Unverified | 0 |
| DragonVerseQA: Open-Domain Long-Form Context-Aware Question-Answering | Dec 21, 2024 | ArticlesForm | CodeCode Available | 0 |
| Advanced RAG Models with Graph Structures: Optimizing Complex Knowledge Reasoning and Text Generation | Nov 6, 2024 | Graph Neural NetworkKnowledge Graphs | —Unverified | 0 |
| ELOQ: Resources for Enhancing LLM Detection of Out-of-Scope Questions | Oct 18, 2024 | HallucinationNatural Questions | CodeCode Available | 0 |
| Local Explanations and Self-Explanations for Assessing Faithfulness in black-box LLMs | Sep 18, 2024 | Natural Questions | —Unverified | 0 |
| Investigating Context-Faithfulness in Large Language Models: The Roles of Memory Strength and Evidence Style | Sep 17, 2024 | Natural QuestionsRAG | —Unverified | 0 |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Aug 12, 2024 | Answer GenerationDecoder | CodeCode Available | 1 |
| AutoRAG-HP: Automatic Online Hyper-Parameter Tuning for Retrieval-Augmented Generation | Jun 27, 2024 | AutoMLEfficient Exploration | —Unverified | 0 |
| LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors | Jun 20, 2024 | 16kInstruction Following | CodeCode Available | 1 |
| DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving | Jun 18, 2024 | Arithmetic ReasoningMath | CodeCode Available | 2 |
| Unifying Multimodal Retrieval via Document Screenshot Embedding | Jun 17, 2024 | Language ModellingNatural Questions | —Unverified | 0 |
| Scaling the Vocabulary of Non-autoregressive Models for Efficient Generative Retrieval | Jun 10, 2024 | Inference OptimizationInformation Retrieval | —Unverified | 0 |
| RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation | Jun 9, 2024 | Document RankingNatural Questions | CodeCode Available | 0 |
| GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning | May 30, 2024 | Graph Question AnsweringKnowledge Graphs | CodeCode Available | 3 |
| DGRC: An Effective Fine-tuning Framework for Distractor Generation in Chinese Multi-choice Reading Comprehension | May 29, 2024 | Distractor GenerationMultiple-choice | —Unverified | 0 |
| LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Apr 25, 2024 | GSM8KHellaSwag | CodeCode Available | 3 |
| KazQAD: Kazakh Open-Domain Question Answering Dataset | Apr 6, 2024 | Information RetrievalMachine Translation | CodeCode Available | 0 |
| Multi-Granularity Guided Fusion-in-Decoder | Apr 3, 2024 | DecoderMulti-Task Learning | CodeCode Available | 1 |
| CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems | Apr 2, 2024 | FormLong Form Question Answering | CodeCode Available | 1 |
| Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision | Feb 26, 2024 | Answer GenerationCross-Lingual Question Answering | CodeCode Available | 0 |
| Summarization-Based Document IDs for Generative Retrieval with Language Models | Nov 14, 2023 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| JADE: A Linguistics-based Safety Evaluation Platform for Large Language Models | Nov 1, 2023 | Natural Questions | CodeCode Available | 2 |
| Poisoning Retrieval Corpora by Injecting Adversarial Passages | Oct 29, 2023 | Information RetrievalNatural Questions | CodeCode Available | 1 |
| ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks | Oct 19, 2023 | HallucinationHallucination Evaluation | —Unverified | 0 |
| Retrieval meets Long Context Large Language Models | Oct 4, 2023 | 16k4k | —Unverified | 0 |
| Model-enhanced Vector Index | Sep 23, 2023 | modelNatural Questions | CodeCode Available | 1 |
| Diversifying Question Generation over Knowledge Base via External Natural Questions | Sep 23, 2023 | DiversityNatural Questions | —Unverified | 0 |
| On Monotonic Aggregation for Open-domain QA | Aug 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| TRAQ: Trustworthy Retrieval Augmented Question Answering via Conformal Prediction | Jul 7, 2023 | Bayesian OptimizationChatbot | CodeCode Available | 0 |
| When to Read Documents or QA History: On Unified and Selective Open-domain QA | Jun 7, 2023 | Natural QuestionsOpen-Domain Question Answering | —Unverified | 0 |
| AdANNS: A Framework for Adaptive Semantic Search | May 30, 2023 | Natural QuestionsQuantization | CodeCode Available | 1 |
| Information Association for Language Model Updating by Mitigating LM-Logical Discrepancy | May 29, 2023 | Answer GenerationArticles | —Unverified | 0 |
| Exploiting Abstract Meaning Representation for Open-Domain Question Answering | May 26, 2023 | Abstract Meaning RepresentationDiversity | CodeCode Available | 1 |
| RFiD: Towards Rational Fusion-in-Decoder for Open-Domain Question Answering | May 26, 2023 | DecoderNatural Questions | CodeCode Available | 0 |
| Generative Retrieval via Term Set Generation | May 23, 2023 | Information RetrievalNatural Questions | CodeCode Available | 1 |
| CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing | May 19, 2023 | Fact CheckingNatural Questions | CodeCode Available | 0 |
| TOME: A Two-stage Approach for Model-based Retrieval | May 18, 2023 | Natural QuestionsRetrieval | —Unverified | 0 |
| MeeQA: Natural Questions in Meeting Transcripts | May 15, 2023 | Natural QuestionsQuestion Answering | CodeCode Available | 0 |
| Noise-Robust Dense Retrieval via Contrastive Alignment Post Training | Apr 6, 2023 | Data AugmentationDocument Ranking | —Unverified | 0 |