| GNN-RAG: Graph Neural Retrieval for Large Language Model Reasoning | May 30, 2024 | Graph Question Answering, Knowledge Graphs | Code Available | 3 |
| LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Apr 25, 2024 | GSM8K, HellaSwag | Code Available | 3 |
| ST-MoE: Designing Stable and Transferable Sparse Expert Models | Feb 17, 2022 | ARC, Common Sense Reasoning | Code Available | 3 |
| DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving | Jun 18, 2024 | Arithmetic Reasoning, Math | Code Available | 2 |
| JADE: A Linguistics-based Safety Evaluation Platform for Large Language Models | Nov 1, 2023 | Natural Questions | Code Available | 2 |
| Atlas: Few-shot Learning with Retrieval Augmented Language Models | Aug 5, 2022 | Fact Checking, Few-Shot Learning | Code Available | 2 |
| QAMPARI: An Open-domain Question Answering Benchmark for Questions with Many Answers from Multiple Paragraphs | May 25, 2022 | Answer Generation, Natural Questions | Code Available | 2 |
| Scaling Language Models: Methods, Analysis & Insights from Training Gopher | Dec 8, 2021 | Abstract Algebra, Anachronisms | Code Available | 2 |
| Relevance-guided Supervision for OpenQA with ColBERT | Jul 1, 2020 | Natural Questions, Open-Domain Question Answering | Code Available | 2 |
| Constructing and Evaluating Declarative RAG Pipelines in PyTerrier | Jun 12, 2025 | Natural Questions, RAG | Code Available | 1 |
| Lightweight and Direct Document Relevance Optimization for Generative Information Retrieval | Apr 7, 2025 | Information Retrieval, Natural Questions | Code Available | 1 |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Aug 12, 2024 | Answer Generation, Decoder | Code Available | 1 |
| LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors | Jun 20, 2024 | 16k, Instruction Following | Code Available | 1 |
| Multi-Granularity Guided Fusion-in-Decoder | Apr 3, 2024 | Decoder, Multi-Task Learning | Code Available | 1 |
| CLAPNQ: Cohesive Long-form Answers from Passages in Natural Questions for RAG systems | Apr 2, 2024 | Form, Long Form Question Answering | Code Available | 1 |
| Poisoning Retrieval Corpora by Injecting Adversarial Passages | Oct 29, 2023 | Information Retrieval, Natural Questions | Code Available | 1 |
| Model-enhanced Vector Index | Sep 23, 2023 | model, Natural Questions | Code Available | 1 |
| AdANNS: A Framework for Adaptive Semantic Search | May 30, 2023 | Natural Questions, Quantization | Code Available | 1 |
| Exploiting Abstract Meaning Representation for Open-Domain Question Answering | May 26, 2023 | Abstract Meaning Representation, Diversity | Code Available | 1 |
| Generative Retrieval via Term Set Generation | May 23, 2023 | Information Retrieval, Natural Questions | Code Available | 1 |
| Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness | Jan 21, 2023 | Diagnostic, Natural Questions | Code Available | 1 |
| On the Effectiveness of Parameter-Efficient Fine-Tuning | Nov 28, 2022 | Natural Questions, parameter-efficient fine-tuning | Code Available | 1 |
| DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering | Nov 10, 2022 | counterfactual, Data Augmentation | Code Available | 1 |
| Recitation-Augmented Language Models | Oct 4, 2022 | Natural Questions, Question Answering | Code Available | 1 |
| MultiSpanQA: A Dataset for Multi-Span Question Answering | Jul 1, 2022 | Natural Questions, Question Answering | Code Available | 1 |