| RECONSIDER: Improved Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering | Jun 1, 2021 | Machine Reading ComprehensionNatural Questions | —Unverified | 0 |
| Relation-Guided Pre-Training for Open-Domain Question Answering | Sep 21, 2021 | Natural QuestionsOpen-Domain Question Answering | —Unverified | 0 |
| Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents | Feb 27, 2024 | Known UnknownsQuestion Answering | —Unverified | 0 |
| Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation | Oct 27, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Self-Training Large Language Models for Tool-Use Without Demonstrations | Feb 9, 2025 | GSM8KMathematical Reasoning | —Unverified | 0 |
| SFR-RAG: Towards Contextually Faithful LLMs | Sep 16, 2024 | counterfactualHallucination | —Unverified | 0 |
| ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices | Mar 23, 2025 | HallucinationTriviaQA | —Unverified | 0 |
| Simple and Effective Semi-Supervised Question Answering | Apr 2, 2018 | Extractive Question-AnsweringQuestion Answering | —Unverified | 0 |
| SKILL: Structured Knowledge Infusion for Large Language Models | May 17, 2022 | Knowledge GraphsTriviaQA | —Unverified | 0 |
| Smarnet: Teaching Machines to Read and Comprehend Like Human | Oct 8, 2017 | Question AnsweringReading Comprehension | —Unverified | 0 |
| Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference | Sep 16, 2023 | Instruction FollowingQuestion Answering | —Unverified | 0 |
| Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting | Jul 11, 2024 | ARCRAG | —Unverified | 0 |
| Studying Strategically: Learning to Mask for Closed-book QA | Dec 31, 2020 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate | Feb 9, 2024 | Question AnsweringTriviaQA | —Unverified | 0 |
| Tradeoffs in Sentence Selection Techniques for Open-Domain Question Answering | Sep 18, 2020 | Open-Domain Question AnsweringQuestion Answering | —Unverified | 0 |
| UnitedQA: A Hybrid Approach for Open Domain Question Answering | Jan 1, 2021 | Open-Domain Question AnsweringQuestion Answering | —Unverified | 0 |
| Vision-centric Token Compression in Large Language Model | Feb 2, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| When to Read Documents or QA History: On Unified and Selective Open-domain QA | Jun 7, 2023 | Natural QuestionsOpen-Domain Question Answering | —Unverified | 0 |
| Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models | May 16, 2025 | Question AnsweringRetrieval | —Unverified | 0 |
| Layer Importance and Hallucination Analysis in Large Language Models via Enhanced Activation Variance-Sparsity | Nov 15, 2024 | Contrastive LearningHallucination | —Unverified | 0 |
| LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models | May 31, 2024 | TriviaQATruthfulQA | CodeCode Available | 0 |
| KV Prediction for Improved Time to First Token | Oct 10, 2024 | Code CompletionCPU | CodeCode Available | 0 |
| RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation | Jun 9, 2024 | Document RankingNatural Questions | CodeCode Available | 0 |
| Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback | May 24, 2023 | TriviaQATruthfulQA | CodeCode Available | 0 |
| Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges | Jun 18, 2024 | TriviaQA | CodeCode Available | 0 |