| Self-Training Large Language Models for Tool-Use Without Demonstrations | Feb 9, 2025 | GSM8K, Mathematical Reasoning | —Unverified | 0 |
| SFR-RAG: Towards Contextually Faithful LLMs | Sep 16, 2024 | counterfactual, Hallucination | —Unverified | 0 |
| ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices | Mar 23, 2025 | Hallucination, TriviaQA | —Unverified | 0 |
| Simple and Effective Semi-Supervised Question Answering | Apr 2, 2018 | Extractive Question-Answering, Question Answering | —Unverified | 0 |
| SKILL: Structured Knowledge Infusion for Large Language Models | May 17, 2022 | Knowledge Graphs, TriviaQA | —Unverified | 0 |
| Smarnet: Teaching Machines to Read and Comprehend Like Human | Oct 8, 2017 | Question Answering, Reading Comprehension | —Unverified | 0 |
| Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference | Sep 16, 2023 | Instruction Following, Question Answering | —Unverified | 0 |
| Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting | Jul 11, 2024 | ARC, RAG | —Unverified | 0 |
| Studying Strategically: Learning to Mask for Closed-book QA | Dec 31, 2020 | Language Modeling, Language Modelling | —Unverified | 0 |
| The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate | Feb 9, 2024 | Question Answering, TriviaQA | —Unverified | 0 |