| Retrieval-Augmented Generation as Noisy In-Context Learning: A Unified Theory and Risk Bounds | Jun 3, 2025 | In-Context Learning, Natural Questions | Unverified | 0 |
| GenKI: Enhancing Open-Domain Question Answering with Knowledge Integration and Controllable Generation in Large Language Models | May 26, 2025 | Open-Domain Question Answering, Passage Retrieval | Code Available | 0 |
| HASH-RAG: Bridging Deep Hashing with Retriever for Efficient, Fine Retrieval and Augmented Generation | May 22, 2025 | Chunking, Deep Hashing | Unverified | 0 |
| Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models | May 16, 2025 | Question Answering, Retrieval | Unverified | 0 |
| DYNAMAX: Dynamic computing for Transformers and Mamba based architectures | Apr 29, 2025 | Mamba, TriviaQA | Unverified | 0 |
| ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices | Mar 23, 2025 | Hallucination, TriviaQA | Unverified | 0 |
| CacheFocus: Dynamic Cache Re-Positioning for Efficient Retrieval-Augmented Generation | Feb 16, 2025 | Natural Questions, Retrieval | Unverified | 0 |
| Cost-Saving LLM Cascades with Early Abstention | Feb 13, 2025 | GSM8K, MMLU | Unverified | 0 |
| Self-Training Large Language Models for Tool-Use Without Demonstrations | Feb 9, 2025 | GSM8K, Mathematical Reasoning | Unverified | 0 |
| Vision-centric Token Compression in Large Language Model | Feb 2, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| ASRank: Zero-Shot Re-Ranking with Answer Scent for Document Retrieval | Jan 25, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation | Dec 25, 2024 | Open-Domain Question Answering, Question Answering | Unverified | 0 |
| DragonVerseQA: Open-Domain Long-Form Context-Aware Question-Answering | Dec 21, 2024 | Articles, Form | Code Available | 0 |
| Layer Importance and Hallucination Analysis in Large Language Models via Enhanced Activation Variance-Sparsity | Nov 15, 2024 | Contrastive Learning, Hallucination | Unverified | 0 |
| Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI | Nov 4, 2024 | Conformal Prediction, Prediction | Unverified | 0 |
| Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation | Oct 27, 2024 | GSM8K, Language Modeling | Unverified | 0 |
| KV Prediction for Improved Time to First Token | Oct 10, 2024 | Code Completion, CPU | Code Available | 0 |
| Exploring Hint Generation Approaches in Open-Domain Question Answering | Sep 24, 2024 | Hint Generation, Open-Domain Question Answering | Code Available | 1 |
| SFR-RAG: Towards Contextually Faithful LLMs | Sep 16, 2024 | counterfactual, Hallucination | Unverified | 0 |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Aug 12, 2024 | Answer Generation, Decoder | Code Available | 1 |
| Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting | Jul 11, 2024 | ARC, RAG | Unverified | 0 |
| From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data | Jun 27, 2024 | Hallucination, Information Retrieval | Code Available | 0 |
| Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges | Jun 18, 2024 | TriviaQA | Code Available | 0 |
| CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG | Jun 17, 2024 | Misinformation, RAG | Code Available | 0 |
| RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation | Jun 9, 2024 | Document Ranking, Natural Questions | Code Available | 0 |
| Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs | Jun 4, 2024 | Question Answering, TriviaQA | Code Available | 1 |
| LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models | May 31, 2024 | TriviaQA, TruthfulQA | Code Available | 0 |
| Accurate and Nuanced Open-QA Evaluation Through Textual Entailment | May 26, 2024 | Natural Language Inference, Open-Domain Question Answering | Code Available | 0 |
| LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Apr 25, 2024 | GSM8K, HellaSwag | Code Available | 3 |
| KS-LLM: Knowledge Selection of Large Language Models with Evidence Document for Question Answering | Apr 24, 2024 | Hallucination, Question Answering | Unverified | 0 |
| Mitigating LLM Hallucinations via Conformal Abstention | Apr 4, 2024 | Conformal Prediction, Generative Question Answering | Unverified | 0 |
| Multi-Granularity Guided Fusion-in-Decoder | Apr 3, 2024 | Decoder, Multi-Task Learning | Code Available | 1 |
| FIT-RAG: Black-Box RAG with Factual Information and Token Reduction | Mar 21, 2024 | Open-Domain Question Answering, Question Answering | Unverified | 0 |
| Unfamiliar Finetuning Examples Control How Language Models Hallucinate | Mar 8, 2024 | MMLU, Multiple-choice | Code Available | 1 |
| Harnessing Multi-Role Capabilities of Large Language Models for Open-Domain Question Answering | Mar 8, 2024 | Answer Generation, Open-Domain Question Answering | Code Available | 1 |
| Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents | Feb 27, 2024 | Known Unknowns, Question Answering | Unverified | 0 |
| Fine-Grained Self-Endorsement Improves Factuality and Reasoning | Feb 23, 2024 | GSM8K, Language Modeling | Unverified | 0 |
| The Generative AI Paradox on Evaluation: What It Can Solve, It May Not Evaluate | Feb 9, 2024 | Question Answering, TriviaQA | Unverified | 0 |
| Attendre: Wait To Attend By Retrieval With Evicted Queries in Memory-Based Transformers for Long Context Processing | Jan 10, 2024 | Decoder, Reading Comprehension | Unverified | 0 |
| Efficient Transformer Knowledge Distillation: A Performance Review | Nov 22, 2023 | Knowledge Distillation, Model Compression | Unverified | 0 |
| Noisy Pair Corrector for Dense Retrieval | Nov 7, 2023 | Code Search, Retrieval | Unverified | 0 |
| A Bias-Variance-Covariance Decomposition of Kernel Scores for Generative Models | Oct 9, 2023 | Image Generation, Question Answering | Code Available | 0 |
| Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference | Sep 16, 2023 | Instruction Following, Question Answering | Unverified | 0 |
| Generator-Retriever-Generator Approach for Open-Domain Question Answering | Jul 21, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| When to Read Documents or QA History: On Unified and Selective Open-domain QA | Jun 7, 2023 | Natural Questions, Open-Domain Question Answering | Unverified | 0 |
| Exploiting Abstract Meaning Representation for Open-Domain Question Answering | May 26, 2023 | Abstract Meaning Representation, Diversity | Code Available | 1 |
| RFiD: Towards Rational Fusion-in-Decoder for Open-Domain Question Answering | May 26, 2023 | Decoder, Natural Questions | Code Available | 0 |
| Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback | May 24, 2023 | TriviaQA, TruthfulQA | Code Available | 0 |
| Allies: Prompting Large Language Model with Beam Search | May 24, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing | May 19, 2023 | Fact Checking, Natural Questions | Code Available | 0 |