| Retrieval-Augmented Generation as Noisy In-Context Learning: A Unified Theory and Risk Bounds | Jun 3, 2025 | In-Context Learning, Natural Questions | Unverified | 0 |
| GenKI: Enhancing Open-Domain Question Answering with Knowledge Integration and Controllable Generation in Large Language Models | May 26, 2025 | Open-Domain Question Answering, Passage Retrieval | Code Available | 0 |
| HASH-RAG: Bridging Deep Hashing with Retriever for Efficient, Fine Retrieval and Augmented Generation | May 22, 2025 | Chunking, Deep Hashing | Unverified | 0 |
| Semantic Caching of Contextual Summaries for Efficient Question-Answering with Language Models | May 16, 2025 | Question Answering, Retrieval | Unverified | 0 |
| DYNAMAX: Dynamic computing for Transformers and Mamba based architectures | Apr 29, 2025 | Mamba, TriviaQA | Unverified | 0 |
| ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices | Mar 23, 2025 | Hallucination, TriviaQA | Unverified | 0 |
| CacheFocus: Dynamic Cache Re-Positioning for Efficient Retrieval-Augmented Generation | Feb 16, 2025 | Natural Questions, Retrieval | Unverified | 0 |
| Cost-Saving LLM Cascades with Early Abstention | Feb 13, 2025 | GSM8K, MMLU | Unverified | 0 |
| Self-Training Large Language Models for Tool-Use Without Demonstrations | Feb 9, 2025 | GSM8K, Mathematical Reasoning | Unverified | 0 |
| Vision-centric Token Compression in Large Language Model | Feb 2, 2025 | In-Context Learning, Language Modeling | Unverified | 0 |
| ASRank: Zero-Shot Re-Ranking with Answer Scent for Document Retrieval | Jan 25, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Improving Generated and Retrieved Knowledge Combination Through Zero-shot Generation | Dec 25, 2024 | Open-Domain Question Answering, Question Answering | Unverified | 0 |
| DragonVerseQA: Open-Domain Long-Form Context-Aware Question-Answering | Dec 21, 2024 | Articles, Form | Code Available | 0 |
| Layer Importance and Hallucination Analysis in Large Language Models via Enhanced Activation Variance-Sparsity | Nov 15, 2024 | Contrastive Learning, Hallucination | Unverified | 0 |
| Addressing Uncertainty in LLMs to Enhance Reliability in Generative AI | Nov 4, 2024 | Conformal Prediction, Prediction | Unverified | 0 |
| Rethinking Data Synthesis: A Teacher Model Training Recipe with Interpretation | Oct 27, 2024 | GSM8K, Language Modeling | Unverified | 0 |
| KV Prediction for Improved Time to First Token | Oct 10, 2024 | Code Completion, CPU | Code Available | 0 |
| Exploring Hint Generation Approaches in Open-Domain Question Answering | Sep 24, 2024 | Hint Generation, Open-Domain Question Answering | Code Available | 1 |
| SFR-RAG: Towards Contextually Faithful LLMs | Sep 16, 2024 | counterfactual, Hallucination | Unverified | 0 |
| FastFiD: Improve Inference Efficiency of Open Domain Question Answering via Sentence Selection | Aug 12, 2024 | Answer Generation, Decoder | Code Available | 1 |
| Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting | Jul 11, 2024 | ARC, RAG | Unverified | 0 |
| From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data | Jun 27, 2024 | Hallucination, Information Retrieval | Code Available | 0 |
| Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges | Jun 18, 2024 | TriviaQA | Code Available | 0 |
| CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG | Jun 17, 2024 | Misinformation, RAG | Code Available | 0 |
| RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation | Jun 9, 2024 | Document Ranking, Natural Questions | Code Available | 0 |