| From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data | Jun 27, 2024 | Hallucination, Information Retrieval | Code Available | 0 |
| Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges | Jun 18, 2024 | TriviaQA | Code Available | 0 |
| CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG | Jun 17, 2024 | Misinformation, RAG | Code Available | 0 |
| RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation | Jun 9, 2024 | Document Ranking, Natural Questions | Code Available | 0 |
| LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models | May 31, 2024 | TriviaQA, TruthfulQA | Code Available | 0 |
| Accurate and Nuanced Open-QA Evaluation Through Textual Entailment | May 26, 2024 | Natural Language Inference, Open-Domain Question Answering | Code Available | 0 |
| KS-LLM: Knowledge Selection of Large Language Models with Evidence Document for Question Answering | Apr 24, 2024 | Hallucination, Question Answering | —Unverified | 0 |
| Mitigating LLM Hallucinations via Conformal Abstention | Apr 4, 2024 | Conformal Prediction, Generative Question Answering | —Unverified | 0 |
| FIT-RAG: Black-Box RAG with Factual Information and Token Reduction | Mar 21, 2024 | Open-Domain Question Answering, Question Answering | —Unverified | 0 |
| Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents | Feb 27, 2024 | Known Unknowns, Question Answering | —Unverified | 0 |