SOTAVerified

TriviaQA

Papers

Showing 5160 of 124 papers

TitleStatusHype
From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic DataCode0
Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-JudgesCode0
CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAGCode0
RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented GenerationCode0
LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language ModelsCode0
Accurate and Nuanced Open-QA Evaluation Through Textual EntailmentCode0
KS-LLM: Knowledge Selection of Large Language Models with Evidence Document for Question Answering0
Mitigating LLM Hallucinations via Conformal Abstention0
FIT-RAG: Black-Box RAG with Factual Information and Token Reduction0
Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents0
Show:102550
← PrevPage 6 of 13Next →

No leaderboard results yet.