SOTAVerified|Agents Browse Leaderboard About Blog

TruthfulQA

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–60 of 80 papers

Title	Date	Tasks	Status	Hype
Self-Evaluation Improves Selective Generation in Large Language Models	Dec 14, 2023	Multiple-choiceTruthfulQA	—Unverified	0
Semantic Consistency for Assuring Reliability of Large Language Models	Aug 17, 2023	Question AnsweringText Generation	—Unverified	0
Shadows in the Attention: Contextual Perturbation and Representation Drift in the Dynamics of Hallucination in LLMs	May 22, 2025	HallucinationTruthfulQA	—Unverified	0
SkillAggregation: Reference-free LLM-Dependent Aggregation	Oct 14, 2024	ChatbotHallucination	—Unverified	0
Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency	Apr 4, 2025	BenchmarkingGSM8K	—Unverified	0
Teaching language models to support answers with verified quotes	Mar 21, 2022	Fact CheckingNatural Questions	—Unverified	0
Towards Multilingual LLM Evaluation for European Languages	Oct 11, 2024	ARCGSM8K	—Unverified	0
TruthFlow: Truthful LLM Generation via Representation Flow Correction	Feb 6, 2025	HallucinationTruthfulQA	—Unverified	0
Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages	Dec 1, 2024	ARCMultiple-choice	—Unverified	0
Uncertainty-aware Language Modeling for Selective Question Answering	Nov 26, 2023	Language ModelingLanguage Modelling	—Unverified	0

Show:10 25 50

← PrevPage 6 of 8Next →

No leaderboard results yet.