| Paper | Date | Tags | Code | Citations |
| --- | --- | --- | --- | --- |
| When Hindsight is Not 20/20: Testing Limits on Reflective Thinking in Large Language Models | Apr 14, 2024 | TruthfulQA | Code Available | 0 |
| PoLLMgraph: Unraveling Hallucinations in Large Language Models via State Transition Dynamics | Apr 6, 2024 | Benchmarking, Hallucination | Code Available | 0 |
| PRobELM: Plausibility Ranking Evaluation for Language Models | Apr 4, 2024 | Question Answering, TruthfulQA | Unverified | 0 |
| Non-Linear Inference Time Intervention: Improving LLM Truthfulness | Mar 27, 2024 | Large Language Model, Multiple-choice | Code Available | 1 |
| In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation | Mar 3, 2024 | Hallucination, TruthfulQA | Code Available | 2 |
| TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space | Feb 27, 2024 | Contrastive Learning, Hallucination | Code Available | 2 |
| SaGE: Evaluating Moral Consistency in Large Language Models | Feb 21, 2024 | Decision Making, HellaSwag | Code Available | 0 |
| LLMAuditor: A Framework for Auditing Large Language Models Using Human-in-the-Loop | Feb 14, 2024 | Hallucination, TruthfulQA | Unverified | 0 |
| Self-Alignment for Factuality: Mitigating Hallucinations in LLMs via Self-Evaluation | Feb 14, 2024 | TruthfulQA | Unverified | 0 |
| GRATH: Gradual Self-Truthifying for Large Language Models | Jan 22, 2024 | TruthfulQA | Unverified | 0 |