| Paper | Date | Tasks | Code | Stars |
|---|---|---|---|---|
| LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models | May 31, 2024 | TriviaQA, TruthfulQA | Code Available | 0 |
| SaGE: Evaluating Moral Consistency in Large Language Models | Feb 21, 2024 | Decision Making, HellaSwag | Code Available | 0 |
| Unsupervised Elicitation of Language Models | Jun 11, 2025 | GSM8K, TruthfulQA | Code Available | 0 |
| VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation | Jun 25, 2024 | ARC, Benchmarking | Code Available | 0 |
| Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding | Jun 19, 2024 | Language Modeling | Code Available | 0 |
| DeLTa: A Decoding Strategy based on Logit Trajectory Prediction Improves Factuality and Reasoning Ability | Mar 4, 2025 | GSM8K, Logical Reasoning | Code Available | 0 |
| Truth Knows No Language: Evaluating Truthfulness Beyond English | Feb 13, 2025 | Informativeness, Machine Translation | Code Available | 0 |
| Truth Neurons | May 18, 2025 | TruthfulQA | Code Available | 0 |
| CHAIR -- Classifier of Hallucination as Improver | Jan 5, 2025 | Hallucination, MMLU | Code Available | 0 |
| Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback | May 24, 2023 | TriviaQA, TruthfulQA | Code Available | 0 |