SOTAVerified

TruthfulQA

Papers

Showing 31-40 of 80 papers

Title | Status | Hype
Cost-Saving LLM Cascades with Early Abstention | | 0
LLMAuditor: A Framework for Auditing Large Language Models Using Human-in-the-Loop | | 0
DYNAMAX: Dynamic computing for Transformers and Mamba based architectures | | 0
A Debate-Driven Experiment on LLM Hallucinations and Accuracy | | 0
Efficient MAP Estimation of LLM Judgment Performance with Prior Transfer | | 0
Elastic Weight Consolidation for Full-Parameter Continual Pre-Training of Gemma2 | | 0
Evaluating Consistencies in LLM responses through a Semantic Clustering of Question Answering | | 0
GRATH: Gradual Self-Truthifying for Large Language Models | | 0
Harmonic LLMs are Trustworthy | | 0
Instruction Tuning with Human Curriculum | | 0

No leaderboard results yet.