SOTAVerified|Agents Browse Leaderboard About Blog

TruthfulQA

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 31–40 of 80 papers

Title	Date	Tasks	Status	Hype
Truth Knows No Language: Evaluating Truthfulness Beyond English	Feb 13, 2025	InformativenessMachine Translation	CodeCode Available	0
Cost-Saving LLM Cascades with Early Abstention	Feb 13, 2025	GSM8KMMLU	—Unverified	0
Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models	Feb 12, 2025	Mathematical ReasoningMMLU	—Unverified	0
Multi-Agent Reinforcement Learning with Focal Diversity Optimization	Feb 6, 2025	DiversityMulti-agent Reinforcement Learning	CodeCode Available	0
TruthFlow: Truthful LLM Generation via Representation Flow Correction	Feb 6, 2025	HallucinationTruthfulQA	—Unverified	0
CHAIR -- Classifier of Hallucination as Improver	Jan 5, 2025	HallucinationMMLU	CodeCode Available	0
(WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and Challenges	Jan 3, 2025	Multiple-choiceQuestion Answering	CodeCode Available	0
Monty Hall and Optimized Conformal Prediction to Improve Decision-Making with LLMs	Dec 31, 2024	Conformal PredictionDecision Making	—Unverified	0
Mitigating Adversarial Attacks in LLMs through Defensive Suffix Generation	Dec 18, 2024	TruthfulQA	—Unverified	0
Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages	Dec 1, 2024	ARCMultiple-choice	—Unverified	0

Show:10 25 50

← PrevPage 4 of 8Next →

No leaderboard results yet.