SOTAVerified

TruthfulQA

Papers

Showing 4150 of 80 papers

TitleStatusHype
VarBench: Robust Language Model Benchmarking Through Dynamic Variable PerturbationCode0
Steering Without Side Effects: Improving Post-Deployment Control of Language ModelsCode0
Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided DecodingCode0
LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language ModelsCode0
Multi-Reference Preference Optimization for Large Language Models0
Machine Unlearning in Large Language ModelsCode1
Instruction Tuning With Loss Over InstructionsCode1
RLHF Workflow: From Reward Modeling to Online RLHFCode5
Harmonic LLMs are Trustworthy0
Student Data Paradox and Curious Case of Single Student-Tutor Model: Regressive Side Effects of Training LLMs for Personalized Learning0
Show:102550
← PrevPage 5 of 8Next →

No leaderboard results yet.