SOTAVerified

TruthfulQA

Papers

Showing 2650 of 80 papers

TitleStatusHype
Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency0
More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment0
When Persuasion Overrides Truth in Multi-Agent LLM Debates: Introducing a Confidence-Weighted Persuasion Override Rate (CW-POR)0
DeLTa: A Decoding Strategy based on Logit Trajectory Prediction Improves Factuality and Reasoning AbilityCode0
Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models0
Truth Knows No Language: Evaluating Truthfulness Beyond EnglishCode0
Cost-Saving LLM Cascades with Early Abstention0
Selective Self-to-Supervised Fine-Tuning for Generalization in Large Language Models0
Multi-Agent Reinforcement Learning with Focal Diversity OptimizationCode0
TruthFlow: Truthful LLM Generation via Representation Flow Correction0
CHAIR -- Classifier of Hallucination as ImproverCode0
(WhyPHI) Fine-Tuning PHI-3 for Multiple-Choice Question Answering: Methodology, Results, and ChallengesCode0
Monty Hall and Optimized Conformal Prediction to Improve Decision-Making with LLMs0
Mitigating Adversarial Attacks in LLMs through Defensive Suffix Generation0
Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages0
Layer Importance and Hallucination Analysis in Large Language Models via Enhanced Activation Variance-Sparsity0
Maintaining Informative Coherence: Migrating Hallucinations in Large Language Models via Absorbing Markov Chains0
A Debate-Driven Experiment on LLM Hallucinations and Accuracy0
Evaluating Consistencies in LLM responses through a Semantic Clustering of Question Answering0
Iter-AHMCL: Alleviate Hallucination for Large Language Model via Iterative Model-level Contrastive Learning0
SkillAggregation: Reference-free LLM-Dependent Aggregation0
NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language ModelsCode0
Towards Multilingual LLM Evaluation for European Languages0
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts0
A test suite of prompt injection attacks for LLM-based machine translationCode0
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.