
Hallucination

Papers


Showing 1691–1700 of 1816 papers

Title	Date	Tasks	Status	Hype
PoLLMgraph: Unraveling Hallucinations in Large Language Models via State Transition Dynamics	Apr 6, 2024	Benchmarking, Hallucination	Code Available	0
Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning	Oct 9, 2024	Hallucination, Multiple-choice	Code Available	0
Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision	May 26, 2025	Hallucination, Math	Code Available	0
A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity Detection	Dec 16, 2024	Hallucination, In-Context Learning	Code Available	0
Spurious reconstruction from brain activity	May 16, 2024	Brain Decoding, Hallucination	Code Available	0
Im2Avatar: Colorful 3D Reconstruction from a Single Image	Apr 17, 2018	3D Reconstruction, Hallucination	Code Available	0
Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations	Jun 16, 2024	Hallucination, Misinformation	Code Available	0
ERBench: An Entity-Relationship based Automatically Verifiable Hallucination Benchmark for Large Language Models	Mar 8, 2024	Attribute, Hallucination	Code Available	0
Behind the Magic, MERLIM: Multi-modal Evaluation Benchmark for Large Image-Language Models	Dec 3, 2023	Hallucination, Visual Grounding	Code Available	0
StackRAG Agent: Improving Developer Answers with Retrieval-Augmented Generation	Jun 19, 2024	Hallucination, Retrieval	Code Available	0


