SOTAVerified

Hallucination

Papers

Showing 6170 of 1816 papers

TitleStatusHype
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language ModelsCode3
KnowAgent: Knowledge-Augmented Planning for LLM-Based AgentsCode3
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language ModelsCode3
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language ModelsCode3
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language ModelsCode3
Automated Hypothesis Validation with Agentic Sequential FalsificationsCode3
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language ModelsCode2
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language ModelsCode2
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language ModelsCode2
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine TranslationCode2
Show:102550
← PrevPage 7 of 182Next →

No leaderboard results yet.