SOTAVerified

Hallucination

Papers

Showing 126150 of 1816 papers

TitleStatusHype
Knowledge Graph-Guided Retrieval Augmented GenerationCode2
Less is More: Mitigating Multimodal Hallucination from an EOS Decision PerspectiveCode2
InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference AlignmentCode2
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination MitigationCode2
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step QuestionsCode2
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language ModelsCode2
A Survey on Hallucination in Large Vision-Language ModelsCode2
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine TranslationCode2
Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image DescriptionsCode2
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image AnalysisCode2
Granite GuardianCode2
Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge EnhancementCode2
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast DecodingCode2
Calibrated Self-Rewarding Vision Language ModelsCode2
Benchmarking Large Language Models in Retrieval-Augmented GenerationCode2
MLAgentBench: Evaluating Language Agents on Machine Learning ExperimentationCode2
GPT-NER: Named Entity Recognition via Large Language ModelsCode2
FreshLLMs: Refreshing Large Language Models with Search Engine AugmentationCode2
Automatically Correcting Large Language Models: Surveying the landscape of diverse self-correction strategiesCode2
From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language ModelsCode2
FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning EvaluationCode2
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open QuestionsCode2
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective ResamplingCode2
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMsCode2
DeliLaw: A Chinese Legal Counselling System Based on a Large Language ModelCode2
Show:102550
← PrevPage 6 of 73Next →

No leaderboard results yet.