SOTAVerified

Hallucination

Papers

Showing 151175 of 1816 papers

TitleStatusHype
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language ModelsCode2
Enabling Large Language Models to Generate Text with CitationsCode2
Medical Hallucinations in Foundation Models and Their Impact on HealthcareCode2
A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open QuestionsCode2
A Survey on Hallucination in Large Vision-Language ModelsCode2
Calibrated Self-Rewarding Vision Language ModelsCode2
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the KeyCode2
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention CausalityCode2
Dynamic Parametric Retrieval Augmented Generation for Test-time Knowledge EnhancementCode2
DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph RefinementCode2
DyFo: A Training-Free Dynamic Focus Visual Search for Enhancing LMMs in Fine-Grained Visual UnderstandingCode2
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language ModelsCode2
KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual CheckingCode2
GeoBenchX: Benchmarking LLMs for Multistep Geospatial TasksCode1
Generating Natural Language Proofs with Verifier-Guided SearchCode1
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI FeedbackCode1
Detecting and Preventing Hallucinations in Large Vision Language ModelsCode1
Detecting Hallucinated Content in Conditional Neural Sequence GenerationCode1
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object PerceptionCode1
Adversarial Feature Hallucination Networks for Few-Shot LearningCode1
Deficiency-Aware Masked Transformer for Video InpaintingCode1
Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean DiscrepancyCode1
AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination EvaluationCode1
Advancing TTP Analysis: Harnessing the Power of Large Language Models with Retrieval Augmented GenerationCode1
3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure PriorCode1
Show:102550
← PrevPage 7 of 73Next →

No leaderboard results yet.