
Hallucination Papers

Showing 526–550 of 1816 papers (page 22 of 73)

Title | Status | Hype
Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models | Code | 0
Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Code | 0
MCiteBench: A Multimodal Benchmark for Generating Text with Citations | Code | 0
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem | Code | 0
MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback | Code | 0
MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models | Code | 0
MAVEN-Fact: A Large-scale Event Factuality Detection Dataset | Code | 0
DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation | Code | 0
Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation Oncology | Code | 0
LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Code | 0
Low to High Dimensional Modality Hallucination using Aggregated Fields of View | Code | 0
Behind the Magic, MERLIM: Multi-modal Evaluation Benchmark for Large Image-Language Models | Code | 0
Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Code | 0
MedScore: Factuality Evaluation of Free-Form Medical Answers | Code | 0
Logic Query of Thoughts: Guiding Large Language Models to Answer Complex Logic Queries with Knowledge Graphs | Code | 0
LLMs and Memorization: On Quality and Specificity of Copyright Compliance | Code | 0
LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models | Code | 0
Deep CNN Denoiser and Multi-layer Neighbor Component Embedding for Face Hallucination | Code | 0
EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models | Code | 0
LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation | Code | 0
DecoPrompt : Decoding Prompts Reduces Hallucinations when Large Language Models Meet False Premises | Code | 0
LLM Inference Enhanced by External Knowledge: A Survey | Code | 0
LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries | Code | 0
LLM Internal States Reveal Hallucination Risk Faced With a Query | Code | 0
Deceptive Semantic Shortcuts on Reasoning Chains: How Far Can Models Go without Hallucination? | Code | 0
