| Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion | Jan 2, 2025 | Diversity, Hallucination | Unverified | 0 |
| Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking | Jan 2, 2025 | Hallucination, Text Generation | Unverified | 0 |
| VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification | Jan 1, 2025 | Hallucination | Code Available | 1 |
| Octopus: Alleviating Hallucination via Dynamic Contrastive Decoding | Jan 1, 2025 | Hallucination | Code Available | 1 |
| Stop Learning it all to Mitigate Visual Hallucination, Focus on the Hallucination Target. | Jan 1, 2025 | All, Hallucination | Unverified | 0 |
| VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models | Jan 1, 2025 | Hallucination | Unverified | 0 |
| Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Jan 1, 2025 | Hallucination, Response Generation | Code Available | 2 |
| POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation | Jan 1, 2025 | Hallucination, Reasoning Segmentation | Unverified | 0 |
| RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human Feedback | Jan 1, 2025 | Hallucination, Image Comprehension | Code Available | 0 |
| IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models | Jan 1, 2025 | Hallucination, Multiple-choice | Unverified | 0 |
| A review of faithfulness metrics for hallucination assessment in Large Language Models | Dec 31, 2024 | Benchmarking, Hallucination | Unverified | 0 |
| Distilling Desired Comments for Enhanced Code Review with Large Language Models | Dec 29, 2024 | Dataset Distillation, Hallucination | Unverified | 0 |
| HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language Models | Dec 29, 2024 | Hallucination, Object | Code Available | 0 |
| Is Your Text-to-Image Model Robust to Caption Noise? | Dec 27, 2024 | Descriptive, Hallucination | Unverified | 0 |
| An End-to-End Depth-Based Pipeline for Selfie Image Rectification | Dec 26, 2024 | Depth Estimation, Hallucination | Unverified | 0 |
| MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models | Dec 25, 2024 | Hallucination, reinforcement-learning | Unverified | 0 |
| From Hallucinations to Facts: Enhancing Language Models with Curated Knowledge Graphs | Dec 24, 2024 | Hallucination, Knowledge Graphs | Unverified | 0 |
| Improving Factuality with Explicit Working Memory | Dec 24, 2024 | Fact Checking, Hallucination | Unverified | 0 |
| Extract Free Dense Misalignment from CLIP | Dec 24, 2024 | Hallucination, Image Generation | Code Available | 1 |
| Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation | Dec 24, 2024 | Graph Question Answering, Hallucination | Code Available | 1 |
| Multimodal Preference Data Synthetic Alignment with Reward Model | Dec 23, 2024 | 2k, Caption Generation | Code Available | 0 |
| CiteBART: Learning to Generate Citations for Local Citation Recommendation | Dec 23, 2024 | Citation Prediction, Citation Recommendation | Code Available | 0 |
| AlzheimerRAG: Multimodal Retrieval Augmented Generation for PubMed articles | Dec 21, 2024 | Articles, Decision Making | Unverified | 0 |
| Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage | Dec 20, 2024 | Attribute, Benchmarking | Unverified | 0 |
| Logical Consistency of Large Language Models in Fact-checking | Dec 20, 2024 | Fact Checking, Hallucination | Unverified | 0 |