| `Generalization is hallucination' through the lens of tensor completions | Feb 24, 2025 | HallucinationPosition | —Unverified | 0 |
| LLM-QE: Improving Query Expansion by Aligning Large Language Models with Ranking Preferences | Feb 24, 2025 | HallucinationInformation Retrieval | CodeCode Available | 1 |
| LettuceDetect: A Hallucination Detection Framework for RAG Applications | Feb 24, 2025 | 8kGPU | CodeCode Available | 4 |
| Uncertainty-Aware Fusion: An Ensemble Framework for Mitigating Hallucinations in Large Language Models | Feb 22, 2025 | HallucinationQuestion Answering | —Unverified | 0 |
| ZiGong 1.0: A Large Language Model for Financial Credit | Feb 22, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination | Feb 22, 2025 | HallucinationText Generation | —Unverified | 0 |
| PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning | Feb 21, 2025 | Hallucination | CodeCode Available | 2 |
| The Role of Background Information in Reducing Object Hallucination in Vision-Language Models: Insights from Cutoff API Prompting | Feb 21, 2025 | HallucinationObject | —Unverified | 0 |
| Verify when Uncertain: Beyond Self-Consistency in Black Box Hallucination Detection | Feb 20, 2025 | Hallucination | —Unverified | 0 |
| Hallucination Detection in Large Language Models with Metamorphic Relations | Feb 20, 2025 | Hallucination | —Unverified | 0 |
| Large Language Models Struggle to Describe the Haystack without Human Help: Human-in-the-loop Evaluation of LLMs | Feb 20, 2025 | HallucinationTopic Models | —Unverified | 0 |
| MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models | Feb 20, 2025 | Decision MakingHallucination | —Unverified | 0 |
| SegSub: Evaluating Robustness to Knowledge Conflicts and Hallucinations in Vision-Language Models | Feb 19, 2025 | counterfactualHallucination | CodeCode Available | 0 |
| OpenSearch-SQL: Enhancing Text-to-SQL with Dynamic Few-shot and Consistency Alignment | Feb 19, 2025 | HallucinationInstruction Following | —Unverified | 0 |
| Detecting LLM Fact-conflicting Hallucinations Enhanced by Temporal-logic-based Reasoning | Feb 19, 2025 | Hallucination | —Unverified | 0 |
| REFIND: Retrieval-Augmented Factuality Hallucination Detection in Large Language Models | Feb 19, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| What are Models Thinking about? Understanding Large Language Model Hallucinations "Psychology" through Model Inner State Analysis | Feb 19, 2025 | HallucinationLanguage Modeling | —Unverified | 0 |
| TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation | Feb 19, 2025 | Dataset GenerationGSM8K | CodeCode Available | 0 |
| Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models | Feb 18, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| CutPaste&Find: Efficient Multimodal Hallucination Detector with Visual-aid Knowledge Base | Feb 18, 2025 | AttributeHallucination | —Unverified | 0 |
| R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs | Feb 18, 2025 | HallucinationKnowledge Graphs | CodeCode Available | 1 |
| How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild | Feb 18, 2025 | ArticlesHallucination | CodeCode Available | 0 |
| Unveiling the Magic of Code Reasoning through Hypothesis Decomposition and Amendment | Feb 17, 2025 | HallucinationLogical Reasoning | CodeCode Available | 2 |
| Can Your Uncertainty Scores Detect Hallucinated Entity? | Feb 17, 2025 | HallucinationSentence | —Unverified | 0 |
| Valuable Hallucinations: Realizable Non-realistic Propositions | Feb 16, 2025 | Hallucination | —Unverified | 0 |