| Data-Centric Human Preference Optimization with Rationales | Jul 19, 2024 | Hallucination | CodeCode Available | 0 | 5 |
| Mitigating Entity-Level Hallucination in Large Language Models | Jul 12, 2024 | HallucinationInformation Retrieval | CodeCode Available | 0 | 5 |
| "Merge Conflicts!" Exploring the Impacts of External Distractors to Parametric Knowledge Graphs | Sep 15, 2023 | HallucinationKnowledge Graphs | CodeCode Available | 0 | 5 |
| DAHL: Domain-specific Automated Hallucination Evaluation of Long-Form Text through a Benchmark Dataset in Biomedicine | Nov 14, 2024 | FormHallucination | CodeCode Available | 0 | 5 |
| Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models | Oct 15, 2024 | HallucinationLarge Language Model | CodeCode Available | 0 | 5 |
| Mitigating Hallucination in Abstractive Summarization with Domain-Conditional Mutual Information | Apr 15, 2024 | Abstractive Text SummarizationHallucination | CodeCode Available | 0 | 5 |
| DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models | May 31, 2024 | HallucinationModel Editing | CodeCode Available | 0 | 5 |
| MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRA | Dec 19, 2023 | Document ClassificationHallucination | CodeCode Available | 0 | 5 |
| Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization | Mar 3, 2025 | HallucinationHallucination Evaluation | CodeCode Available | 0 | 5 |
| MedScore: Factuality Evaluation of Free-Form Medical Answers | May 24, 2025 | FormHallucination | CodeCode Available | 0 | 5 |
| MedTSS: transforming abstractive summarization of scientific articles with linguistic analysis and concept reinforcement | Jan 30, 2024 | Abstractive Text SummarizationArticles | CodeCode Available | 0 | 5 |
| Mitigating Hallucination in Fictional Character Role-Play | Jun 25, 2024 | HallucinationWorld Knowledge | CodeCode Available | 0 | 5 |
| Cross-modal Learning by Hallucinating Missing Modalities in RGB-D Vision | Jan 1, 2019 | Action RecognitionHallucination | CodeCode Available | 0 | 5 |
| MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback | Oct 17, 2024 | Fact VerificationHallucination | CodeCode Available | 0 | 5 |
| MAVEN-Fact: A Large-scale Event Factuality Detection Dataset | Jul 22, 2024 | Hallucination | CodeCode Available | 0 | 5 |
| MCiteBench: A Multimodal Benchmark for Generating Text with Citations | Mar 4, 2025 | HallucinationText Generation | CodeCode Available | 0 | 5 |
| Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Mar 27, 2024 | AttributeDiagnostic | CodeCode Available | 0 | 5 |
| Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Jul 23, 2024 | HallucinationMachine Translation | CodeCode Available | 0 | 5 |
| Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation | Oct 25, 2023 | Data-to-Text GenerationHallucination | CodeCode Available | 0 | 5 |
| MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models | Oct 19, 2023 | HallucinationMathematical Reasoning | CodeCode Available | 0 | 5 |
| Low to High Dimensional Modality Hallucination using Aggregated Fields of View | Jul 13, 2020 | HallucinationVocal Bursts Intensity Prediction | CodeCode Available | 0 | 5 |
| Crafting In-context Examples according to LMs' Parametric Knowledge | Nov 16, 2023 | HallucinationIn-Context Learning | CodeCode Available | 0 | 5 |
| LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Mar 6, 2025 | BenchmarkingCommon Sense Reasoning | CodeCode Available | 0 | 5 |
| Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs | Jun 17, 2024 | counterfactualHallucination | CodeCode Available | 0 | 5 |
| A Unified Hallucination Mitigation Framework for Large Vision-Language Models | Sep 24, 2024 | HallucinationQuestion Answering | CodeCode Available | 0 | 5 |
| Correction with Backtracking Reduces Hallucination in Summarization | Oct 24, 2023 | Abstractive Text SummarizationHallucination | CodeCode Available | 0 | 5 |
| MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language Models | Feb 28, 2025 | Decision MakingHallucination | CodeCode Available | 0 | 5 |
| Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits Calibration | Jun 26, 2025 | HallucinationText Generation | CodeCode Available | 0 | 5 |
| Conversational Gold: Evaluating Personalized Conversational Search System using Gold Nuggets | Mar 12, 2025 | Answer GenerationConversational Search | CodeCode Available | 0 | 5 |
| LLMs and Memorization: On Quality and Specificity of Copyright Compliance | May 28, 2024 | HallucinationMemorization | CodeCode Available | 0 | 5 |
| Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting Framework | Sep 24, 2024 | Benchmarkingcounterfactual | CodeCode Available | 0 | 5 |
| Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization | May 24, 2024 | Hallucination | CodeCode Available | 0 | 5 |
| LLM Inference Enhanced by External Knowledge: A Survey | May 30, 2025 | HallucinationKnowledge Graphs | CodeCode Available | 0 | 5 |
| LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation | Sep 30, 2024 | Code GenerationHallucination | CodeCode Available | 0 | 5 |
| LLM Internal States Reveal Hallucination Risk Faced With a Query | Jul 3, 2024 | HallucinationResponse Generation | CodeCode Available | 0 | 5 |
| LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries | May 19, 2025 | HallucinationRetrieval | CodeCode Available | 0 | 5 |
| A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models | Jan 2, 2024 | Financial AnalysisHallucination | CodeCode Available | 0 | 5 |
| Linear Correlation in LM's Compositional Generalization and Hallucination | Feb 6, 2025 | Hallucination | CodeCode Available | 0 | 5 |
| Logic Query of Thoughts: Guiding Large Language Models to Answer Complex Logic Queries with Knowledge Graphs | Mar 17, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 0 | 5 |
| Leveraging Pretrained Models for Automatic Summarization of Doctor-Patient Conversations | Sep 24, 2021 | Hallucination | CodeCode Available | 0 | 5 |
| Learning with privileged information via adversarial discriminative modality distillation | Oct 19, 2018 | Action RecognitionHallucination | CodeCode Available | 0 | 5 |
| Confidence Estimation for LLM-Based Dialogue State Tracking | Sep 15, 2024 | Dialogue State TrackingHallucination | CodeCode Available | 0 | 5 |
| Confidence-aware Denoised Fine-tuning of Off-the-shelf Models for Certified Robustness | Nov 13, 2024 | Adversarial RobustnessDenoising | CodeCode Available | 0 | 5 |
| Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant | Sep 17, 2024 | HallucinationInstruction Following | CodeCode Available | 0 | 5 |
| Learning on LLM Output Signatures for gray-box LLM Behavior Analysis | Mar 18, 2025 | Hallucination | CodeCode Available | 0 | 5 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | DescriptiveHallucination | CodeCode Available | 0 | 5 |
| Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts | Aug 21, 2023 | ArticlesHallucination | CodeCode Available | 0 | 5 |
| Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models | Feb 8, 2025 | Conformal PredictionDecision Making | CodeCode Available | 0 | 5 |
| Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks | Jul 1, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 | 5 |
| Language Models Hallucinate, but May Excel at Fact Verification | Oct 23, 2023 | Fact VerificationHallucination | CodeCode Available | 0 | 5 |