| HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Feb 25, 2024 | BenchmarkingChatbot | CodeCode Available | 0 |
| A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models | Jan 2, 2024 | Financial AnalysisHallucination | CodeCode Available | 0 |
| Automating Feedback Analysis in Surgical Training: Detection, Categorization, and Assessment | Dec 1, 2024 | Action DetectionActivity Detection | CodeCode Available | 0 |
| Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts | Sep 25, 2024 | Hallucination | CodeCode Available | 0 |
| Confidence-aware Denoised Fine-tuning of Off-the-shelf Models for Certified Robustness | Nov 13, 2024 | Adversarial RobustnessDenoising | CodeCode Available | 0 |
| How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities | Nov 15, 2023 | EthicsFairness | CodeCode Available | 0 |
| Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training | May 13, 2025 | HallucinationLarge Language Model | CodeCode Available | 0 |
| How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild | Feb 18, 2025 | ArticlesHallucination | CodeCode Available | 0 |
| Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs | Jun 11, 2025 | Dependency ParsingHallucination | CodeCode Available | 0 |
| How Helpful is Inverse Reinforcement Learning for Table-to-Text Generation? | Aug 1, 2021 | Domain AdaptationHallucination | CodeCode Available | 0 |
| A Claim Decomposition Benchmark for Long-form Answer Verification | Oct 16, 2024 | FormHallucination | CodeCode Available | 0 |
| Entity-driven Fact-aware Abstractive Summarization of Biomedical Literature | Mar 30, 2022 | Abstractive Text SummarizationArticles | CodeCode Available | 0 |
| HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models | Mar 17, 2025 | HallucinationQuestion Answering | CodeCode Available | 0 |
| Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models | Oct 15, 2024 | HallucinationLarge Language Model | CodeCode Available | 0 |
| Projected Distribution Loss for Image Enhancement | Dec 16, 2020 | DeblurringDemosaicking | CodeCode Available | 0 |
| Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models | Jun 24, 2024 | HallucinationImage Generation | CodeCode Available | 0 |
| What's Wrong? Refining Meeting Summaries with LLM Feedback | Jul 16, 2024 | HallucinationInformativeness | CodeCode Available | 0 |
| ToW: Thoughts of Words Improve Reasoning in Large Language Models | Oct 21, 2024 | Data AugmentationHallucination | CodeCode Available | 0 |
| Prompt Injection Detection and Mitigation via AI Multi-Agent NLP Frameworks | Mar 14, 2025 | Hallucination | CodeCode Available | 0 |
| HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Sep 30, 2024 | HallucinationObject | CodeCode Available | 0 |
| Stress-Testing Multimodal Foundation Models for Crystallographic Reasoning | Jun 16, 2025 | HallucinationSpatial Interpolation | CodeCode Available | 0 |
| Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning | Mar 29, 2024 | HallucinationTask Planning | CodeCode Available | 0 |
| ProveRAG: Provenance-Driven Vulnerability Analysis with Automated Retrieval-Augmented LLMs | Oct 22, 2024 | ChunkingHallucination | CodeCode Available | 0 |
| A Unified Hallucination Mitigation Framework for Large Vision-Language Models | Sep 24, 2024 | HallucinationQuestion Answering | CodeCode Available | 0 |
| HaRiM^+: Evaluating Summary Quality with Hallucination Risk | Nov 22, 2022 | Automated Writing EvaluationDecoder | CodeCode Available | 0 |