| HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Feb 25, 2024 | BenchmarkingChatbot | CodeCode Available | 0 |
| A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models | Jan 2, 2024 | Financial AnalysisHallucination | CodeCode Available | 0 |
| Automating Feedback Analysis in Surgical Training: Detection, Categorization, and Assessment | Dec 1, 2024 | Action DetectionActivity Detection | CodeCode Available | 0 |
| Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated Texts | Sep 25, 2024 | Hallucination | CodeCode Available | 0 |
| Confidence-aware Denoised Fine-tuning of Off-the-shelf Models for Certified Robustness | Nov 13, 2024 | Adversarial RobustnessDenoising | CodeCode Available | 0 |
| How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities | Nov 15, 2023 | EthicsFairness | CodeCode Available | 0 |
| Prioritizing Image-Related Tokens Enhances Vision-Language Pre-Training | May 13, 2025 | HallucinationLarge Language Model | CodeCode Available | 0 |
| How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild | Feb 18, 2025 | ArticlesHallucination | CodeCode Available | 0 |
| Step-by-step Instructions and a Simple Tabular Output Format Improve the Dependency Parsing Accuracy of LLMs | Jun 11, 2025 | Dependency ParsingHallucination | CodeCode Available | 0 |
| How Helpful is Inverse Reinforcement Learning for Table-to-Text Generation? | Aug 1, 2021 | Domain AdaptationHallucination | CodeCode Available | 0 |
| A Claim Decomposition Benchmark for Long-form Answer Verification | Oct 16, 2024 | FormHallucination | CodeCode Available | 0 |
| Entity-driven Fact-aware Abstractive Summarization of Biomedical Literature | Mar 30, 2022 | Abstractive Text SummarizationArticles | CodeCode Available | 0 |
| HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models | Mar 17, 2025 | HallucinationQuestion Answering | CodeCode Available | 0 |
| Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models | Oct 15, 2024 | HallucinationLarge Language Model | CodeCode Available | 0 |
| Projected Distribution Loss for Image Enhancement | Dec 16, 2020 | DeblurringDemosaicking | CodeCode Available | 0 |
| Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion Models | Jun 24, 2024 | HallucinationImage Generation | CodeCode Available | 0 |
| What's Wrong? Refining Meeting Summaries with LLM Feedback | Jul 16, 2024 | HallucinationInformativeness | CodeCode Available | 0 |
| ToW: Thoughts of Words Improve Reasoning in Large Language Models | Oct 21, 2024 | Data AugmentationHallucination | CodeCode Available | 0 |
| Prompt Injection Detection and Mitigation via AI Multi-Agent NLP Frameworks | Mar 14, 2025 | Hallucination | CodeCode Available | 0 |
| HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Sep 30, 2024 | HallucinationObject | CodeCode Available | 0 |
| Stress-Testing Multimodal Foundation Models for Crystallographic Reasoning | Jun 16, 2025 | HallucinationSpatial Interpolation | CodeCode Available | 0 |
| Enhancing the General Agent Capabilities of Low-Parameter LLMs through Tuning and Multi-Branch Reasoning | Mar 29, 2024 | HallucinationTask Planning | CodeCode Available | 0 |
| ProveRAG: Provenance-Driven Vulnerability Analysis with Automated Retrieval-Augmented LLMs | Oct 22, 2024 | ChunkingHallucination | CodeCode Available | 0 |
| A Unified Hallucination Mitigation Framework for Large Vision-Language Models | Sep 24, 2024 | HallucinationQuestion Answering | CodeCode Available | 0 |
| HaRiM^+: Evaluating Summary Quality with Hallucination Risk | Nov 22, 2022 | Automated Writing EvaluationDecoder | CodeCode Available | 0 |
| Assessing the Reliability of Large Language Model Knowledge | Oct 15, 2023 | HallucinationKnowledge Probing | CodeCode Available | 0 |
| Are Large Language Models Good at Utility Judgments? | Mar 28, 2024 | Answer GenerationBenchmarking | CodeCode Available | 0 |
| Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses | Jul 7, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Pushing the Limits of Low-Resource Morphological Inflection | Aug 16, 2019 | Cross-Lingual TransferDecoder | CodeCode Available | 0 |
| Embedding Hallucination for Few-Shot Language Fine-tuning | May 3, 2022 | Data AugmentationHallucination | CodeCode Available | 0 |
| A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation | Jun 11, 2024 | Hallucination | CodeCode Available | 0 |
| AIGCs Confuse AI Too: Investigating and Explaining Synthetic Image-induced Hallucinations in Large Vision-Language Models | Mar 13, 2024 | Hallucination | CodeCode Available | 0 |
| Verbosity Veracity: Demystify Verbosity Compensation Behavior of Large Language Models | Nov 12, 2024 | Hallucination | CodeCode Available | 0 |
| CiteBART: Learning to Generate Citations for Local Citation Recommendation | Dec 23, 2024 | Citation PredictionCitation Recommendation | CodeCode Available | 0 |
| Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports | Apr 9, 2024 | FormHallucination | CodeCode Available | 0 |
| Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews | May 19, 2023 | Decision MakingHallucination | CodeCode Available | 0 |
| Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning | Feb 11, 2025 | HallucinationIn-Context Learning | CodeCode Available | 0 |
| Anticipation-Free Training for Simultaneous Machine Translation | Jan 30, 2022 | HallucinationMachine Translation | CodeCode Available | 0 |
| Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information | May 29, 2025 | Hallucination | CodeCode Available | 0 |
| Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant | Sep 17, 2024 | HallucinationInstruction Following | CodeCode Available | 0 |
| Handwritten Code Recognition for Pen-and-Paper CS Education | Aug 7, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Handling Ontology Gaps in Semantic Parsing | Jun 27, 2024 | HallucinationQuestion Answering | CodeCode Available | 0 |
| Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning | Jun 6, 2023 | Hallucinationreinforcement-learning | CodeCode Available | 0 |
| Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization | Oct 30, 2023 | Hallucination | CodeCode Available | 0 |
| Treble Counterfactual VLMs: A Causal Approach to Hallucination | Mar 8, 2025 | Autonomous Drivingcounterfactual | CodeCode Available | 0 |
| Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization | Sep 22, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 0 |
| HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild | Mar 7, 2024 | HallucinationQuestion Answering | CodeCode Available | 0 |
| TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation | Feb 19, 2025 | Dataset GenerationGSM8K | CodeCode Available | 0 |
| ELOQ: Resources for Enhancing LLM Detection of Out-of-Scope Questions | Oct 18, 2024 | HallucinationNatural Questions | CodeCode Available | 0 |
| Visually Dehallucinative Instruction Generation | Feb 13, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |