| Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models | Feb 28, 2024 | BenchmarkingHallucination | CodeCode Available | 0 |
| Visually Dehallucinative Instruction Generation: Know What You Don't Know | Feb 15, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| HALOS: Hallucination-free Organ Segmentation after Organ Resection Surgery | Mar 14, 2023 | AnatomyDeep Learning | CodeCode Available | 0 |
| An Investigation of Evaluation Metrics for Automated Medical Note Generation | May 27, 2023 | Graph EmbeddingHallucination | CodeCode Available | 0 |
| Teacher-Student Adversarial Depth Hallucination to Improve Face Recognition | Apr 6, 2021 | Face RecognitionGenerative Adversarial Network | CodeCode Available | 0 |
| HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection | Sep 26, 2024 | Hallucination | CodeCode Available | 0 |
| Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models | Nov 3, 2024 | HallucinationInstruction Following | CodeCode Available | 0 |
| HALO: Hallucination Analysis and Learning Optimization to Empower LLMs with Retrieval-Augmented Context for Guided Clinical Decision Making | Sep 16, 2024 | Answer GenerationDecision Making | CodeCode Available | 0 |
| An Inflectional Database for Gitksan | Jun 1, 2022 | Data AugmentationHallucination | CodeCode Available | 0 |
| HalluShift: Measuring Distribution Shifts towards Hallucination Detection in LLMs | Apr 13, 2025 | HallucinationMisinformation | CodeCode Available | 0 |
| HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation | Jun 11, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 0 |
| A Comparative Study on Language Models for Task-Oriented Dialogue Systems | Jan 21, 2022 | Dialogue State TrackingHallucination | CodeCode Available | 0 |
| Characterizing Context Influence and Hallucination in Summarization | Oct 3, 2024 | Hallucination | CodeCode Available | 0 |
| DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping | Sep 11, 2023 | HallucinationInstruction Following | CodeCode Available | 0 |
| Zero-Resource Hallucination Prevention for Large Language Models | Sep 6, 2023 | Hallucination | CodeCode Available | 0 |
| Tensor feature hallucination for few-shot learning | Jun 9, 2021 | Data AugmentationFew-Shot Learning | CodeCode Available | 0 |
| CHAIR -- Classifier of Hallucination as Improver | Jan 5, 2025 | HallucinationMMLU | CodeCode Available | 0 |
| Chainpoll: A high efficacy method for LLM hallucination detection | Oct 22, 2023 | HallucinationRetrieval-augmented Generation | CodeCode Available | 0 |
| TUBench: Benchmarking Large Vision-Language Models on Trustworthiness with Unanswerable Questions | Oct 5, 2024 | BenchmarkingHallucination | CodeCode Available | 0 |
| HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language Models | Dec 29, 2024 | HallucinationObject | CodeCode Available | 0 |
| VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images | Aug 28, 2024 | Hallucination | CodeCode Available | 0 |
| Reducing Quantity Hallucinations in Abstractive Summarization | Sep 28, 2020 | Abstractive Text SummarizationHallucination | CodeCode Available | 0 |
| ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination Detection | Jul 18, 2024 | Cross-Lingual TransferHallucination | CodeCode Available | 0 |
| HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN | Dec 10, 2019 | Action AnticipationAction Classification | CodeCode Available | 0 |
| Re-Ex: Revising after Explanation Reduces the Factual Errors in LLM Responses | Feb 27, 2024 | Hallucination | CodeCode Available | 0 |