| Assessing the Reliability of Large Language Model Knowledge | Oct 15, 2023 | HallucinationKnowledge Probing | CodeCode Available | 0 |
| Are Large Language Models Good at Utility Judgments? | Mar 28, 2024 | Answer GenerationBenchmarking | CodeCode Available | 0 |
| Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses | Jul 7, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Pushing the Limits of Low-Resource Morphological Inflection | Aug 16, 2019 | Cross-Lingual TransferDecoder | CodeCode Available | 0 |
| Embedding Hallucination for Few-Shot Language Fine-tuning | May 3, 2022 | Data AugmentationHallucination | CodeCode Available | 0 |
| A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation | Jun 11, 2024 | Hallucination | CodeCode Available | 0 |
| AIGCs Confuse AI Too: Investigating and Explaining Synthetic Image-induced Hallucinations in Large Vision-Language Models | Mar 13, 2024 | Hallucination | CodeCode Available | 0 |
| Verbosity Veracity: Demystify Verbosity Compensation Behavior of Large Language Models | Nov 12, 2024 | Hallucination | CodeCode Available | 0 |
| CiteBART: Learning to Generate Citations for Local Citation Recommendation | Dec 23, 2024 | Citation PredictionCitation Recommendation | CodeCode Available | 0 |
| Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports | Apr 9, 2024 | FormHallucination | CodeCode Available | 0 |
| Appraising the Potential Uses and Harms of LLMs for Medical Systematic Reviews | May 19, 2023 | Decision MakingHallucination | CodeCode Available | 0 |
| Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal Reasoning | Feb 11, 2025 | HallucinationIn-Context Learning | CodeCode Available | 0 |
| Anticipation-Free Training for Simultaneous Machine Translation | Jan 30, 2022 | HallucinationMachine Translation | CodeCode Available | 0 |
| Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information | May 29, 2025 | Hallucination | CodeCode Available | 0 |
| Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant | Sep 17, 2024 | HallucinationInstruction Following | CodeCode Available | 0 |
| Handwritten Code Recognition for Pen-and-Paper CS Education | Aug 7, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Handling Ontology Gaps in Semantic Parsing | Jun 27, 2024 | HallucinationQuestion Answering | CodeCode Available | 0 |
| Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement Learning | Jun 6, 2023 | Hallucinationreinforcement-learning | CodeCode Available | 0 |
| Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization | Oct 30, 2023 | Hallucination | CodeCode Available | 0 |
| Treble Counterfactual VLMs: A Causal Approach to Hallucination | Mar 8, 2025 | Autonomous Drivingcounterfactual | CodeCode Available | 0 |
| Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization | Sep 22, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 0 |
| HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild | Mar 7, 2024 | HallucinationQuestion Answering | CodeCode Available | 0 |
| TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation | Feb 19, 2025 | Dataset GenerationGSM8K | CodeCode Available | 0 |
| ELOQ: Resources for Enhancing LLM Detection of Out-of-Scope Questions | Oct 18, 2024 | HallucinationNatural Questions | CodeCode Available | 0 |
| Visually Dehallucinative Instruction Generation | Feb 13, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |