| KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection | Oct 13, 2023 | Abstractive Text Summarization, Hallucination | Code Available | 1 | 5 |
| Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy | Feb 25, 2024 | Hallucination, Sentence | Code Available | 1 | 5 |
| AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning | Nov 25, 2024 | Hallucination, Question Answering | Code Available | 1 | 5 |
| A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation | Apr 18, 2021 | Form, Hallucination | Code Available | 1 | 5 |
| Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs | Aug 2, 2024 | Attribute, Hallucination | Code Available | 1 | 5 |
| Detecting Hallucinated Content in Conditional Neural Sequence Generation | Nov 5, 2020 | Abstractive Text Summarization, Hallucination | Code Available | 1 | 5 |
| Distinguishing Ignorance from Error in LLM Hallucinations | Oct 29, 2024 | Hallucination, Question Answering | Code Available | 1 | 5 |
| "Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference Letters | Oct 13, 2023 | Benchmarking, Fairness | Code Available | 1 | 5 |
| Know Or Not: a library for evaluating out-of-knowledge base robustness | May 19, 2025 | Hallucination, RAG | Code Available | 1 | 5 |
| Learning to Automate Follow-up Question Generation using Process Knowledge for Depression Triage on Reddit Posts | May 27, 2022 | Hallucination, Question Generation | Code Available | 1 | 5 |