| HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation | Jun 11, 2024 | Hallucination, Hallucination Evaluation | Code Available | 0 |
| A Comparative Study on Language Models for Task-Oriented Dialogue Systems | Jan 21, 2022 | Dialogue State Tracking, Hallucination | Code Available | 0 |
| Characterizing Context Influence and Hallucination in Summarization | Oct 3, 2024 | Hallucination | Code Available | 0 |
| DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping | Sep 11, 2023 | Hallucination, Instruction Following | Code Available | 0 |
| Zero-Resource Hallucination Prevention for Large Language Models | Sep 6, 2023 | Hallucination | Code Available | 0 |
| Tensor feature hallucination for few-shot learning | Jun 9, 2021 | Data Augmentation, Few-Shot Learning | Code Available | 0 |
| CHAIR -- Classifier of Hallucination as Improver | Jan 5, 2025 | Hallucination, MMLU | Code Available | 0 |
| Chainpoll: A high efficacy method for LLM hallucination detection | Oct 22, 2023 | Hallucination, Retrieval-augmented Generation | Code Available | 0 |
| TUBench: Benchmarking Large Vision-Language Models on Trustworthiness with Unanswerable Questions | Oct 5, 2024 | Benchmarking, Hallucination | Code Available | 0 |
| HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language Models | Dec 29, 2024 | Hallucination, Object | Code Available | 0 |