| EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset | Oct 11, 2021 | Benchmarking, Face Hallucination | Code Available | 1 | 5 |
| Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation | Mar 25, 2025 | Hallucination, Hallucination Evaluation | Code Available | 1 | 5 |
| Learning to Automate Follow-up Question Generation using Process Knowledge for Depression Triage on Reddit Posts | May 27, 2022 | Hallucination, Question Generation | Code Available | 1 | 5 |
| LiDAR-based 4D Occupancy Completion and Forecasting | Oct 17, 2023 | Autonomous Driving, Autonomous Vehicles | Code Available | 1 | 5 |
| Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning | Aug 30, 2024 | Hallucination | Code Available | 1 | 5 |
| Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions | Jun 9, 2023 | Hallucination, Language Modeling | Code Available | 1 | 5 |
| DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Jun 9, 2024 | Common Sense Reasoning, Denoising | Code Available | 1 | 5 |
| K-QA: A Real-World Medical Q&A Benchmark | Jan 25, 2024 | Hallucination, In-Context Learning | Code Available | 1 | 5 |
| Label Hallucination for Few-Shot Classification | Dec 6, 2021 | Classification, Few-Shot Learning | Code Available | 1 | 5 |
| LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video Reconstruction | Aug 22, 2023 | Hallucination, Motion Compensation | Code Available | 1 | 5 |
| Know Or Not: a library for evaluating out-of-knowledge base robustness | May 19, 2025 | Hallucination, RAG | Code Available | 1 | 5 |
| KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality | Jun 24, 2025 | Hallucination, Hallucination Evaluation | Code Available | 1 | 5 |
| Doc2Query--: When Less is More | Jan 9, 2023 | Hallucination, Retrieval | Code Available | 1 | 5 |
| Distinguishing Ignorance from Error in LLM Hallucinations | Oct 29, 2024 | Hallucination, Question Answering | Code Available | 1 | 5 |
| KoLA: Carefully Benchmarking World Knowledge of Large Language Models | Jun 15, 2023 | Benchmarking, Hallucination | Code Available | 1 | 5 |
| Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning | Jan 31, 2023 | Hallucination, Semantic Parsing | Code Available | 1 | 5 |
| A Survey of Hallucination in Large Foundation Models | Sep 12, 2023 | Hallucination, Survey | Code Available | 1 | 5 |
| Citation-Enhanced Generation for LLM-based Chatbots | Feb 25, 2024 | Chatbot, Citation Prediction | Code Available | 1 | 5 |
| DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion Models | Jun 13, 2025 | All, Hallucination | Code Available | 1 | 5 |
| Circuit Transformer: A Transformer That Preserves Logical Equivalence | Mar 14, 2024 | Hallucination | Code Available | 1 | 5 |
| DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models | Mar 1, 2024 | Hallucination, Hallucination Evaluation | Code Available | 1 | 5 |
| Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching | Jan 15, 2025 | Hallucination, Knowledge Graphs | Code Available | 1 | 5 |
| Knowledge Graph-Enhanced Large Language Models via Path Selection | Jun 19, 2024 | Hallucination, Knowledge Graphs | Code Available | 1 | 5 |
| AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant | Nov 11, 2024 | Decision Making, Hallucination | Code Available | 1 | 5 |
| CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based Tools | Jul 28, 2023 | Hallucination | Code Available | 1 | 5 |