| Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Apr 4, 2025 | ClusteringHallucination | CodeCode Available | 1 | 5 |
| Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method | May 22, 2023 | BenchmarkingHallucination | CodeCode Available | 1 | 5 |
| Automatic Curriculum Expert Iteration for Reliable LLM Reasoning | Oct 10, 2024 | HallucinationLogical Reasoning | CodeCode Available | 1 | 5 |
| MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context | Jul 3, 2024 | HallucinationResponse Generation | CodeCode Available | 1 | 5 |
| MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Apr 17, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset | Oct 11, 2021 | BenchmarkingFace Hallucination | CodeCode Available | 1 | 5 |
| Dataset Distillation via Factorization | Oct 30, 2022 | Dataset DistillationHallucination | CodeCode Available | 1 | 5 |
| MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models | Jun 9, 2025 | DiagnosticHallucination | CodeCode Available | 1 | 5 |
| ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark | Jan 9, 2025 | FairnessHallucination | CodeCode Available | 1 | 5 |
| High-resolution Face Swapping via Latent Semantics Disentanglement | Mar 30, 2022 | DisentanglementFace Swapping | CodeCode Available | 1 | 5 |
| Lyra: Orchestrating Dual Correction in Automated Theorem Proving | Sep 27, 2023 | Automated Theorem ProvingHallucination | CodeCode Available | 1 | 5 |
| Holistic Analysis of Hallucination in GPT-4V(ision): Bias and Interference Challenges | Nov 6, 2023 | Hallucination | CodeCode Available | 1 | 5 |
| How Language Model Hallucinations Can Snowball | May 22, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| Med-HALT: Medical Domain Hallucination Test for Large Language Models | Jul 28, 2023 | HallucinationInformation Retrieval | CodeCode Available | 1 | 5 |
| Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations | Oct 6, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception | May 24, 2024 | Hallucination | CodeCode Available | 1 | 5 |
| BachGAN: High-Resolution Image Synthesis from Salient Object Layout | Mar 26, 2020 | Generative Adversarial NetworkHallucination | CodeCode Available | 1 | 5 |
| Balanced Classification: A Unified Framework for Long-Tailed Object Detection | Aug 4, 2023 | HallucinationLong-tailed Object Detection | CodeCode Available | 1 | 5 |
| DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Jun 9, 2024 | Common Sense ReasoningDenoising | CodeCode Available | 1 | 5 |
| Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources | May 22, 2023 | HallucinationLanguage Modelling | CodeCode Available | 1 | 5 |
| A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs | May 13, 2025 | HallucinationUncertainty Quantification | CodeCode Available | 1 | 5 |
| A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity | Feb 8, 2023 | Code GenerationHallucination | CodeCode Available | 1 | 5 |
| Robust 3D Object Detection from LiDAR-Radar Point Clouds via Cross-Modal Feature Augmentation | Sep 29, 2023 | 3D Object DetectionAttribute | CodeCode Available | 1 | 5 |
| Enhancing LLM's Cognition via Structurization | Jul 23, 2024 | HallucinationHallucination Evaluation | CodeCode Available | 1 | 5 |
| Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow | Feb 28, 2025 | HallucinationObject | CodeCode Available | 1 | 5 |