| MAVEN-Fact: A Large-scale Event Factuality Detection Dataset | Jul 22, 2024 | Hallucination | CodeCode Available | 0 | 5 |
| MCiteBench: A Multimodal Benchmark for Generating Text with Citations | Mar 4, 2025 | HallucinationText Generation | CodeCode Available | 0 | 5 |
| Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Mar 27, 2024 | AttributeDiagnostic | CodeCode Available | 0 | 5 |
| From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data | Jun 27, 2024 | HallucinationInformation Retrieval | CodeCode Available | 0 | 5 |
| Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Jul 23, 2024 | HallucinationMachine Translation | CodeCode Available | 0 | 5 |
| Low to High Dimensional Modality Hallucination using Aggregated Fields of View | Jul 13, 2020 | HallucinationVocal Bursts Intensity Prediction | CodeCode Available | 0 | 5 |
| LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Mar 6, 2025 | BenchmarkingCommon Sense Reasoning | CodeCode Available | 0 | 5 |
| MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models | Oct 19, 2023 | HallucinationMathematical Reasoning | CodeCode Available | 0 | 5 |
| MedScore: Factuality Evaluation of Free-Form Medical Answers | May 24, 2025 | FormHallucination | CodeCode Available | 0 | 5 |
| Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits Calibration | Jun 26, 2025 | HallucinationText Generation | CodeCode Available | 0 | 5 |
| On hallucinations in tomographic image reconstruction | Dec 1, 2020 | HallucinationImage Reconstruction | CodeCode Available | 0 | 5 |
| LLM Internal States Reveal Hallucination Risk Faced With a Query | Jul 3, 2024 | HallucinationResponse Generation | CodeCode Available | 0 | 5 |
| LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation | Sep 30, 2024 | Code GenerationHallucination | CodeCode Available | 0 | 5 |
| LLM Inference Enhanced by External Knowledge: A Survey | May 30, 2025 | HallucinationKnowledge Graphs | CodeCode Available | 0 | 5 |
| LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries | May 19, 2025 | HallucinationRetrieval | CodeCode Available | 0 | 5 |
| LLMs and Memorization: On Quality and Specificity of Copyright Compliance | May 28, 2024 | HallucinationMemorization | CodeCode Available | 0 | 5 |
| A Comparative Study on Language Models for Task-Oriented Dialogue Systems | Jan 21, 2022 | Dialogue State TrackingHallucination | CodeCode Available | 0 | 5 |
| LightHouse: A Survey of AGI Hallucination | Jan 8, 2024 | HallucinationSurvey | CodeCode Available | 0 | 5 |
| Linear Correlation in LM's Compositional Generalization and Hallucination | Feb 6, 2025 | Hallucination | CodeCode Available | 0 | 5 |
| Leveraging Pretrained Models for Automatic Summarization of Doctor-Patient Conversations | Sep 24, 2021 | Hallucination | CodeCode Available | 0 | 5 |
| Fine-tuning Large Language Models for Improving Factuality in Legal Question Answering | Jan 11, 2025 | HallucinationQuestion Answering | CodeCode Available | 0 | 5 |
| Learning with privileged information via adversarial discriminative modality distillation | Oct 19, 2018 | Action RecognitionHallucination | CodeCode Available | 0 | 5 |
| Localizing and Mitigating Errors in Long-form Question Answering | Jul 16, 2024 | FormHallucination | CodeCode Available | 0 | 5 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | DescriptiveHallucination | CodeCode Available | 0 | 5 |
| Learning Fine-grained Domain Generalization via Hyperbolic State Space Hallucination | Apr 10, 2025 | Domain GeneralizationHallucination | CodeCode Available | 0 | 5 |
| Learning on LLM Output Signatures for gray-box LLM Behavior Analysis | Mar 18, 2025 | Hallucination | CodeCode Available | 0 | 5 |
| Fine-grained Contract NER using instruction based model | Jan 24, 2024 | Few-Shot LearningHallucination | CodeCode Available | 0 | 5 |
| AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation | Mar 14, 2025 | Abstractive Text SummarizationChunking | CodeCode Available | 0 | 5 |
| Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation | Oct 23, 2023 | Abstractive Text SummarizationDialogue Generation | CodeCode Available | 0 | 5 |
| Multi-Source Knowledge Pruning for Retrieval-Augmented Generation: A Benchmark and Empirical Study | Sep 3, 2024 | BenchmarkingHallucination | CodeCode Available | 0 | 5 |
| Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts | Aug 21, 2023 | ArticlesHallucination | CodeCode Available | 0 | 5 |
| Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models | Feb 8, 2025 | Conformal PredictionDecision Making | CodeCode Available | 0 | 5 |
| Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks | Jul 1, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 | 5 |
| Few-shot learning via tensor hallucination | Apr 19, 2021 | Data AugmentationFew-Shot Learning | CodeCode Available | 0 | 5 |
| AILS-NTUA at SemEval-2024 Task 6: Efficient model tuning for hallucination detection and analysis | Apr 1, 2024 | Binary ClassificationHallucination | CodeCode Available | 0 | 5 |
| CiteBART: Learning to Generate Citations for Local Citation Recommendation | Dec 23, 2024 | Citation PredictionCitation Recommendation | CodeCode Available | 0 | 5 |
| Fakes of Varying Shades: How Warning Affects Human Perception and Engagement Regarding LLM Hallucinations | Apr 4, 2024 | HallucinationHuman Detection | CodeCode Available | 0 | 5 |
| KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions | Jul 8, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 0 | 5 |
| keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection | May 23, 2025 | HallucinationLanguage Modeling | CodeCode Available | 0 | 5 |
| AIGCs Confuse AI Too: Investigating and Explaining Synthetic Image-induced Hallucinations in Large Vision-Language Models | Mar 13, 2024 | Hallucination | CodeCode Available | 0 | 5 |
| Iterative Teaching by Data Hallucination | Oct 31, 2022 | Hallucination | CodeCode Available | 0 | 5 |
| Investigating the performance of Retrieval-Augmented Generation and fine-tuning for the development of AI-driven knowledge-based systems | Mar 12, 2024 | Domain AdaptationHallucination | CodeCode Available | 0 | 5 |
| AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts | Apr 17, 2024 | Deep LearningHallucination | CodeCode Available | 0 | 5 |
| Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models | Nov 13, 2023 | HallucinationMachine Translation | CodeCode Available | 0 | 5 |
| Joint stereo 3D object detection and implicit surface reconstruction | Nov 25, 2021 | 3D Object DetectionHallucination | CodeCode Available | 0 | 5 |
| Integrating Chemistry Knowledge in Large Language Models via Prompt Engineering | Apr 22, 2024 | HallucinationPrompt Engineering | CodeCode Available | 0 | 5 |
| Assessing the Reliability of Large Language Model Knowledge | Oct 15, 2023 | HallucinationKnowledge Probing | CodeCode Available | 0 | 5 |
| Instruction Makes a Difference | Feb 1, 2024 | HallucinationInstruction Following | CodeCode Available | 0 | 5 |
| Incorporating Task-specific Concept Knowledge into Script Learning | Aug 31, 2022 | Contrastive LearningHallucination | CodeCode Available | 0 | 5 |
| Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models | Oct 4, 2024 | counterfactualData Augmentation | CodeCode Available | 0 | 5 |