| KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions | Jul 8, 2024 | HallucinationKnowledge Graphs | CodeCode Available | 0 | 5 |
| From Single to Multi: How LLMs Hallucinate in Multi-Document Summarization | Oct 17, 2024 | Document SummarizationHallucination | CodeCode Available | 0 | 5 |
| Joint stereo 3D object detection and implicit surface reconstruction | Nov 25, 2021 | 3D Object DetectionHallucination | CodeCode Available | 0 | 5 |
| AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts | Apr 17, 2024 | Deep LearningHallucination | CodeCode Available | 0 | 5 |
| JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images | Sep 19, 2024 | HallucinationImage Captioning | CodeCode Available | 0 | 5 |
| keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection | May 23, 2025 | HallucinationLanguage Modeling | CodeCode Available | 0 | 5 |
| Iterative Teaching by Data Hallucination | Oct 31, 2022 | Hallucination | CodeCode Available | 0 | 5 |
| Assessing the Reliability of Large Language Model Knowledge | Oct 15, 2023 | HallucinationKnowledge Probing | CodeCode Available | 0 | 5 |
| Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models | Nov 13, 2023 | HallucinationMachine Translation | CodeCode Available | 0 | 5 |
| Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models | Oct 4, 2024 | counterfactualData Augmentation | CodeCode Available | 0 | 5 |
| Confidence Estimation for LLM-Based Dialogue State Tracking | Sep 15, 2024 | Dialogue State TrackingHallucination | CodeCode Available | 0 | 5 |
| Chain of Visual Perception: Harnessing Multimodal Large Language Models for Zero-shot Camouflaged Object Detection | Nov 19, 2023 | counterfactualHallucination | CodeCode Available | 0 | 5 |
| Investigating the performance of Retrieval-Augmented Generation and fine-tuning for the development of AI-driven knowledge-based systems | Mar 12, 2024 | Domain AdaptationHallucination | CodeCode Available | 0 | 5 |
| Characterizing Multimodal Long-form Summarization: A Case Study on Financial Reports | Apr 9, 2024 | FormHallucination | CodeCode Available | 0 | 5 |
| Instruction Makes a Difference | Feb 1, 2024 | HallucinationInstruction Following | CodeCode Available | 0 | 5 |
| Characterizing Context Influence and Hallucination in Summarization | Oct 3, 2024 | Hallucination | CodeCode Available | 0 | 5 |
| Improving Factual Error Correction by Learning to Inject Factual Errors | Dec 12, 2023 | Hallucination | CodeCode Available | 0 | 5 |
| Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators | Aug 22, 2024 | HallucinationMixture-of-Experts | CodeCode Available | 0 | 5 |
| Incorporating Task-specific Concept Knowledge into Script Learning | Aug 31, 2022 | Contrastive LearningHallucination | CodeCode Available | 0 | 5 |
| Integrating Chemistry Knowledge in Large Language Models via Prompt Engineering | Apr 22, 2024 | HallucinationPrompt Engineering | CodeCode Available | 0 | 5 |
| Im2Avatar: Colorful 3D Reconstruction from a Single Image | Apr 17, 2018 | 3D ReconstructionHallucination | CodeCode Available | 0 | 5 |
| HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Feb 25, 2024 | BenchmarkingChatbot | CodeCode Available | 0 | 5 |
| Im2Flow: Motion Hallucination from Static Images for Action Recognition | Dec 12, 2017 | Action RecognitionActivity Recognition | CodeCode Available | 0 | 5 |
| How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild | Feb 18, 2025 | ArticlesHallucination | CodeCode Available | 0 | 5 |
| A Claim Decomposition Benchmark for Long-form Answer Verification | Oct 16, 2024 | FormHallucination | CodeCode Available | 0 | 5 |