| On Mitigating Code LLM Hallucinations with API Documentation | Jul 13, 2024 | Hallucination, valid | Unverified | 0 |
| DAHRS: Divergence-Aware Hallucination-Remediated SRL Projection | Jul 12, 2024 | fr-en, Hallucination | Unverified | 0 |
| Mitigating Entity-Level Hallucination in Large Language Models | Jul 12, 2024 | Hallucination, Information Retrieval | Code Available | 0 |
| The Two Sides of the Coin: Hallucination Generation and Detection with LLMs as Evaluators for LLMs | Jul 12, 2024 | Hallucination | Unverified | 0 |
| Lynx: An Open Source Hallucination Evaluation Model | Jul 11, 2024 | Hallucination, Hallucination Evaluation | Unverified | 0 |
| On the Universal Truthfulness Hyperplane Inside LLMs | Jul 11, 2024 | Diversity, Domain Generalization | Code Available | 0 |
| Knowledge Overshadowing Causes Amalgamated Hallucination in Large Language Models | Jul 10, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| Learning with Instance-Dependent Noisy Labels by Anchor Hallucination and Hard Sample Label Correction | Jul 10, 2024 | Hallucination | Unverified | 0 |
| Fuse, Reason and Verify: Geometry Problem Solving with Parsed Clauses from Diagram | Jul 10, 2024 | Decoder, Geometry Problem Solving | Unverified | 0 |
| Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps | Jul 9, 2024 | Articles, Hallucination | Code Available | 2 |
| GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation | Jul 8, 2024 | Benchmarking, Graph Embedding | Unverified | 0 |
| Controllable and Reliable Knowledge-Intensive Task-Oriented Conversational Agents with Declarative Genie Worksheets | Jul 8, 2024 | Hallucination, Navigate | Code Available | 2 |
| Multi-Object Hallucination in Vision-Language Models | Jul 8, 2024 | Hallucination, Object Hallucination | Code Available | 1 |
| Vision-Language Models under Cultural and Inclusive Considerations | Jul 8, 2024 | Hallucination, Survey | Unverified | 0 |
| KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions | Jul 8, 2024 | Hallucination, Knowledge Graphs | Code Available | 0 |
| VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool | Jul 7, 2024 | Active Learning, Hallucination | Unverified | 0 |
| Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses | Jul 7, 2024 | Hallucination, Language Modeling | Code Available | 0 |
| MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? | Jul 5, 2024 | Hallucination, Image Generation | Code Available | 1 |
| Code Hallucination | Jul 5, 2024 | Hallucination | Unverified | 0 |
| ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models | Jul 5, 2024 | Hallucination, Long Form Question Answering | Code Available | 2 |
| Classification-Based Automatic HDL Code Generation Using LLMs | Jul 4, 2024 | Classification, Code Generation | Unverified | 0 |
| Zero-shot Persuasive Chatbots with LLM-Generated Strategies and Information Retrieval | Jul 4, 2024 | Chatbot, Hallucination | Unverified | 0 |
| Hallucination Detection: Robustly Discerning Reliable Answers in Large Language Models | Jul 4, 2024 | Hallucination, Question Answering | Unverified | 0 |
| STOC-TOT: Stochastic Tree-of-Thought with Constrained Decoding for Complex Reasoning in Multi-Hop Question Answering | Jul 4, 2024 | Hallucination, Multi-hop Question Answering | Unverified | 0 |
| Query-Guided Self-Supervised Summarization of Nursing Notes | Jul 4, 2024 | Abstractive Text Summarization, Domain Adaptation | Unverified | 0 |