| Good Parenting is all you need -- Multi-agentic LLM Hallucination Mitigation | Oct 18, 2024 | All, Hallucination | Unverified | 0 |
| MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback | Oct 17, 2024 | Fact Verification, Hallucination | Code Available | 0 |
| ETF: An Entity Tracing Framework for Hallucination Detection in Code Summaries | Oct 17, 2024 | Code Summarization, Hallucination | Unverified | 0 |
| From Single to Multi: How LLMs Hallucinate in Multi-Document Summarization | Oct 17, 2024 | Document Summarization, Hallucination | Code Available | 0 |
| Utilizing Large Language Models in an iterative paradigm with domain feedback for zero-shot molecule optimization | Oct 17, 2024 | Drug Discovery, Hallucination | Unverified | 0 |
| On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation | Oct 16, 2024 | Hallucination, Natural Language Inference | Unverified | 0 |
| What Do LLMs Need to Understand Graphs: A Survey of Parametric Representation of Graphs | Oct 16, 2024 | Drug Discovery, Graph Generation | Unverified | 0 |
| Iter-AHMCL: Alleviate Hallucination for Large Language Model via Iterative Model-level Contrastive Learning | Oct 16, 2024 | Contrastive Learning, graph construction | Unverified | 0 |
| A Claim Decomposition Benchmark for Long-form Answer Verification | Oct 16, 2024 | Form, Hallucination | Code Available | 0 |
| RosePO: Aligning LLM-based Recommenders with Human Values | Oct 16, 2024 | Hallucination, Recommendation Systems | Unverified | 0 |
| When Not to Answer: Evaluating Prompts on GPT Models for Effective Abstention in Unanswerable Math Word Problems | Oct 16, 2024 | Hallucination, Math | Unverified | 0 |
| Controlled Automatic Task-Specific Synthetic Data Generation for Hallucination Detection | Oct 16, 2024 | Hallucination, In-Context Learning | Unverified | 0 |
| AGENTiGraph: An Interactive Knowledge Graph Platform for LLM-based Chatbots Utilizing Private Data | Oct 15, 2024 | Hallucination, Knowledge Graphs | Unverified | 0 |
| On the Capacity of Citation Generation by Large Language Models | Oct 15, 2024 | Attribute, Hallucination | Unverified | 0 |
| ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability | Oct 15, 2024 | Hallucination, RAG | Unverified | 0 |
| LargePiG: Your Large Language Model is Secretly a Pointer Generator | Oct 15, 2024 | Hallucination, Language Modeling | Unverified | 0 |
| Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models | Oct 15, 2024 | Hallucination, Large Language Model | Code Available | 0 |
| Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions | Oct 15, 2024 | Hallucination | Unverified | 0 |
| Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs | Oct 15, 2024 | Hallucination | Unverified | 0 |
| Can Structured Data Reduce Epistemic Uncertainty? | Oct 14, 2024 | Hallucination, Retrieval | Unverified | 0 |
| Parenting: Optimizing Knowledge Selection of Retrieval-Augmented Language Models with Parameter Decoupling and Tailored Tuning | Oct 14, 2024 | Hallucination, RAG | Unverified | 0 |
| SkillAggregation: Reference-free LLM-Dependent Aggregation | Oct 14, 2024 | Chatbot, Hallucination | Unverified | 0 |
| Medico: Towards Hallucination Detection and Correction with Multi-source Evidence Fusion | Oct 14, 2024 | Hallucination | Unverified | 0 |
| Honest AI: Fine-Tuning "Small" Language Models to Say "I Don't Know", and Reducing Hallucination in RAG | Oct 13, 2024 | Hallucination, RAG | Unverified | 0 |
| Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code | Oct 13, 2024 | Code Generation, Hallucination | Unverified | 0 |