| Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage | Dec 20, 2024 | Attribute, Benchmarking | —Unverified | 0 | 0 |
| Towards Analyzing and Mitigating Sycophancy in Large Vision-Language Models | Aug 21, 2024 | Hallucination, Prompt Engineering | —Unverified | 0 | 0 |
| Towards a Reliable Offline Personal AI Assistant for Long Duration Spaceflight | Oct 21, 2024 | Hallucination, Knowledge Graphs | —Unverified | 0 | 0 |
| CorpusLM: Towards a Unified Language Model on Corpus for Knowledge-Intensive Tasks | Feb 2, 2024 | Answer Generation, Hallucination | —Unverified | 0 | 0 |
| Towards Clinical Encounter Summarization: Learning to Compose Discharge Summaries from Prior Notes | Apr 27, 2021 | Hallucination, Informativeness | —Unverified | 0 | 0 |
| Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework | Jun 5, 2024 | Fact Checking, Hallucination | —Unverified | 0 | 0 |
| Towards Mitigating Hallucination in Large Language Models via Self-Reflection | Oct 10, 2023 | Answer Generation, Hallucination | —Unverified | 0 | 0 |
| Towards Multi-Source Retrieval-Augmented Generation via Synergizing Reasoning and Preference-Driven Retrieval | Nov 1, 2024 | Hallucination, RAG | —Unverified | 0 | 0 |
| Towards Omnidirectional Reasoning with 360-R1: A Dataset, Benchmark, and GRPO-based Method | May 20, 2025 | Hallucination, Object Localization | —Unverified | 0 | 0 |
| Towards reducing hallucination in extracting information from financial reports using Large Language Models | Oct 16, 2023 | Hallucination, Optical Character Recognition | —Unverified | 0 | 0 |