| Estimating the Hallucination Rate of Generative AI | Jun 11, 2024 | HallucinationIn-Context Learning | —Unverified | 0 |
| A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation | Jun 11, 2024 | Hallucination | CodeCode Available | 0 |
| Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation | Jun 8, 2024 | Abstractive Text SummarizationDialogue Generation | —Unverified | 0 |
| Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions | Jun 7, 2024 | HallucinationMathematical Reasoning | —Unverified | 0 |
| Chaos with Keywords: Exposing Large Language Models Sycophantic Hallucination to Misleading Keywords and Evaluating Defense Strategies | Jun 6, 2024 | HallucinationKnowledge Probing | —Unverified | 0 |
| Confabulation: The Surprising Value of Large Language Model Hallucinations | Jun 6, 2024 | HallucinationLanguage Modeling | —Unverified | 0 |
| ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints | Jun 6, 2024 | DiagnosticHallucination | —Unverified | 0 |
| Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework | Jun 5, 2024 | Fact CheckingHallucination | —Unverified | 0 |
| Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends | Jun 5, 2024 | Hallucination | —Unverified | 0 |
| Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs | Jun 4, 2024 | BenchmarkingFairness | —Unverified | 0 |