SOTAVerified

Hallucination

Papers

Showing 12011225 of 1816 papers

TitleStatusHype
Hallucination Mitigation Prompts Long-term Video UnderstandingCode0
Self-training Large Language Models through Knowledge DetectionCode0
Teaching Large Language Models to Express Knowledge Boundary from Their Own Signals0
Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded ConversationsCode0
Detecting and Evaluating Medical Hallucinations in Large Vision Language Models0
DefAn: Definitive Answer Dataset for LLMs Hallucination EvaluationCode0
HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination EvaluationCode0
Beyond Words: On Large Language Models Actionability in Mission-Critical Risk Analysis0
Progressive Query Expansion for Retrieval Over Cost-constrained Data Sources0
On the Hallucination in Simultaneous Machine TranslationCode0
Estimating the Hallucination Rate of Generative AI0
A Probabilistic Framework for LLM Hallucination Detection via Belief Tree PropagationCode0
Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation0
Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions0
Chaos with Keywords: Exposing Large Language Models Sycophantic Hallucination to Misleading Keywords and Evaluating Defense Strategies0
Confabulation: The Surprising Value of Large Language Model Hallucinations0
ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints0
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework0
Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends0
Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs0
How to Explore with Belief: State Entropy Maximization in POMDPs0
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models0
OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors DetectionCode0
Ask-EDA: A Design Assistant Empowered by LLM, Hybrid RAG and Abbreviation De-hallucination0
Large Language Model Assisted Optimal Bidding of BESS in FCAS Market: An AI-agent based Approach0
Show:102550
← PrevPage 49 of 73Next →

No leaderboard results yet.