SOTAVerified

Hallucination

Papers

Showing 951975 of 1816 papers

TitleStatusHype
Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions0
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less HallucinationCode2
Chaos with Keywords: Exposing Large Language Models Sycophantic Hallucination to Misleading Keywords and Evaluating Defense Strategies0
ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints0
Confabulation: The Surprising Value of Large Language Model Hallucinations0
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework0
Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends0
OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors DetectionCode0
Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs0
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models0
How to Explore with Belief: State Entropy Maximization in POMDPs0
Ask-EDA: A Design Assistant Empowered by LLM, Hybrid RAG and Abbreviation De-hallucination0
Large Language Model Assisted Optimal Bidding of BESS in FCAS Market: An AI-agent based Approach0
Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs0
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost0
Comprehensive Evaluation of Large Language Models for Topic Modeling0
DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language ModelsCode0
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial TrainingCode1
Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts0
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools0
ANAH: Analytical Annotation of Hallucinations in Large Language ModelsCode2
NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language ModelsCode0
MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification0
MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection0
Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study0
Show:102550
← PrevPage 39 of 73Next →

No leaderboard results yet.