Hallucination

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1201–1250 of 1816 papers

Title	Date	Tasks	Status
Hallucination Mitigation Prompts Long-term Video Understanding	Jun 17, 2024	Answer GenerationHallucination	CodeCode Available
Self-training Large Language Models through Knowledge Detection	Jun 17, 2024	HallucinationLanguage Modeling	CodeCode Available
Teaching Large Language Models to Express Knowledge Boundary from Their Own Signals	Jun 16, 2024	Hallucination	—Unverified
Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded Conversations	Jun 16, 2024	HallucinationMisinformation	CodeCode Available
Detecting and Evaluating Medical Hallucinations in Large Vision Language Models	Jun 14, 2024	HallucinationMedical Visual Question Answering	—Unverified
DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation	Jun 13, 2024	BenchmarkingHallucination	CodeCode Available
HalluDial: A Large-Scale Benchmark for Automatic Dialogue-Level Hallucination Evaluation	Jun 11, 2024	HallucinationHallucination Evaluation	CodeCode Available
Beyond Words: On Large Language Models Actionability in Mission-Critical Risk Analysis	Jun 11, 2024	HallucinationLanguage Modelling	—Unverified
Progressive Query Expansion for Retrieval Over Cost-constrained Data Sources	Jun 11, 2024	HallucinationRetrieval	—Unverified
On the Hallucination in Simultaneous Machine Translation	Jun 11, 2024	HallucinationMachine Translation	CodeCode Available
Estimating the Hallucination Rate of Generative AI	Jun 11, 2024	HallucinationIn-Context Learning	—Unverified
A Probabilistic Framework for LLM Hallucination Detection via Belief Tree Propagation	Jun 11, 2024	Hallucination	CodeCode Available
Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation	Jun 8, 2024	Abstractive Text SummarizationDialogue Generation	—Unverified
Robustness Assessment of Mathematical Reasoning in the Presence of Missing and Contradictory Conditions	Jun 7, 2024	HallucinationMathematical Reasoning	—Unverified
Chaos with Keywords: Exposing Large Language Models Sycophantic Hallucination to Misleading Keywords and Evaluating Defense Strategies	Jun 6, 2024	HallucinationKnowledge Probing	—Unverified
Confabulation: The Surprising Value of Large Language Model Hallucinations	Jun 6, 2024	HallucinationLanguage Modeling	—Unverified
ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints	Jun 6, 2024	DiagnosticHallucination	—Unverified
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework	Jun 5, 2024	Fact CheckingHallucination	—Unverified
Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends	Jun 5, 2024	Hallucination	—Unverified
Enhancing Trust in LLMs: Algorithms for Comparing and Interpreting LLMs	Jun 4, 2024	BenchmarkingFairness	—Unverified
How to Explore with Belief: State Entropy Maximization in POMDPs	Jun 4, 2024	Hallucination	—Unverified
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models	Jun 4, 2024	HallucinationInformativeness	—Unverified
OTTAWA: Optimal TransporT Adaptive Word Aligner for Hallucination and Omission Translation Errors Detection	Jun 4, 2024	HallucinationMachine Translation	CodeCode Available
Ask-EDA: A Design Assistant Empowered by LLM, Hybrid RAG and Abbreviation De-hallucination	Jun 3, 2024	HallucinationQuestion Answering	—Unverified
Large Language Model Assisted Optimal Bidding of BESS in FCAS Market: An AI-agent based Approach	Jun 3, 2024	AI AgentDeep Reinforcement Learning	—Unverified
Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs	Jun 3, 2024	Decision MakingEvent Argument Extraction	—Unverified
Luna: An Evaluation Foundation Model to Catch Language Model Hallucinations with High Accuracy and Low Cost	Jun 3, 2024	HallucinationLanguage Modeling	—Unverified
Comprehensive Evaluation of Large Language Models for Topic Modeling	Jun 2, 2024	HallucinationTopic Models	—Unverified
DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models	May 31, 2024	HallucinationModel Editing	CodeCode Available
NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language Models	May 30, 2024	Hallucination	CodeCode Available
Similarity is Not All You Need: Endowing Retrieval Augmented Generation with Multi Layered Thoughts	May 30, 2024	AllHallucination	—Unverified
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools	May 30, 2024	HallucinationRAG	—Unverified
MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification	May 29, 2024	HallucinationImage Captioning	—Unverified
MASSIVE Multilingual Abstract Meaning Representation: A Dataset and Baselines for Hallucination Detection	May 29, 2024	Abstract Meaning RepresentationHallucination	—Unverified
Two-Layer Retrieval-Augmented Generation Framework for Low-Resource Medical Question Answering Using Reddit Data: Proof-of-Concept Study	May 29, 2024	Answer GenerationHallucination	—Unverified
LLMs and Memorization: On Quality and Specificity of Copyright Compliance	May 28, 2024	HallucinationMemorization	CodeCode Available
Conv-CoA: Improving Open-domain Question Answering in Large Language Models via Conversational Chain-of-Action	May 28, 2024	Conversational Question AnsweringHallucination	—Unverified
Data-augmented phrase-level alignment for mitigating object hallucination	May 28, 2024	Data AugmentationHallucination	—Unverified
RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language Models	May 28, 2024	HallucinationMME	—Unverified
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings	May 27, 2024	Domain AdaptationGPU	—Unverified
Think Before You Act: A Two-Stage Framework for Mitigating Gender Bias Towards Vision-Language Tasks	May 27, 2024	HallucinationObject Hallucination	CodeCode Available
GeneAgent: Self-verification Language Agent for Gene Set Knowledge Discovery using Domain Databases	May 25, 2024	BenchmarkingHallucination	—Unverified
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization	May 24, 2024	Hallucination	CodeCode Available
CHARP: Conversation History AwaReness Probing for Knowledge-grounded Dialogue Systems	May 24, 2024	DiagnosticHallucination	—Unverified
Large Language Model Pruning	May 24, 2024	HallucinationLanguage Modeling	—Unverified
Scaling Laws for Discriminative Classification in Large Language Models	May 24, 2024	HallucinationLanguage Modeling	—Unverified
WISE: Rethinking the Knowledge Memory for Lifelong Model Editing of Large Language Models	May 23, 2024	HallucinationModel Editing	—Unverified
GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games	May 22, 2024	Code GenerationDecision Making	—Unverified
Less for More: Enhanced Feedback-aligned Mixed LLMs for Molecule Caption Generation and Fine-Grained NLI Evaluation	May 22, 2024	Caption GenerationHallucination	—Unverified
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models	May 22, 2024	BenchmarkingHallucination	—Unverified

Show:10 25 50

← PrevPage 25 of 37Next →

No leaderboard results yet.