DecoPrompt : Decoding Prompts Reduces Hallucinations when Large Language Models Meet False Premises

2024-11-12Code Available0· sign in to hype

Nan Xu, Xuezhe Ma

Code Available — Be the first to reproduce this paper.

Code

github.com/xunannancy/decoprompt
OfficialIn papernone★ 4

Abstract

While large language models (LLMs) have demonstrated increasing power, they have also called upon studies on their hallucinated outputs that deviate from factually correct statements. In this paper, we focus on one important scenario of false premises, where LLMs are distracted by misaligned claims although the model possesses the required factual knowledge to answer original questions accurately. Inspired by the observation that entropy of the false-premise prompt is closely related to its likelihood to elicit hallucination generation, we propose a new prompting algorithm, named DecoPrompt, to mitigate hallucination. DecoPrompt leverages LLMs to "decode" the false-premise prompts without really eliciting hallucination output from LLMs. We perform experiments on two datasets, demonstrating that DecoPrompt can reduce hallucinations effectively on outputs from different LLMs. Moreover, DecoPrompt exhibits cross-model transferability, which facilitates its applications to scenarios such as LLMs of large sizes or unavailable model logits.

Tasks

Hallucination

DecoPrompt : Decoding Prompts Reduces Hallucinations when Large Language Models Meet False Premises

Code

Abstract

Tasks

Reproductions