SOTAVerified

Chain-of-Action: Faithful and Multimodal Question Answering through Large Language Models

2024-03-26Code Available0· sign in to hype

Zhenyu Pan, Haozheng Luo, Manling Li, Han Liu

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We present a Chain-of-Action (CoA) framework for multimodal and retrieval-augmented Question-Answering (QA). Compared to the literature, CoA overcomes two major challenges of current QA applications: (i) unfaithful hallucination that is inconsistent with real-time or domain facts and (ii) weak reasoning performance over compositional information. Our key contribution is a novel reasoning-retrieval mechanism that decomposes a complex question into a reasoning chain via systematic prompting and pre-designed actions. Methodologically, we propose three types of domain-adaptable `Plug-and-Play' actions for retrieving real-time information from heterogeneous sources. We also propose a multi-reference faith score (MRFS) to verify and resolve conflicts in the answers. Empirically, we exploit both public benchmarks and a Web3 case study to demonstrate the capability of CoA over other methods.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
FEVERCoA w/o actionsEM54.2Unverified
FEVERDSPEM62.2Unverified
FEVERSelf-AskEM64.2Unverified
FEVERCoAEM68.9Unverified
FEVERZero-shotEM50Unverified
StrategyQACoAEM79.2Unverified
StrategyQASearchChainEM77Unverified
StrategyQACoA w/o actionsEM70.6Unverified
StrategyQALeast-to-MostEM65.8Unverified
TruthfulQACoA w/o actionsEM63.3Unverified
TruthfulQACoAEM67.3Unverified
WebQuestionsSelf-AskEM31.1Unverified
WebQuestionsToTEM26.3Unverified
WebQuestionsZero-shotEM43Unverified
WebQuestionsFew-shotEM44.7Unverified
WebQuestionsDSPEM59.4Unverified
WebQuestionsCoA w/o actionsEM64.7Unverified
WebQuestionsCoTEM42.5Unverified
WebQuestionsCoAEM70.7Unverified
WebQuestionsReactEM38.3Unverified

Reproductions