
Adaptive Information Seeking for Open-Domain Question Answering

2021-09-14 · EMNLP 2021 · Code Available

Yunchang Zhu, Liang Pang, Yanyan Lan, HuaWei Shen, Xueqi Cheng



Abstract

Information seeking is an essential step for open-domain question answering to efficiently gather evidence from a large corpus. Recently, iterative approaches have proven effective for complex questions by recursively retrieving new evidence at each step. However, almost all existing iterative approaches use predefined strategies, either applying the same retrieval function multiple times or fixing the order of different retrieval functions, and so cannot meet the diverse requirements of different questions. In this paper, we propose a novel adaptive information-seeking strategy for open-domain question answering, namely AISO. Specifically, the whole retrieval and answer process is modeled as a partially observed Markov decision process, in which three types of retrieval operations (e.g., BM25, DPR, and hyperlink) and one answer operation are defined as actions. Following the learned policy, AISO can adaptively select a proper retrieval action to seek the missing evidence at each step, based on the collected evidence and the reformulated query, or directly output the answer when the evidence set is sufficient for the question. Experiments on SQuAD Open and HotpotQA fullwiki, which serve as single-hop and multi-hop open-domain QA benchmarks, show that AISO outperforms all baseline methods with predefined strategies in both retrieval and answer evaluations.
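The retrieve-or-answer loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function `seek_and_answer`, the `Policy` and `Retriever` types, the step budget, and the toy policy and retrievers are all hypothetical names chosen for illustration, and the hand-written rule below only stands in for the learned policy. Only the four action names (BM25, DPR, hyperlink, answer) come from the abstract.

```python
from typing import Callable, Dict, List, Tuple

# The "answer" action comes from the abstract; the retrieval actions
# ("bm25", "dpr", "link") appear as keys of the retrievers dict below.
ANSWER_ACTION = "answer"

# Hypothetical interfaces, assumed for this sketch only.
Retriever = Callable[[str], str]  # reformulated query -> one passage
Policy = Callable[[str, List[str]], Tuple[str, str]]  # (question, evidence) -> (action, argument)


def seek_and_answer(question: str,
                    policy: Policy,
                    retrievers: Dict[str, Retriever],
                    max_steps: int = 5) -> Tuple[str, List[str]]:
    """Loop until the policy chooses to answer or the step budget runs out."""
    evidence: List[str] = []
    for _ in range(max_steps):
        # The policy sees only the question and the evidence collected so far.
        action, argument = policy(question, evidence)
        if action == ANSWER_ACTION:
            return argument, evidence
        if action not in retrievers:
            raise ValueError(f"unknown action: {action}")
        passage = retrievers[action](argument)
        if passage not in evidence:
            evidence.append(passage)
    return "", evidence  # budget exhausted without an answer


def toy_policy(question: str, evidence: List[str]) -> Tuple[str, str]:
    """Hand-written stand-in for a learned policy (assumption)."""
    if not evidence:
        return "bm25", question   # first hop: sparse retrieval on the question
    if len(evidence) == 1:
        return "link", "Paris"    # second hop: follow a hyperlink anchor
    return ANSWER_ACTION, "Seine"  # evidence judged sufficient


# Stub retrievers returning fixed passages; real ones would query an index.
toy_retrievers: Dict[str, Retriever] = {
    "bm25": lambda query: "The Louvre is in Paris.",
    "dpr": lambda query: "(dense retrieval result, unused here)",
    "link": lambda anchor: "The Seine flows through Paris.",
}

answer, evidence = seek_and_answer(
    "Which river flows through the city that hosts the Louvre?",
    toy_policy,
    toy_retrievers,
)
```

In this toy run the policy picks a different retrieval action at each hop and stops once two passages are collected, which is the behaviour a fixed retrieval schedule cannot express.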


Benchmark Results

| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| HotpotQA | AISO | JOINT-F1 | 0.72 | | Unverified |
