Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 6251–6275 of 10817 papers

Title	Date	Tasks	Status	Hype
HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation	Oct 16, 2021	Entity AlignmentQuestion Answering	—Unverified	0
Pro-KD: Progressive Distillation by Following the Footsteps of the Teacher	Oct 16, 2021	image-classificationImage Classification	—Unverified	0
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation	Oct 16, 2021	Common Sense ReasoningEmbodied Question Answering	—Unverified	0
Open Domain Question Answering with A Unified Knowledge Interface	Oct 16, 2021	Data-to-Text GenerationNatural Questions	CodeCode Available	1
Towards Transparent Interactive Semantic Parsing via Step-by-Step Correction	Oct 15, 2021	Question AnsweringSemantic Parsing	CodeCode Available	0
BBQ: A Hand-Built Bias Benchmark for Question Answering	Oct 15, 2021	Question Answering	CodeCode Available	1
Tracing Origins: Coreference-aware Machine Reading Comprehension	Oct 15, 2021	Language ModelingLanguage Modelling	CodeCode Available	1
Attacking Open-domain Question Answering by Injecting Misinformation	Oct 15, 2021	MisinformationOpen-Domain Question Answering	CodeCode Available	0
A Survey on State-of-the-art Techniques for Knowledge Graphs Construction and Challenges ahead	Oct 15, 2021	Knowledge GraphsLogical Reasoning	—Unverified	0
MixQG: Neural Question Generation with Mixed Answer Types	Oct 15, 2021	Multiple-choiceQuestion Answering	CodeCode Available	1
CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training	Oct 14, 2021	Open-Domain Question AnsweringQuestion Answering	CodeCode Available	1
Can Explanations Be Useful for Calibrating Black Box Models?	Oct 14, 2021	Extractive Question-AnsweringFew-Shot Learning	CodeCode Available	1
Retrieval-guided Counterfactual Generation for QA	Oct 14, 2021	counterfactualData Augmentation	—Unverified	0
Cross-Lingual Open-Domain Question Answering with Answer Sentence Generation	Oct 14, 2021	Answer GenerationGenerative Question Answering	—Unverified	0
Open-Domain Question-Answering for COVID-19 and Other Emergent Domains	Oct 13, 2021	DiversityMisinformation	CodeCode Available	0
MMIU: Dataset for Visual Intent Understanding in Multimodal Assistants	Oct 13, 2021	intent-classificationIntent Classification	—Unverified	0
Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?	Oct 13, 2021	Open-Domain Question AnsweringPassage Retrieval	CodeCode Available	1
Improving Users' Mental Model with Attention-directed Counterfactual Edits	Oct 13, 2021	counterfactualQuestion Answering	—Unverified	0
Systematic Inequalities in Language Technology Performance across the World's Languages	Oct 13, 2021	Dependency ParsingMachine Translation	CodeCode Available	0
ConditionalQA: A Complex Reading Comprehension Dataset with Conditional Answers	Oct 13, 2021	Logical ReasoningQuestion Answering	CodeCode Available	1
A Survey on Legal Question Answering Systems	Oct 12, 2021	legal outcome extractionQuestion Answering	—Unverified	0
Mention Memory: incorporating textual knowledge into Transformers through entity mention attention	Oct 12, 2021	Claim VerificationNatural Language Understanding	CodeCode Available	0
Attention-guided Generative Models for Extractive Question Answering	Oct 12, 2021	DecoderExtractive Question-Answering	—Unverified	0
Explainable Fact-checking through Question Answering	Oct 11, 2021	Decision MakingFact Checking	—Unverified	0
Pano-AVQA: Grounded Audio-Visual Question Answering on 360^ Videos	Oct 11, 2021	Audio-visual Question AnsweringQuestion Answering	CodeCode Available	1

Show:10 25 50

← PrevPage 251 of 433Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified