Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8676–8700 of 10817 papers

Title	Date	Tasks	Status
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering	Nov 1, 2018	Factual Visual Question AnsweringGeneral Knowledge	—Unverified
An Interactive Web-Interface for Visualizing the Inner Workings of the Question Answering LSTM	Nov 1, 2018	Feature EngineeringMachine Translation	—Unverified
Explaining non-linear Classifier Decisions within Kernel-based Deep Architectures	Nov 1, 2018	General ClassificationImage Classification	—Unverified
Expletives in Universal Dependency Treebanks	Nov 1, 2018	Coreference ResolutionQuestion Answering	CodeCode Available
CogCompTime: A Tool for Understanding Time in Natural Language	Nov 1, 2018	Natural Language UnderstandingQuestion Answering	—Unverified
Exploiting Attention to Reveal Shortcomings in Memory Models	Nov 1, 2018	BIG-bench Machine LearningDecision Making	—Unverified
Preferred Answer Selection in Stack Overflow: Better Text Representations ... and Metadata, Metadata, Metadata	Nov 1, 2018	Answer SelectionCommunity Question Answering	—Unverified
OpenKE: An Open Toolkit for Knowledge Embedding	Nov 1, 2018	Information RetrievalKnowledge Graphs	CodeCode Available
Proceedings of the 6th BioASQ Workshop A challenge on large-scale biomedical semantic indexing and question answering	Nov 1, 2018	Question Answering	—Unverified
Automatic Opinion Question Generation	Nov 1, 2018	Community Question AnsweringQuestion Answering	CodeCode Available
Ontology-Based Retrieval \& Neural Approaches for BioASQ Ideal Answer Generation	Nov 1, 2018	Abstractive Text SummarizationAnswer Generation	—Unverified
Extraction Meets Abstraction: Ideal Answer Generation for Biomedical Questions	Nov 1, 2018	Abstractive Text SummarizationAnswer Generation	—Unverified
Retrieve and Re-rank: A Simple and Effective IR Approach to Simple Question Answering over Knowledge Graphs	Nov 1, 2018	Entity LinkingInformation Retrieval	—Unverified
On the Generation of Medical Question-Answer Pairs	Nov 1, 2018	DecoderDiversity	—Unverified
Interactive Instance-based Evaluation of Knowledge Base Question Answering	Nov 1, 2018	Entity LinkingKnowledge Base Question Answering	CodeCode Available
An Adaption of BIOASQ Question Answering dataset for Machine Reading systems by Manual Annotations of Answer Spans.	Nov 1, 2018	AllDomain Adaptation	—Unverified
Results of the sixth edition of the BioASQ Challenge	Nov 1, 2018	Information RetrievalQuestion Answering	—Unverified
Macquarie University at BioASQ 6b: Deep learning and deep reinforcement learning for query-based summarisation	Nov 1, 2018	Deep LearningDeep Reinforcement Learning	—Unverified
AttentionMeSH: Simple, Effective and Interpretable Automatic MeSH Indexer	Nov 1, 2018	ArticlesInformation Retrieval	—Unverified
Improving Machine Reading Comprehension with General Reading Strategies	Oct 31, 2018	ARCLanguage Modeling	CodeCode Available
On the Effectiveness of Minimal Context Selection for Robust Question Answering	Oct 30, 2018	Adversarial RobustnessQuestion Answering	—Unverified
Compositional Attention Networks for Interpretability in Natural Language Question Answering	Oct 30, 2018	Logical ReasoningQuestion Answering	—Unverified
ReviewQA: a relational aspect-based opinion reading dataset	Oct 29, 2018	Question Answering	—Unverified
Do Explanations make VQA Models more Predictable to a Human?	Oct 29, 2018	Question AnsweringVisual Question Answering	—Unverified
TallyQA: Answering Complex Counting Questions	Oct 29, 2018	AttributeObject Counting	CodeCode Available

Show:10 25 50

← PrevPage 348 of 433Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified