Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotpotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
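For readers unfamiliar with the two metrics, here is a minimal sketch of how EM and token-level F1 are commonly computed for extractive question answering. It assumes the SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles); the function names are illustrative and not taken from any particular library.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        # Both empty counts as a match; exactly one empty counts as a miss.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower"))
    print(f1_score("Eiffel Tower in Paris", "the Eiffel Tower"))
```

Benchmark scores are usually these per-question values averaged over the dataset and reported as percentages; when a question has several reference answers, the maximum score over the references is typically taken.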


Papers


Showing 5751–5775 of 10817 papers

Title	Date	Tasks	Status	Hype
ChartQA: A Benchmark for Question Answering about Charts with Visual and Logical Reasoning	Mar 19, 2022	Chart Question Answering, Logical Reasoning	Code Available	2
Sequence-to-Sequence Knowledge Graph Completion and Question Answering	Mar 19, 2022	Decoder, Graph Embedding	Code Available	1
Are You Robert or RoBERTa? Deceiving Online Authorship Attribution Models Using Neural Text Generators	Mar 18, 2022	Authorship Attribution, Question Answering	Unverified	0
Simulating Bandit Learning from User Feedback for Extractive Question Answering	Mar 18, 2022	Extractive Question-Answering, Question Answering	Code Available	1
DP-KB: Data Programming with Knowledge Bases Improves Transformer Fine Tuning for Answer Sentence Selection	Mar 17, 2022	Entity Linking, Fact Verification	Unverified	0
elBERto: Self-supervised Commonsense Learning for Question Answering	Mar 17, 2022	Question Answering, Representation Learning	Unverified	0
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering	Mar 17, 2022	Implicit Relations, Question Answering	Code Available	1
Knowledge Graph Embedding Methods for Entity Alignment: An Experimental Review	Mar 17, 2022	Attribute, Entity Alignment	Code Available	1
Ask to Understand: Question Generation for Multi-hop Question Answering	Mar 17, 2022	Diversity, Multi-hop Question Answering	Unverified	0
A New Quantum CNN Model for Image Classification	Mar 16, 2022	Classification, image-classification	Unverified	0
C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References	Mar 16, 2022	Open-Domain Question Answering, Question Answering	Code Available	0
Synthetic Question Value Estimation for Domain Adaptation of Question Answering	Mar 16, 2022	Domain Adaptation, Question Answering	Code Available	1
E-KAR: A Benchmark for Rationalizing Natural Language Analogical Reasoning	Mar 16, 2022	Explanation Generation, Question Answering	Unverified	0
Training Data is More Valuable than You Think: A Simple and Effective Method by Retrieving from Training Data	Mar 16, 2022	Language Modeling, Language Modelling	Code Available	1
Can you even tell left from right? Presenting a new challenge for VQA	Mar 15, 2022	Question Answering, Visual Question Answering	Unverified	0
Revitalize Region Feature for Democratizing Video-Language Pre-training of Retrieval	Mar 15, 2022	Question Answering, Retrieval	Code Available	1
Generalized but not Robust? Comparing the Effects of Data Modification Methods on Out-of-Domain Generalization and Adversarial Robustness	Mar 15, 2022	Adversarial Robustness, Data Augmentation	Unverified	0
Reasoning over Public and Private Data in Retrieval-Based Systems	Mar 14, 2022	Fact Checking, Information Retrieval	Code Available	1
Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering	Mar 14, 2022	Decoder, Domain Generalization	Unverified	0
Towards Semantic Search for Community Question Answering for Mortgage Officers	Mar 14, 2022	Community Question Answering, Domain Adaptation	Unverified	0
ScienceWorld: Is your Agent Smarter than a 5th Grader?	Mar 14, 2022	Question Answering	Code Available	2
Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering	Mar 14, 2022	Open-Domain Question Answering, Passage Retrieval	Code Available	1
All in One: Exploring Unified Video-Language Pre-training	Mar 14, 2022	All, Language Modelling	Code Available	2
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment	Mar 14, 2022	parameter-efficient fine-tuning, Question Answering	Unverified	0
Efficient Language Modeling with Sparse all-MLP	Mar 14, 2022	All, Common Sense Reasoning	Unverified	0


Datasets: SQuAD2.0, SQuAD1.1, HotpotQA, PIQA, BoolQ, COPA, TriviaQA, SQuAD1.1 dev, Natural Questions, OpenBookQA, TruthfulQA, MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified