Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7301–7325 of 10817 papers

Title	Date	Tasks	Status	Hype
A Graph Representation of Semi-structured Data for Web Question Answering	Oct 14, 2020	Question Answering	—Unverified	0
Length-Adaptive Transformer: Train Once with Length Drop, Use Anytime with Search	Oct 14, 2020	ClassificationQuestion Answering	CodeCode Available	1
A Wrong Answer or a Wrong Question? An Intricate Relationship between Question Reformulation and Answer Selection in Conversational Question Answering	Oct 13, 2020	Answer SelectionConversational Question Answering	CodeCode Available	0
CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and Relation Transferring	Oct 13, 2020	Question AnsweringRelation	CodeCode Available	1
F1 is Not Enough! Models and Evaluation Towards User-Centered Explainable Question Answering	Oct 13, 2020	Model SelectionQuestion Answering	CodeCode Available	0
Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!	Oct 13, 2020	DiagnosticImage-text Classification	—Unverified	0
Contrast and Classify: Training Robust VQA Models	Oct 13, 2020	Contrastive LearningData Augmentation	CodeCode Available	1
Cross-Modal BERT for Text-Audio Sentiment Analysis	Oct 12, 2020	Multimodal Sentiment AnalysisNatural Language Inference	CodeCode Available	1
End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering Systems	Oct 12, 2020	DecoderDomain Adaptation	CodeCode Available	0
BioMegatron: Larger Biomedical Domain Language Model	Oct 12, 2020	Language ModelingLanguage Modelling	—Unverified	0
Neural, Symbolic and Neural-Symbolic Reasoning on Knowledge Graphs	Oct 12, 2020	Information RetrievalKnowledge Graph Completion	—Unverified	0
Counterfactual Variable Control for Robust and Interpretable Question Answering	Oct 12, 2020	Causal Inferencecounterfactual	CodeCode Available	1
Towards Accurate and Reliable Energy Measurement of NLP Models	Oct 11, 2020	Question Answering	CodeCode Available	0
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks	Oct 10, 2020	Active LearningOpen-Domain Question Answering	CodeCode Available	0
Localizing Open-Ontology QA Semantic Parsers in a Day Using Machine Translation	Oct 10, 2020	Machine TranslationNMT	CodeCode Available	0
Beyond Language: Learning Commonsense from Images for Reasoning	Oct 10, 2020	Language ModelingLanguage Modelling	CodeCode Available	0
Interpretable Neural Computation for Real-World Compositional Visual Question Answering	Oct 10, 2020	Question AnsweringVisual Question Answering	—Unverified	0
Open-Domain Question Answering Goes Conversational via Question Rewriting	Oct 10, 2020	Conversational Question AnsweringOpen-Domain Question Answering	CodeCode Available	1
Artificial Intelligence (AI) in Action: Addressing the COVID-19 Pandemic with Natural Language Processing (NLP)	Oct 9, 2020	Emotion RecognitionInformation Retrieval	—Unverified	0
Relation Classification as Two-way Span-Prediction	Oct 9, 2020	ClassificationGeneral Classification	—Unverified	0
AutoQA: From Databases To QA Semantic Parsers With Only Synthetic Training Data	Oct 9, 2020	AttributeNatural Questions	CodeCode Available	1
Characterizing Datasets for Social Visual Question Answering, and the New TinySocial Dataset	Oct 8, 2020	Question AnsweringVisual Question Answering	—Unverified	0
Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition	Oct 8, 2020	Question AnsweringWorld Knowledge	CodeCode Available	1
On the importance of pre-training data volume for compact language models	Oct 8, 2020	FQuADLanguage Modeling	—Unverified	0
Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data	Oct 7, 2020	AttributeQuestion Answering	CodeCode Available	1

Show:10 25 50

← PrevPage 293 of 433Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified