Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7601–7625 of 10817 papers

Title	Date	Tasks	Status
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks	Oct 15, 2020	Active LearningOpen-Domain Question Answering	—Unverified
Hierarchical Poset Decoding for Compositional Generalization in Language	Oct 15, 2020	DecoderQuestion Answering	—Unverified
A Graph Representation of Semi-structured Data for Web Question Answering	Oct 14, 2020	Question Answering	—Unverified
F1 is Not Enough! Models and Evaluation Towards User-Centered Explainable Question Answering	Oct 13, 2020	Model SelectionQuestion Answering	CodeCode Available
Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!	Oct 13, 2020	DiagnosticImage-text Classification	—Unverified
A Wrong Answer or a Wrong Question? An Intricate Relationship between Question Reformulation and Answer Selection in Conversational Question Answering	Oct 13, 2020	Answer SelectionConversational Question Answering	CodeCode Available
Neural, Symbolic and Neural-Symbolic Reasoning on Knowledge Graphs	Oct 12, 2020	Information RetrievalKnowledge Graph Completion	—Unverified
End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering Systems	Oct 12, 2020	DecoderDomain Adaptation	CodeCode Available
BioMegatron: Larger Biomedical Domain Language Model	Oct 12, 2020	Language ModelingLanguage Modelling	—Unverified
Towards Accurate and Reliable Energy Measurement of NLP Models	Oct 11, 2020	Question Answering	CodeCode Available
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks	Oct 10, 2020	Active LearningOpen-Domain Question Answering	CodeCode Available
Localizing Open-Ontology QA Semantic Parsers in a Day Using Machine Translation	Oct 10, 2020	Machine TranslationNMT	CodeCode Available
Beyond Language: Learning Commonsense from Images for Reasoning	Oct 10, 2020	Language ModelingLanguage Modelling	CodeCode Available
Interpretable Neural Computation for Real-World Compositional Visual Question Answering	Oct 10, 2020	Question AnsweringVisual Question Answering	—Unverified
Artificial Intelligence (AI) in Action: Addressing the COVID-19 Pandemic with Natural Language Processing (NLP)	Oct 9, 2020	Emotion RecognitionInformation Retrieval	—Unverified
Relation Classification as Two-way Span-Prediction	Oct 9, 2020	ClassificationGeneral Classification	—Unverified
Characterizing Datasets for Social Visual Question Answering, and the New TinySocial Dataset	Oct 8, 2020	Question AnsweringVisual Question Answering	—Unverified
On the importance of pre-training data volume for compact language models	Oct 8, 2020	FQuADLanguage Modeling	—Unverified
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling	Oct 7, 2020	Knowledge DistillationQuestion Answering	—Unverified
Learning a Cost-Effective Annotation Policy for Question Answering	Oct 7, 2020	Question Answering	CodeCode Available
Unsupervised Evaluation for Question Answering with Transformers	Oct 7, 2020	Question Answering	—Unverified
Improving QA Generalization by Concurrent Modeling of Multiple Biases	Oct 7, 2020	Extractive Question-AnsweringQuestion Answering	CodeCode Available
Finding the Evidence: Localization-aware Answer Prediction for Text Visual Question Answering	Oct 6, 2020	Optical Character RecognitionOptical Character Recognition (OCR)	—Unverified
Pathological Visual Question Answering	Oct 6, 2020	AI AgentQuestion Answering	—Unverified
Multi-Fact Correction in Abstractive Text Summarization	Oct 6, 2020	Abstractive Text SummarizationNews Summarization	—Unverified

Show:10 25 50

← PrevPage 305 of 433Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified