Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7601–7650 of 10817 papers

Title	Date	Tasks	Status
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks	Oct 15, 2020	Active LearningOpen-Domain Question Answering	—Unverified
Hierarchical Poset Decoding for Compositional Generalization in Language	Oct 15, 2020	DecoderQuestion Answering	—Unverified
A Graph Representation of Semi-structured Data for Web Question Answering	Oct 14, 2020	Question Answering	—Unverified
F1 is Not Enough! Models and Evaluation Towards User-Centered Explainable Question Answering	Oct 13, 2020	Model SelectionQuestion Answering	CodeCode Available
Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!	Oct 13, 2020	DiagnosticImage-text Classification	—Unverified
A Wrong Answer or a Wrong Question? An Intricate Relationship between Question Reformulation and Answer Selection in Conversational Question Answering	Oct 13, 2020	Answer SelectionConversational Question Answering	CodeCode Available
Neural, Symbolic and Neural-Symbolic Reasoning on Knowledge Graphs	Oct 12, 2020	Information RetrievalKnowledge Graph Completion	—Unverified
End-to-End Synthetic Data Generation for Domain Adaptation of Question Answering Systems	Oct 12, 2020	DecoderDomain Adaptation	CodeCode Available
BioMegatron: Larger Biomedical Domain Language Model	Oct 12, 2020	Language ModelingLanguage Modelling	—Unverified
Towards Accurate and Reliable Energy Measurement of NLP Models	Oct 11, 2020	Question Answering	CodeCode Available
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks	Oct 10, 2020	Active LearningOpen-Domain Question Answering	CodeCode Available
Localizing Open-Ontology QA Semantic Parsers in a Day Using Machine Translation	Oct 10, 2020	Machine TranslationNMT	CodeCode Available
Beyond Language: Learning Commonsense from Images for Reasoning	Oct 10, 2020	Language ModelingLanguage Modelling	CodeCode Available
Interpretable Neural Computation for Real-World Compositional Visual Question Answering	Oct 10, 2020	Question AnsweringVisual Question Answering	—Unverified
Artificial Intelligence (AI) in Action: Addressing the COVID-19 Pandemic with Natural Language Processing (NLP)	Oct 9, 2020	Emotion RecognitionInformation Retrieval	—Unverified
Relation Classification as Two-way Span-Prediction	Oct 9, 2020	ClassificationGeneral Classification	—Unverified
Characterizing Datasets for Social Visual Question Answering, and the New TinySocial Dataset	Oct 8, 2020	Question AnsweringVisual Question Answering	—Unverified
On the importance of pre-training data volume for compact language models	Oct 8, 2020	FQuADLanguage Modeling	—Unverified
Unsupervised Evaluation for Question Answering with Transformers	Oct 7, 2020	Question Answering	—Unverified
DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling	Oct 7, 2020	Knowledge DistillationQuestion Answering	—Unverified
Learning a Cost-Effective Annotation Policy for Question Answering	Oct 7, 2020	Question Answering	CodeCode Available
Improving QA Generalization by Concurrent Modeling of Multiple Biases	Oct 7, 2020	Extractive Question-AnsweringQuestion Answering	CodeCode Available
Finding the Evidence: Localization-aware Answer Prediction for Text Visual Question Answering	Oct 6, 2020	Optical Character RecognitionOptical Character Recognition (OCR)	—Unverified
Pathological Visual Question Answering	Oct 6, 2020	AI AgentQuestion Answering	—Unverified
Multi-Fact Correction in Abstractive Text Summarization	Oct 6, 2020	Abstractive Text SummarizationNews Summarization	—Unverified
Efficient Meta Lifelong-Learning with Limited Memory	Oct 6, 2020	Lifelong learningMulti-Task Learning	—Unverified
BERT Knows Punta Cana is not just beautiful, it's gorgeous: Ranking Scalar Adjectives with Contextualised Representations	Oct 6, 2020	Natural Language UnderstandingQuestion Answering	CodeCode Available
Joint Semantics and Data-Driven Path Representation for Knowledge Graph Inference	Oct 6, 2020	Link PredictionQuestion Answering	—Unverified
DaNetQA: a yes/no Question Answering Dataset for the Russian Language	Oct 6, 2020	Question AnsweringSentence	—Unverified
Context Modeling with Evidence Filter for Multiple Choice Question Answering	Oct 6, 2020	Machine Reading ComprehensionMultiple-choice	—Unverified
Attention Guided Semantic Relationship Parsing for Visual Question Answering	Oct 5, 2020	ObjectQuestion Answering	—Unverified
Reading Comprehension as Natural Language Inference: A Semantic Analysis	Oct 4, 2020	FormNatural Language Inference	—Unverified
When in Doubt, Ask: Generating Answerable and Unanswerable Questions, Unsupervised	Oct 4, 2020	Language ModelingLanguage Modelling	CodeCode Available
Tell Me How to Ask Again: Question Data Augmentation with Controllable Rewriting in Continuous Space	Oct 4, 2020	Data AugmentationMachine Reading Comprehension	CodeCode Available
CAPTION: Correction by Analyses, POS-Tagging and Interpretation of Objects using only Nouns	Oct 2, 2020	Image Captioningobject-detection	—Unverified
ARES: A Reading Comprehension Ensembling Service	Oct 1, 2020	Machine Reading ComprehensionNatural Questions	—Unverified
Mongolian Questions Classification Based on Mulit-Head Attention	Oct 1, 2020	ClassificationQuestion Answering	—Unverified
Beyond The Text: Analysis of Privacy Statements through Syntactic and Semantic Role Labeling	Oct 1, 2020	Question AnsweringSemantic Role Labeling	—Unverified
LiveQA: A Question Answering Dataset over Sports Live	Oct 1, 2020	Multiple-choiceQuestion Answering	CodeCode Available
基于多头注意力和BiLSTM改进DAM模型的中文问答匹配方法(Chinese question answering method based on multi-head attention and BiLSTM improved DAM model)	Oct 1, 2020	Deep AttentionQuestion Answering	—Unverified
ISAAQ -- Mastering Textbook Questions with Pre-trained Transformers and Bottom-Up and Top-Down Attention	Oct 1, 2020	Multiple-choiceQuestion Answering	—Unverified
CoSaTa: A Constraint Satisfaction Solver and Interpreted Language for Semi-Structured Tables of Sentences	Oct 1, 2020	Question Answering	CodeCode Available
A Technical Question Answering System with Transfer Learning	Oct 1, 2020	Question AnsweringTransfer Learning	CodeCode Available
How State-Of-The-Art Models Can Deal With Long-Form Question Answering	Oct 1, 2020	FormLong Form Question Answering	—Unverified
Combining Impression Feature Representation for Multi-turn Conversational Question Answering	Oct 1, 2020	Conversational Question Answeringfeature selection	—Unverified
Case-Based Abductive Natural Language Inference	Sep 30, 2020	Natural Language InferenceQuestion Answering	—Unverified
Bridging Information-Seeking Human Gaze and Machine Reading Comprehension	Sep 30, 2020	Machine Reading ComprehensionMultiple-choice	—Unverified
Stochastic Precision Ensemble: Self-Knowledge Distillation for Quantized Deep Neural Networks	Sep 30, 2020	image-classificationImage Classification	—Unverified
A Vietnamese Dataset for Evaluating Machine Reading Comprehension	Sep 30, 2020	ArticlesMachine Reading Comprehension	—Unverified
Graph-based Heuristic Search for Module Selection Procedure in Neural Module Network	Sep 30, 2020	Heuristic SearchQuestion Answering	—Unverified

Show:10 25 50

← PrevPage 153 of 217Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified