Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10001–10025 of 10817 papers

Title	Date	Tasks	Status
Neural Programmer: Inducing Latent Programs with Gradient Descent	Nov 16, 2015	Question Answeringspeech-recognition	—Unverified
Yin and Yang: Balancing and Answering Binary Visual Questions	Nov 16, 2015	Question AnsweringVisual Question Answering	—Unverified
Uncovering Temporal Context for Video Question and Answering	Nov 15, 2015	DecoderMultiple-choice	—Unverified
Word Embedding based Correlation Model for Question/Answer Matching	Nov 15, 2015	Question AnsweringTranslation	—Unverified
Visual7W: Grounded Question Answering in Images	Nov 11, 2015	Multiple-choiceMultiple Choice Question Answering (MCQA)	—Unverified
Explicit Knowledge-based Reasoning for Visual Question Answering	Nov 9, 2015	Question AnsweringVisual Question Answering	—Unverified
Distributed Deep Learning for Question Answering	Nov 3, 2015	Answer SelectionDeep Learning	—Unverified
Enriching entity grids and graphs with discourse relations: the impact in local coherence evaluation	Nov 1, 2015	Coherence EvaluationQuestion Answering	—Unverified
Um novo corpo e os seus desafios (A new corpus and the challenges it offers)	Nov 1, 2015	Question AnsweringSentiment Analysis	—Unverified
Semi-Automatic Construction of a Textual Entailment Dataset: Selecting Candidates with Vector Space Models	Nov 1, 2015	Natural Language InferenceQuestion Answering	—Unverified
Empirical Study on Deep Learning Models for Question Answering	Oct 26, 2015	Deep LearningMachine Translation	—Unverified
A Graph Traversal Based Approach to Answer Non-Aggregation Questions Over DBpedia	Oct 16, 2015	Question Answering	—Unverified
Computing Semantic Text Similarity Using Rich Features	Oct 1, 2015	Machine TranslationQuestion Answering	—Unverified
Bidirectional Long Short-Term Memory Networks for Relation Classification	Oct 1, 2015	ClassificationGeneral Classification	—Unverified
RealText-asg: A Model to Present Answers Utilizing the Linguistic Structure of Source Question	Oct 1, 2015	Question AnsweringSentence	—Unverified
Fast and Large-scale Unsupervised Relation Extraction	Oct 1, 2015	ClusteringDimensionality Reduction	—Unverified
Corpus annotation with a linguistic analysis of the associations between event mentions and spatial expressions	Oct 1, 2015	Natural Language InferenceQuestion Answering	—Unverified
More Accurate Question Answering on Freebase	Oct 1, 2015	Learning-To-RankQuestion Answering	CodeCode Available
Enhancing Root Extractors Using Light Stemmers	Oct 1, 2015	Machine TranslationPart-Of-Speech Tagging	—Unverified
Measuring Popularity of Machine-Generated Sentences Using Term Count, Document Frequency, and Dependency Language Model	Oct 1, 2015	Language ModelingLanguage Modelling	—Unverified
Selecting Contextual Peripheral Information for Answer Presentation: The Need for Pragmatic Models	Oct 1, 2015	Question Answering	—Unverified
Measuring an Artificial Intelligence System's Performance on a Verbal IQ Test For Young Children	Sep 11, 2015	Common Sense ReasoningQuestion Answering	—Unverified
On TimeML-Compliant Temporal Expression Extraction in Turkish	Sep 3, 2015	Information Retrievalnamed-entity-recognition	—Unverified
A Neural Network Model for Low-Resource Universal Dependency Parsing	Sep 1, 2015	Dependency ParsingDomain Adaptation	—Unverified
A Baseline Temporal Tagger for all Languages	Sep 1, 2015	AllInformation Retrieval	CodeCode Available

Show:10 25 50

← PrevPage 401 of 433Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified