Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8876–8900 of 10817 papers

Title	Date	Tasks	Status
Multimodal Neural Machine Translation for Low-resource Language Pairs using Synthetic Data	Jul 1, 2018	Image DescriptionMachine Translation	—Unverified
A Framework for Developing and Evaluating Word Embeddings of Drug-named Entity	Jul 1, 2018	Named Entity Recognition (NER)Outlier Detection	—Unverified
Keyphrases Extraction from User-Generated Contents in Healthcare Domain Using Long Short-Term Memory Networks	Jul 1, 2018	Question AnsweringText Classification	—Unverified
Natural Language Inference with Definition Embedding Considering Context On the Fly	Jul 1, 2018	Domain AdaptationInformation Retrieval	—Unverified
A Multi-Stage Memory Augmented Neural Network for Machine Reading Comprehension	Jul 1, 2018	Machine Reading ComprehensionQuestion Answering	—Unverified
BioAMA: Towards an End to End BioMedical Question Answering System	Jul 1, 2018	Natural Language InferenceNER	—Unverified
Proceedings of the Workshop on Machine Reading for Question Answering	Jul 1, 2018	Question AnsweringReading Comprehension	—Unverified
Phrase2VecGLM: Neural generalized language model--based semantic tagging for complex query reformulation in medical IR	Jul 1, 2018	Document RankingInformation Retrieval	—Unverified
A Simple End-to-End Question Answering Model for Product Information	Jul 1, 2018	Answer SelectionQuestion Answering	—Unverified
Affordances in Grounded Language Learning	Jul 1, 2018	Grounded language learningQuestion Answering	—Unverified
Neural Models for Key Phrase Extraction and Question Generation	Jul 1, 2018	Question AnsweringQuestion Generation	—Unverified
A Hybrid Learning Scheme for Chinese Word Embedding	Jul 1, 2018	Language ModelingLanguage Modelling	—Unverified
The First Multilingual Surface Realisation Shared Task (SRâ18): Overview and Evaluation Results	Jul 1, 2018	Question AnsweringText Generation	—Unverified
Systematic Error Analysis of the Stanford Question Answering Dataset	Jul 1, 2018	Common Sense ReasoningMachine Reading Comprehension	—Unverified
Tackling Adversarial Examples in QA via Answer Sentence Selection	Jul 1, 2018	ArticlesQuestion Answering	—Unverified
Tackling Code-Switched NER: Participation of CMU	Jul 1, 2018	named-entity-recognitionNamed Entity Recognition	—Unverified
Transliteration Better than Translation? Answering Code-mixed Questions over a Knowledge Base	Jul 1, 2018	Automatic Speech Recognition (ASR)Information Retrieval	—Unverified
Semantically Equivalent Adversarial Rules for Debugging NLP models	Jul 1, 2018	Data AugmentationQuestion Answering	CodeCode Available
The price of debiasing automatic metrics in natural language evalaution	Jul 1, 2018	Abstractive Text SummarizationImage Captioning	—Unverified
To Attend or not to Attend: A Case Study on Syntactic Structures for Semantic Relatedness	Jul 1, 2018	Machine TranslationParaphrase Identification	CodeCode Available
Trick Me If You Can: Adversarial Writing of Trivia Challenge Questions	Jul 1, 2018	Question Answering	—Unverified
Syntax for Semantic Role Labeling, To Be, Or Not To Be	Jul 1, 2018	Dependency ParsingFeature Engineering	CodeCode Available
Visual Attention Model for Name Tagging in Multimodal Social Media	Jul 1, 2018	Natural Language UnderstandingQuestion Answering	—Unverified
Modeling discourse cohesion for discourse parsing via memory network	Jul 1, 2018	Discourse ParsingQuestion Answering	—Unverified
DeepPavlov: Open-Source Library for Dialogue Systems	Jul 1, 2018	General Classificationintent-classification	—Unverified

Show:10 25 50

← PrevPage 356 of 433Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified