Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10476–10500 of 10817 papers

Title	Date	Tasks	Status
Detecting Bot-Answerable Questions in Ubuntu Chat	Oct 1, 2013	Question Answering	—Unverified
Detecting Spammers in Community Question Answering	Oct 1, 2013	Community Question AnsweringQuestion Answering	—Unverified
Feature Selection Using a Semantic Hierarchy for Event Recognition and Type Classification	Oct 1, 2013	feature selectionGeneral Classification	—Unverified
JoBimText Visualizer: A Graph-based Approach to Contextualizing Distributional Similarity	Oct 1, 2013	Domain AdaptationGraph Clustering	—Unverified
Developing ML-based Systems to Extract Medical Information from Japanese Medical History Summaries	Oct 1, 2013	Information RetrievalNamed Entity Recognition (NER)	—Unverified
Malayalam Clause Boundary Identifier: Annotation and Evaluation	Oct 1, 2013	ChunkingMachine Translation	—Unverified
Harvesting Parallel News Streams to Generate Paraphrases of Event Relations	Oct 1, 2013	Machine TranslationQuestion Answering	—Unverified
Automatic Feature Engineering for Answer Selection and Extraction	Oct 1, 2013	Answer SelectionFeature Engineering	—Unverified
Growing Multi-Domain Glossaries from a Few Seeds using Probabilistic Topic Models	Oct 1, 2013	Question AnsweringTopic Models	—Unverified
A Hierarchical Entity-Based Approach to Structuralize User Generated Content in Social Media: A Case of Yahoo! Answers	Oct 1, 2013	Information RetrievalLanguage Modelling	—Unverified
Interpreting Anaphoric Shell Nouns using Antecedents of Cataphoric Shell Nouns as Training Data	Oct 1, 2013	Question AnsweringText Summarization	—Unverified
A Discourse-Driven Content Model for Summarising Scientific Articles Evaluated in a Complex Question Answering Task	Oct 1, 2013	ArticlesQuestion Answering	—Unverified
A Dataset for Research on Short-Text Conversations	Oct 1, 2013	ChatbotQuestion Answering	—Unverified
Learning Biological Processes with Global Constraints	Oct 1, 2013	Question Answering	—Unverified
Exploiting Multiple Sources for Open-Domain Hypernym Discovery	Oct 1, 2013	Hypernym DiscoveryInformation Retrieval	—Unverified
Question Difficulty Estimation in Community Question Answering Services	Oct 1, 2013	Community Question AnsweringQuestion Answering	—Unverified
Event-Based Time Label Propagation for Automatic Dating of News Articles	Oct 1, 2013	ArticlesInformation Retrieval	—Unverified
Exploiting Language Models for Visual Recognition	Oct 1, 2013	Language ModellingMachine Translation	—Unverified
The Answer is at your Fingertips: Improving Passage Retrieval for Web Question Answering with Search Behavior Data	Oct 1, 2013	Passage RetrievalQuestion Answering	—Unverified
Using Paraphrases and Lexical Semantics to Improve the Accuracy and the Robustness of Supervised Models in Situated Dialogue Systems	Oct 1, 2013	Dialogue ManagementParaphrase Generation	—Unverified
Scaling Semantic Parsers with On-the-Fly Ontology Matching	Oct 1, 2013	Ontology MatchingQuestion Answering	—Unverified
Unsupervised Induction of Cross-Lingual Semantic Relations	Oct 1, 2013	Information RetrievalMachine Translation	—Unverified
Unsupervised Relation Extraction with General Domain Knowledge	Oct 1, 2013	Information RetrievalQuestion Answering	—Unverified
Semi-Markov Phrase-Based Monolingual Alignment	Oct 1, 2013	Machine TranslationNatural Language Inference	—Unverified
Learning to answer questions	Sep 4, 2013	Open-Domain Question AnsweringQuestion Answering	—Unverified

Show:10 25 50

← PrevPage 420 of 433Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified