SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 86768700 of 10817 papers

TitleStatusHype
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering0
An Interactive Web-Interface for Visualizing the Inner Workings of the Question Answering LSTM0
Explaining non-linear Classifier Decisions within Kernel-based Deep Architectures0
Expletives in Universal Dependency TreebanksCode0
CogCompTime: A Tool for Understanding Time in Natural Language0
Exploiting Attention to Reveal Shortcomings in Memory Models0
Preferred Answer Selection in Stack Overflow: Better Text Representations ... and Metadata, Metadata, Metadata0
OpenKE: An Open Toolkit for Knowledge EmbeddingCode0
Proceedings of the 6th BioASQ Workshop A challenge on large-scale biomedical semantic indexing and question answering0
Automatic Opinion Question GenerationCode0
Ontology-Based Retrieval \& Neural Approaches for BioASQ Ideal Answer Generation0
Extraction Meets Abstraction: Ideal Answer Generation for Biomedical Questions0
Retrieve and Re-rank: A Simple and Effective IR Approach to Simple Question Answering over Knowledge Graphs0
On the Generation of Medical Question-Answer Pairs0
Interactive Instance-based Evaluation of Knowledge Base Question AnsweringCode0
An Adaption of BIOASQ Question Answering dataset for Machine Reading systems by Manual Annotations of Answer Spans.0
Results of the sixth edition of the BioASQ Challenge0
Macquarie University at BioASQ 6b: Deep learning and deep reinforcement learning for query-based summarisation0
AttentionMeSH: Simple, Effective and Interpretable Automatic MeSH Indexer0
Improving Machine Reading Comprehension with General Reading StrategiesCode0
On the Effectiveness of Minimal Context Selection for Robust Question Answering0
Compositional Attention Networks for Interpretability in Natural Language Question Answering0
ReviewQA: a relational aspect-based opinion reading dataset0
Do Explanations make VQA Models more Predictable to a Human?0
TallyQA: Answering Complex Counting QuestionsCode0
Show:102550
← PrevPage 348 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified