SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
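As a rough illustration of the two metrics named above, the following sketch computes EM and token-level F1 for a single prediction against a single reference answer. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles, collapsing whitespace). The function names `normalize`, `exact_match`, and `f1_score` are hypothetical helpers chosen for this example, not the API of any particular benchmark's official scorer.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Assumed SQuAD-style normalization: lowercase, drop punctuation and articles."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall after normalization."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

When a question has several reference answers, a common convention is to take the maximum score over the references and then average over all questions; leaderboard values such as 90.94 are these averages expressed as percentages.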


Papers

Showing 9576-9600 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| Vision and Language Integration: Moving beyond Objects | | 0 |
| The representation and extraction of quantitative information | | 0 |
| Domain-Targeted, High Precision Knowledge Extraction | | 0 |
| Parsing with Traces: An O(n^4) Algorithm and a Structural Representation | | 0 |
| A Context-aware Attention Network for Interactive Question Answering | | 0 |
| A deep learning approach for predicting the quality of online health expert question-answering services | | 0 |
| CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning | Code | 1 |
| The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions | | 0 |
| Mining Compatible/Incompatible Entities from Question and Answering via Yes/No Answer Classification using Distant Label Expansion | | 0 |
| Attentive Explanations: Justifying Decisions and Pointing to the Evidence | | 0 |
| Multi-Perspective Context Matching for Machine Comprehension | Code | 0 |
| VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering | Code | 0 |
| Reading Comprehension using Entity-based Memory Network | | 0 |
| Tracking the World State with Recurrent Entity Networks | Code | 2 |
| Knowledge Completion for Generics using Guided Tensor Factorization | | 0 |
| MarioQA: Answering Questions by Watching Gameplay Videos | | 0 |
| Probabilistic Neural Programs | | 0 |
| N-ary Biographical Relation Extraction using Shortest Path Dependencies | | 0 |
| Dialog-based Language Learning | | 0 |
| Visual Question Answering with Question Representation Update (QRU) | | 0 |
| NL2KB: Resolving Vocabulary Gap between Natural Language and Knowledge Base in Knowledge Base Construction and Retrieval | | 0 |
| Natural Language Processing for Intelligent Access to Scientific Information | | 0 |
| PersoNER: Persian Named-Entity Recognition | Code | 0 |
| Reading and Thinking: Re-read LSTM Unit for Textual Entailment Recognition | | 0 |
| Interactive-Predictive Machine Translation based on Syntactic Constraints of Prefix | | 0 |

Benchmark Results

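| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |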