SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 96019625 of 10817 papers

TitleStatusHype
MarioQA: Answering Questions by Watching Gameplay Videos0
Probabilistic Neural Programs0
Neural Attention for Learning to Rank Questions in Community Question Answering0
Constraint-Based Question Answering with Knowledge GraphCode0
Dialog-based Language Learning0
ENIAM: Categorial Syntactic-Semantic Parser for Polish0
CogALex-V Shared Task: CGSRC - Classifying Semantic Relations using Convolutional Neural Networks0
Global Inference to Chinese Temporal Relation Extraction0
Implicit Discourse Relation Recognition with Context-aware Character-enhanced Embeddings0
Large-Scale Acquisition of Commonsense Knowledge via a Quiz Game on a Dialogue System0
Different Contexts Lead to Different Word Embeddings0
Mongolian Named Entity Recognition System with Rich Features0
From Entity Linking to Question Answering -- Recent Progress on Semantic Grounding Tasks0
A Hierarchical Neural Network for Information Extraction of Product Attribute and Condition Sentences0
A Supervised Approach for Enriching the Relational Structure of Frame Semantics in FrameNet0
Categorization of Semantic Roles for Dictionary Definitions0
Multilingual Supervision of Semantic Annotation0
Distributional Hypernym Generation by Jointly Learning Clusters and Projections0
Pairwise Relation Classification with Mirror Instances and a Combined Convolutional Neural Network0
Pairing Wikipedia Articles Across Languages0
CATENA: CAusal and TEmporal relation extraction from NAtural language textsCode0
Reading and Thinking: Re-read LSTM Unit for Textual Entailment Recognition0
High Accuracy Rule-based Question Classification using Question Syntax and Semantics0
Multilingual Aliasing for Auto-Generating Proposition Banks0
Annotation and Analysis of Discourse Relations, Temporal Relations and Multi-Layered Situational Relations in Japanese Texts0
Show:102550
← PrevPage 385 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified