SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 98019825 of 10817 papers

TitleStatusHype
``Look, some Green Circles!'': Learning to Quantify from Images0
Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text0
A Shared Task on Multimodal Machine Translation and Crosslingual Image Description0
Improving Temporal Relation Extraction with Training Instance Augmentation0
Together we stand: Siamese Networks for Similar Question Retrieval0
Unanimous Prediction for 100\% Precision with Application to Learning Semantic Mappings0
SHEF-Multimodal: Grounding Machine Translation on Images0
Why ``Blow Out''? A Structural Analysis of the Movie Dialog Dataset0
Supersense Embeddings: A Unified Model for Supersense Interpretation, Prediction, and Utilization0
Tables as Semi-structured Knowledge for Question Answering0
VERSE: Event and Relation Extraction in the BioNLP 2016 Shared Task0
TransG : A Generative Model for Knowledge Graph Embedding0
TranscRater: a Tool for Automatic Speech Recognition Quality Estimation0
Speech Act Modeling of Written Asynchronous Conversations with Task-Specific Embeddings and Conditional Structured Models0
The Value of Semantic Parse Labeling for Knowledge Base Question Answering0
Dataset and Neural Recurrent Sequence Labeling Model for Open-Domain Factoid Question AnsweringCode0
Neural Contextual Conversation Learning with Labeled Question-Answering Pairs0
An Empirical Evaluation of various Deep Learning Architectures for Bi-Sequence Classification Tasks0
Attention-over-Attention Neural Networks for Reading ComprehensionCode0
Neural Semantic EncodersCode0
Using Recurrent Neural Network for Learning Expressive Ontologies0
Estimating Uncertainty Online Against an Adversary0
Open-Vocabulary Semantic Parsing with both Distributional Statistics and Formal KnowledgeCode0
Mapping distributional to model-theoretic semantic spaces: a baselineCode0
Annotation Methodologies for Vision and Language Dataset Creation0
Show:102550
← PrevPage 393 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified