SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
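As a rough illustration of the two metrics named above, the following is a minimal sketch of answer-level EM and token-level F1 for extractive question answering. It assumes SQuAD-style normalization (lowercasing, removing punctuation and English articles, collapsing whitespace). The function names `normalize_answer`, `exact_match`, and `f1_score` are illustrative and not taken from any particular library, and official benchmark scripts may differ in details such as how multiple reference answers are handled.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors SQuAD-style normalization; other benchmarks
    may normalize differently.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears in both answers.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower"))
    print(f1_score("in Paris France", "Paris"))
```

A benchmark score such as the EM values reported on a leaderboard is then typically the average of the per-question scores over the evaluation set, expressed as a percentage.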


Papers

Showing 9426-9450 of 10817 papers

| Title | Status | Hype |
|---|---|---|
| Seq2seq for Morphological Reinflection: When Deep Learning Fails | | 0 |
| SemEval-2017 Task 2: Multilingual and Cross-lingual Semantic Word Similarity | | 0 |
| The Rich Event Ontology | | 0 |
| SCIR-QA at SemEval-2017 Task 3: CNN Model Based on Similar and Dissimilar Information between Keywords for Question Similarity | | 0 |
| Towards Harnessing Memory Networks for Coreference Resolution | | 0 |
| UParse: the Edinburgh system for the CoNLL 2017 UD shared task | | 0 |
| The Event StoryLine Corpus: A New Benchmark for Causal and Temporal Relation Extraction | | 0 |
| Ways of Asking and Replying in Duplicate Question Detection | | 0 |
| Sentence Alignment using Unfolding Recursive Autoencoders | | 0 |
| MEMEN: Multi-layer Embedding with Memory Networks for Machine Comprehension | | 0 |
| Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering | | 0 |
| Question Dependent Recurrent Entity Network for Question Answering | Code | 0 |
| Hyperbolic Representation Learning for Fast and Efficient Neural Question Answering | Code | 0 |
| MatchZoo: A Toolkit for Deep Text Matching | Code | 0 |
| Adversarial Examples for Evaluating Reading Comprehension Systems | Code | 0 |
| Improved Bilinear Pooling with CNNs | | 0 |
| Large-Scale Goodness Polarity Lexicons for Community Question Answering | | 0 |
| Video Question Answering via Attribute-Augmented Attention Network Learning | | 0 |
| Visual Question Answering with Memory-Augmented Networks | | 0 |
| Evaluating Semantic Parsing against a Simple Web-based Question Answering Model | Code | 0 |
| Quasar: Datasets for Question Answering by Search and Reading | Code | 0 |
| Effective Approaches to Batch Parallelization for Dynamic Neural Network Architectures | Code | 0 |
| Long-Term Memory Networks for Question Answering | | 0 |
| An Attention Mechanism for Answer Selection Using a Combined Global and Local View | | 0 |
| DeepStory: Video Story QA by Deep Embedded Memory Networks | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |