SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1000110025 of 10817 papers

TitleStatusHype
Neural Programmer: Inducing Latent Programs with Gradient Descent0
Yin and Yang: Balancing and Answering Binary Visual Questions0
Uncovering Temporal Context for Video Question and Answering0
Word Embedding based Correlation Model for Question/Answer Matching0
Visual7W: Grounded Question Answering in Images0
Explicit Knowledge-based Reasoning for Visual Question Answering0
Distributed Deep Learning for Question Answering0
Enriching entity grids and graphs with discourse relations: the impact in local coherence evaluation0
Um novo corpo e os seus desafios (A new corpus and the challenges it offers)0
Semi-Automatic Construction of a Textual Entailment Dataset: Selecting Candidates with Vector Space Models0
Empirical Study on Deep Learning Models for Question Answering0
A Graph Traversal Based Approach to Answer Non-Aggregation Questions Over DBpedia0
Computing Semantic Text Similarity Using Rich Features0
Bidirectional Long Short-Term Memory Networks for Relation Classification0
RealText-asg: A Model to Present Answers Utilizing the Linguistic Structure of Source Question0
Fast and Large-scale Unsupervised Relation Extraction0
Corpus annotation with a linguistic analysis of the associations between event mentions and spatial expressions0
More Accurate Question Answering on FreebaseCode0
Enhancing Root Extractors Using Light Stemmers0
Measuring Popularity of Machine-Generated Sentences Using Term Count, Document Frequency, and Dependency Language Model0
Selecting Contextual Peripheral Information for Answer Presentation: The Need for Pragmatic Models0
Measuring an Artificial Intelligence System's Performance on a Verbal IQ Test For Young Children0
On TimeML-Compliant Temporal Expression Extraction in Turkish0
A Neural Network Model for Low-Resource Universal Dependency Parsing0
A Baseline Temporal Tagger for all LanguagesCode0
Show:102550
← PrevPage 401 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified