SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1000110025 of 10817 papers

TitleStatusHype
Word Embedding based Correlation Model for Question/Answer Matching0
Visual7W: Grounded Question Answering in Images0
Explicit Knowledge-based Reasoning for Visual Question Answering0
Distributed Deep Learning for Question Answering0
Enriching entity grids and graphs with discourse relations: the impact in local coherence evaluation0
Semi-Automatic Construction of a Textual Entailment Dataset: Selecting Candidates with Vector Space Models0
Um novo corpo e os seus desafios (A new corpus and the challenges it offers)0
Empirical Study on Deep Learning Models for Question Answering0
A Graph Traversal Based Approach to Answer Non-Aggregation Questions Over DBpedia0
More Accurate Question Answering on FreebaseCode0
Fast and Large-scale Unsupervised Relation Extraction0
RealText-asg: A Model to Present Answers Utilizing the Linguistic Structure of Source Question0
Corpus annotation with a linguistic analysis of the associations between event mentions and spatial expressions0
Enhancing Root Extractors Using Light Stemmers0
Bidirectional Long Short-Term Memory Networks for Relation Classification0
Measuring Popularity of Machine-Generated Sentences Using Term Count, Document Frequency, and Dependency Language Model0
Computing Semantic Text Similarity Using Rich Features0
Selecting Contextual Peripheral Information for Answer Presentation: The Need for Pragmatic Models0
Measuring an Artificial Intelligence System's Performance on a Verbal IQ Test For Young Children0
On TimeML-Compliant Temporal Expression Extraction in Turkish0
Recognition of Polish Temporal Expressions0
A VSM-based Statistical Model for the Semantic Relation Interpretation of Noun-Modifier Pairs0
A Procedural Definition of Multi-word Lexical Units0
Pattern Construction for Extracting Domain Terminology0
Predicting the quality of questions on Stackoverflow0
Show:102550
← PrevPage 401 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified