SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1040110425 of 10817 papers

TitleStatusHype
JUST.ASK, a QA system that learns to answer new questions from previous interactions0
HFST-SweNER --- A New NER Resource for Swedish0
First approach toward Semantic Role Labeling for Basque0
A Multimodal Corpus of Rapid Dialogue Games0
Predicate Matrix: extending SemLink through WordNet mappings0
Crowdsourcing for the identification of event nominals: an experiment0
A hierarchical taxonomy for classifying hardness of inference tasks0
Annotating Question Decomposition on Complex Medical Questions0
Creating and using large monolingual parallel corpora for sentential paraphrase generation0
Open Question Answering with Weakly Supervised Embedding Models0
Recognizing Causality in Verb-Noun Pairs via Noun and Verb Semantics0
A System for Building FrameNet-like Corpus for the Biomedical Domain0
Domain Adaptation with Active Learning for Coreference Resolution0
Automatic Building and Using Parallel Resources for SMT from Comparable Corpora0
A Natural Language Instructor for pedestrian navigation based in generation by selection0
Multi-threaded Interaction Management for Dynamic Spatial Applications0
Tuning HeidelTime for identifying time expressions in clinical texts in English and French0
Unsupervised Relation Extraction of In-Domain Data from Focused Crawls0
Speech-Enabled Hybrid Multilingual Translation for Mobile Devices0
Inducing Example-based Semantic Frames from a Massive Amount of Verb Uses0
Answering List Questions using Web as a corpus0
PARADIGM: Paraphrase Diagnostics through Grammar Matching0
Empirically-motivated Generalizations of CCG Semantic Parsing Learning Algorithms0
Chinese Open Relation Extraction for Knowledge Acquisition0
A Hierarchical Bayesian Model for Unsupervised Induction of Script Knowledge0
Show:102550
← PrevPage 417 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified