SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1040110425 of 10817 papers

TitleStatusHype
Semi-compositional Method for Synonym Extraction of Multi-Word Terms0
Towards an environment for the production and the validation of lexical semantic resources0
The DBOX Corpus Collection of Spoken Human-Human and Human-Machine Dialogues0
The CUHK Discourse TreeBank for Chinese: Annotating Explicit Discourse Connectives for the Chinese TreeBank0
The Meta-knowledge of Causality in Biomedical Scientific Discourse0
Towards an Integration of Syntactic and Temporal Annotations in Estonian0
WordNet---Wikipedia---Wiktionary: Construction of a Three-way Alignment0
The Multilingual Paraphrase Database0
The Language Application Grid0
Open Question Answering with Weakly Supervised Embedding Models0
Empirically-motivated Generalizations of CCG Semantic Parsing Learning Algorithms0
A System for Building FrameNet-like Corpus for the Biomedical Domain0
Automatic Building and Using Parallel Resources for SMT from Comparable Corpora0
Multi-threaded Interaction Management for Dynamic Spatial Applications0
Recognizing Causality in Verb-Noun Pairs via Noun and Verb Semantics0
Inducing Example-based Semantic Frames from a Massive Amount of Verb Uses0
Answering List Questions using Web as a corpus0
Encoding Semantic Resources in Syntactic Structures for Passage Reranking0
A Graph-Based Approach to String Regeneration0
Discovering Implicit Discourse Relations Through Brown Cluster Pair Representation and Coreference Patterns0
A Hierarchical Bayesian Model for Unsupervised Induction of Script Knowledge0
A Natural Language Instructor for pedestrian navigation based in generation by selection0
PARADIGM: Paraphrase Diagnostics through Grammar Matching0
Chinese Open Relation Extraction for Knowledge Acquisition0
Domain Adaptation with Active Learning for Coreference Resolution0
Show:102550
← PrevPage 417 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified