SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
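As a rough illustration of the two metrics named above, the sketch below computes EM and token-level F1 for a single predicted answer against a single reference. It assumes SQuAD-style answer normalization (lowercasing, dropping punctuation and English articles); other benchmarks may normalize differently, and the function names `normalize_answer`, `exact_match`, and `f1_score` are chosen here for illustration rather than taken from any particular evaluation script.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the normalization commonly used for SQuAD-style
    evaluation; it is not specified by the page itself.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall after normalization."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Leaderboard figures are usually these per-question scores averaged over the dataset and reported as percentages; when a question has several reference answers, a common convention is to take the maximum score over the references.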


Papers

Showing 9651-9675 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| Annotation and Analysis of Discourse Relations, Temporal Relations and Multi-Layered Situational Relations in Japanese Texts | | 0 |
| Problematic Cases in the Annotation of Negation in Spanish | | 0 |
| Categorization of Semantic Roles for Dictionary Definitions | | 0 |
| A Hierarchical Neural Network for Information Extraction of Product Attribute and Condition Sentences | | 0 |
| QAF: Frame Semantics-based Question Interpretation | | 0 |
| Combining Lexical and Semantic-based Features for Answer Sentence Selection | | 0 |
| Pairing Wikipedia Articles Across Languages | | 0 |
| Proceedings of the Open Knowledge Base and Question Answering Workshop (OKBQA 2016) | | 0 |
| Entity-Supported Summarization of Biomedical Abstracts | | 0 |
| An Entity-Based approach to Answering Recurrent and Non-Recurrent Questions with Past Answers | | 0 |
| Filling a Knowledge Graph with a Crowd | | 0 |
| CogALex-V Shared Task: CGSRC - Classifying Semantic Relations using Convolutional Neural Networks | | 0 |
| Microsyntactic Phenomena as a Computational Linguistics Issue | | 0 |
| Feature-Rich Twitter Named Entity Recognition and Classification | | 0 |
| Name Variation in Community Question Answering Systems | | 0 |
| Neural Clinical Paraphrase Generation with Attention | | 0 |
| From Entity Linking to Question Answering -- Recent Progress on Semantic Grounding Tasks | | 0 |
| Character-Aware Neural Networks for Arabic Named Entity Recognition for Social Media | | 0 |
| Improvement of VerbNet-like resources by frame typing | | 0 |
| Compound Type Identification in Sanskrit: What Roles do the Corpus and Grammar Play? | | 0 |
| A Dataset for Multimodal Question Answering in the Cultural Heritage Domain | | 0 |
| The Development of Multimodal Lexical Resources | | 0 |
| Dialogue Learning With Human-In-The-Loop | Code | 2 |
| MS MARCO: A Human Generated MAchine Reading COmprehension Dataset | Code | 1 |
| Learning a Natural Language Interface with Neural Programmer | Code | 0 |

Benchmark Results

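| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |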