SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1012610150 of 10817 papers

TitleStatusHype
Semantic Structure Analysis of Noun Phrases using Abstract Meaning Representation0
Sentiment and Belief: How to Think about, Represent, and Annotate Private States0
Transition-based Dependency DAG Parsing Using Dynamic Oracles0
Thread-Level Information for Comment Classification in Community Question Answering0
Structural Representations for Learning Relations between Pairs of Texts0
Occam's Gates0
Ask Me Anything: Dynamic Memory Networks for Natural Language ProcessingCode0
Answer Sequence Learning with Neural Networks for Answer Selection in Community Question Answering0
The WDAqua ITN: Answering Questions using Web Data0
Leveraging Textual Features for Best Answer Prediction in Community-based Question Answering0
Large-scale Simple Question Answering with Memory NetworksCode0
What value do explicit high level concepts have in vision to language problems?Code0
Semantic Annotation of Japanese Functional Expressions and its Impact on Factuality Analysis0
yiGou: A Semantic Text Similarity Computing System Based on SVM0
TALN-UPF: Taxonomy Learning Exploiting CRF-Based Hypernym Extraction on Encyclopedic Definitions0
VectorSLU: A Continuous Word Vector Approach to Answer Selection in Community Question Answering Systems0
Semantics-based Graph Approach to Complex Question-Answering0
Voltron: A Hybrid System For Answer Validation Based On Lexical And Distance Features0
VisKE: Visual Knowledge Extraction and Question Answering by Visual Verification of Relation Phrases0
TATO: Leveraging on Multiple Strategies for Semantic Textual Similarity0
SemEval-2015 Task 17: Taxonomy Extraction Evaluation (TExEval)0
SemEval-2015 Task 14: Analysis of Clinical Text0
SemEval-2015 Task 2: Semantic Textual Similarity, English, Spanish and Pilot on Interpretability0
Towards a Better Semantic Role Labeling of Complex Predicates0
SemEval-2015 Task 5: QA TempEval - Evaluating Temporal Information Understanding with Question Answering0
Show:102550
← PrevPage 406 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified