SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
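As a rough illustration of the two metrics named above, here is a minimal sketch of EM and token-level F1 for a single prediction/reference pair. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles); the function names are illustrative and this is not the official scorer of any benchmark listed on this page.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the normalization commonly used for SQuAD-style
    evaluation; other datasets may normalize differently.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

For example, `exact_match("The Eiffel Tower", "eiffel tower!")` gives 1.0 because both strings normalize to the same text, while `f1_score("in Paris France", "Paris")` gives 0.5 (precision 1/3, recall 1). When a question has several reference answers, a common convention is to take the maximum score over the references and then average over questions.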


Papers

Showing 7051-7075 of 10817 papers

| Title | Status | Hype |
|---|---|---|
| QA Dataset Explosion: A Taxonomy of NLP Resources for Question Answering and Reading Comprehension | | 0 |
| Thought Flow Nets: From Single Predictions to Trains of Model Thought | | 0 |
| Hybrid Autoregressive Inference for Scalable Multi-hop Explanation Regeneration | Code | 0 |
| X-GGM: Graph Generative Modeling for Out-of-Distribution Generalization in Visual Question Answering | Code | 0 |
| The Effectiveness of Intermediate-Task Training for Code-Switched Natural Language Understanding | | 0 |
| CogME: A Cognition-Inspired Multi-Dimensional Evaluation Metric for Story Understanding | | 0 |
| Bridging the Gap between Language Model and Reading Comprehension: Unsupervised MRC via Self-Supervision | | 0 |
| A Discriminative Semantic Ranker for Question Retrieval | | 0 |
| Are Multilingual Models the Best Choice for Moderately Under-resourced Languages? A Comprehensive Assessment for Catalan | | 0 |
| Exploiting Rich Syntax for Better Knowledge Base Question Answering | | 0 |
| A Survey of Knowledge Graph Embedding and Their Applications | | 0 |
| Applications of knowledge graphs for food science and industry | | 0 |
| Human Attention during Goal-directed Reading Comprehension Relies on Task Optimization | Code | 0 |
| Answer Complex Questions: Path Ranker Is All You Need | Code | 0 |
| Similar Cases Recommendation using Legal Knowledge Graphs | Code | 0 |
| An Initial Investigation of Non-Native Spoken Question-Answering | | 0 |
| Joint Models for Answer Verification in Question Answering Systems | | 0 |
| Neural Natural Language Processing for Unstructured Data in Electronic Health Records: a Review | | 0 |
| MuVAM: A Multi-View Attention-based Model for Medical Visual Question Answering | | 0 |
| MedGPT: Medical Concept Prediction from Clinical Narratives | | 0 |
| Robustifying Multi-hop QA through Pseudo-Evidentiality Training | | 0 |
| Question Answering over Knowledge Graphs with Neural Machine Translation and Entity Linking | | 0 |
| A Knowledge-based Approach for Answering Complex Questions in Persian | | 0 |
| NOTE: Solution for KDD-CUP 2021 WikiKG90M-LSC | | 0 |
| Coarse-to-Careful: Seeking Semantic-related Knowledge for Open-domain Commonsense Question Answering | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |