SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 68766900 of 10817 papers

TitleStatusHype
Attention-guided Generative Models for Extractive Question Answering0
A Survey on Legal Question Answering Systems0
Explainable Fact-checking through Question Answering0
Beyond Accuracy: A Consolidated Tool for Visual Question Answering BenchmarkingCode0
AskMe: Joint Individual-level and Community-level Behavior Interaction for Question Recommendation0
Enhance Long Text Understanding via Distilled Gist Detector from Abstractive Summarization0
A Framework for Rationale Extraction for Deep QA models0
The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design0
KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question Answering0
Multi-tasking Dialogue Comprehension with Discourse ParsingCode0
A Comparative Study of Transformer-Based Language Models on Extractive Question Answering0
Noisy Text Data: Achilles' Heel of popular transformer based NLP models0
GNN is a Counter? Revisiting GNN for Question Answering0
COVIDRead: A Large-scale Question Answering Dataset on COVID-190
Perhaps PTLMs Should Go to School -- A Task to Assess Open Book and Closed Book QA0
Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering0
Asking questions on handwritten document collections0
Generating User-Centred Explanations via Illocutionary Question Answering: From Philosophy to InterfacesCode0
Perhaps PTLMs Should Go to School – A Task to Assess Open Book and Closed Book QA0
A Flexible and Extensible Framework for Multiple Answer Modes Question Answering0
Self-Attentive Constituency Parsing for UCCA-based Semantic Parsing0
Generating Justifications in a Spatial Question-Answering Dialogue System for a Blocks World0
Breaking Down Questions for Outside-Knowledge VQA0
Variational Disentangled Attention for Regularized Visual Dialog0
Measuring CLEVRness: Black-box Testing of Visual Reasoning Models0
Show:102550
← PrevPage 276 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified