SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 21512175 of 10817 papers

TitleStatusHype
Asking Questions the Human Way: Scalable Question-Answer Generation from Text CorpusCode1
Retrospective Reader for Machine Reading ComprehensionCode1
ManyModalQA: Modality Disambiguation and QA over Diverse InputsCode1
Schema2QA: High-Quality and Low-Cost Q&A Agents for the Structured WebCode1
Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual FeaturesCode1
In Defense of Grid Features for Visual Question AnsweringCode1
Side-Tuning: A Baseline for Network Adaptation via Additive Side NetworksCode1
T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted AttackCode1
Differentiable Reasoning on Large Knowledge Bases and Natural LanguageCode1
PIQA: Reasoning about Physical Commonsense in Natural LanguageCode1
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question AnsweringCode1
Inductive Relation Prediction by Subgraph ReasoningCode1
Knowledge Guided Text Retrieval and Reading for Open Domain Question AnsweringCode1
Contextualized Sparse Representations for Real-Time Open-Domain Question AnsweringCode1
Multi-domain Dialogue State Tracking as Dynamic Knowledge Graph Enhanced Question AnsweringCode1
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and ComprehensionCode1
MLQA: Evaluating Cross-lingual Extractive Question AnsweringCode1
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighterCode1
Overcoming Data Limitation in Medical Visual Question AnsweringCode1
Reducing Transformer Depth on Demand with Structured DropoutCode1
UNITER: UNiversal Image-TExt Representation LearningCode1
Exploring Scholarly Data by Semantic Query on Knowledge Graph Embedding SpaceCode1
PubMedQA: A Dataset for Biomedical Research Question AnsweringCode1
How Does BERT Answer Questions? A Layer-Wise Analysis of Transformer RepresentationsCode1
Don't Take the Easy Way Out: Ensemble Based Methods for Avoiding Known Dataset BiasesCode1
Show:102550
← PrevPage 87 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified