SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotpotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
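As a rough illustration of the two metrics named above, the following Python sketch computes exact match and token-level F1 for a single prediction against a single reference answer. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles, collapsing whitespace); other benchmarks may normalize differently, and the function names here are hypothetical rather than taken from any particular evaluation script.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Assumed SQuAD-style normalization: lowercase, drop punctuation,
    drop the articles a/an/the, and collapse whitespace."""
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower.", "Eiffel Tower"))
    print(f1_score("Eiffel Tower in Paris", "Eiffel Tower"))
```

Leaderboard figures are usually these per-question scores averaged over the dataset and reported as percentages. When a question has several reference answers, a common convention is to take the maximum score over the references.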


Papers

Showing 6801-6825 of 10817 papers

Title | Status | Hype
Sequence tagging for biomedical extractive question answering | Code | 1
Are Multilingual BERT models robust? A Case Study on Adversarial Attacks for Multilingual Question Answering | - | 0
Time-Stamped Language Model: Teaching Language Models to Understand the Flow of Events | Code | 1
UIT-E10dot3 at SemEval-2021 Task 5: Toxic Spans Detection with Named Entity Recognition and Question-Answering Approaches | - | 0
Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering | Code | 1
TransferNet: An Effective and Transparent Framework for Multi-hop Question Answering over Relation Graph | Code | 1
How to Train BERT with an Academic Budget | Code | 1
Toward Deconfounding the Influence of Entity Demographics for Question Answering Accuracy | - | 0
TWEAC: Transformer with Extendable QA Agent Classifiers | Code | 1
Event Detection as Question Answering with Entity Information | Code | 0
Jointly Learning Truth-Conditional Denotations and Groundings using Parallel Attention | - | 0
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images | Code | 0
What's in your Head? Emergent Behaviour in Multi-Task Transformer Models | - | 0
QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question Answering | Code | 1
Multi-Step Reasoning Over Unstructured Text with Beam Dense Retrieval | Code | 1
Structural analysis of an all-purpose question answering model | - | 0
Neuro-Symbolic VQA: A review from the perspective of AGI desiderata | - | 0
MultiModalQA: Complex Question Answering over Text, Tables and Images | - | 0
A Replication Study of Dense Passage Retriever | Code | 2
SpartQA: A Textual Question Answering Benchmark for Spatial Reasoning | Code | 1
Object-Centric Representation Learning for Video Question Answering | - | 0
Contextualized Knowledge-aware Attentive Neural Network: Enhancing Answer Selection with Knowledge | - | 0
Innovative Bert-based Reranking Language Models for Speech Recognition | - | 0
Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections | Code | 1
TransWiC at SemEval-2021 Task 2: Transformer-based Multilingual and Cross-lingual Word-in-Context Disambiguation | - | 0
Page 273 of 433

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 | - | Unverified
2 | FPNet (ensemble) | EM | 90.87 | - | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 | - | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 | - | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 | - | Unverified
6 | FPNet (ensemble) | EM | 90.6 | - | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 | - | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | - | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | - | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | - | Unverified