SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 82018225 of 10817 papers

TitleStatusHype
CFO: A Framework for Building Production NLP Systems0
Learning Representations and Agents for Information Retrieval0
Abductive Commonsense ReasoningCode0
Multi-class Hierarchical Question Classification for Multiple Choice Science ExamsCode0
Integrating Multimodal Information in Large Pretrained TransformersCode0
A Multi-Type Multi-Span Network for Reading Comprehension that Requires Discrete ReasoningCode0
Reasoning-Driven Question-Answering for Natural Language Understanding0
Reactive Multi-Stage Feature Fusion for Multimodal Dialogue Modeling0
VideoNavQA: Bridging the Gap between Visual and Embodied Question AnsweringCode1
Towards Optimisation of Collaborative Question Answering over Knowledge Graphs0
SG-Net: Syntax-Guided Machine Reading ComprehensionCode0
Fusion of Detected Objects in Text for Visual Question Answering0
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding0
Variational Fusion for Multimodal Sentiment Analysis0
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA SystemCode0
Why Does a Visual Question Have Different Answers?0
AmazonQA: A Review-Based Question Answering TaskCode0
Multimodal Unified Attention Networks for Vision-and-Language Interactions0
Multi-modality Latent Interaction Network for Visual Question Answering0
TEQUILA: Temporal Question Answering over Knowledge Bases0
A Generate-Validate Approach to Answering Questions about Qualitative Relationships0
Question-Agnostic Attention for Visual Question Answering0
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense0
Mitigating Noisy Inputs for Question Answering0
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language TasksCode1
Show:102550
← PrevPage 329 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified