SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 51265150 of 10817 papers

TitleStatusHype
Towards a Unified Multi-Dimensional Evaluator for Text GenerationCode2
OpenCQA: Open-ended Question Answering with ChartsCode1
Improving Question Answering with Generation of NQ-like Questions0
Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive LearningCode2
CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm0
Are Sample-Efficient NLP Models More Robust?0
SlotFormer: Unsupervised Visual Dynamics Simulation with Object-Centric ModelsCode1
Task Compass: Scaling Multi-task Pre-training with Task PrefixCode1
Relational Graph Convolutional Neural Networks for Multihop Reasoning: A Comparative Study0
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document UnderstandingCode1
Question Answering Over Biological Knowledge Graph via Amazon Alexa0
Discourse Analysis via Questions and Answers: Parsing Dependency Structures of Questions Under DiscussionCode1
Probing Commonsense Knowledge in Pre-trained Language Models with Sense-level Precision and Expanded VocabularyCode0
TwiRGCN: Temporally Weighted Graph Convolution for Question Answering over Temporal Knowledge Graphs0
Context Generation Improves Open Domain Question Answering0
Decoupled Context Processing for Context Augmented Language Modeling0
CSS: Combining Self-training and Self-supervised Learning for Few-shot Dialogue State Tracking0
Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question AnsweringCode1
Capturing Global Structural Information in Long Document Question Answering with Compressive Graph Selector NetworkCode0
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training ModelCode1
Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQACode1
How Well Do Multi-hop Reading Comprehension Models Understand Date Information?Code0
Contrastive Video-Language Learning with Fine-grained Frame Sampling0
Semantic Framework based Query Generation for Temporal Question Answering over Knowledge Graphs0
Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQACode1
Show:102550
← PrevPage 206 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified