SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotpotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
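To make the two metrics concrete, here is a minimal sketch of how EM and token-level F1 are commonly computed for extractive question answering. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles); the page itself does not specify a normalization, and the function names `normalize`, `exact_match`, and `f1_score` are illustrative rather than taken from any particular library.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the SQuAD-style convention; other benchmarks
    may normalize differently.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower!"))
    print(f1_score("Eiffel Tower in Paris", "the Eiffel Tower"))
```

Leaderboards usually report these scores as percentages averaged over all questions, and when a question has several reference answers the maximum score over the references is typically taken.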


Papers

Showing 6176 to 6200 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| Less is More: Pretrain a Strong Siamese Encoder for Dense Text Retrieval Using a Weak Decoder | Code | 1 |
| Narrative Embedding: Re-Contextualization Through Attention | | 0 |
| Large-Scale Relation Learning for Question Answering over Knowledge Bases with Pre-trained Language Models | Code | 1 |
| CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization | | 0 |
| Incorporating medical knowledge in BERT for clinical relation extraction | | 0 |
| Coupling Context Modeling with Zero Pronoun Recovering for Document-Level Natural Language Generation | Code | 0 |
| Improving Query Graph Generation for Complex Question Answering over Knowledge Base | | 0 |
| GANDALF: a General Character Name Description Dataset for Long Fiction | | 0 |
| Neural Natural Logic Inference for Interpretable Question Answering | Code | 0 |
| Zero-Shot Clinical Questionnaire Filling From Human-Machine Interactions | | 0 |
| Adapting Entities across Languages and Cultures | | 0 |
| Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question Answering | Code | 0 |
| Self Question-answering: Aspect-based Sentiment Analysis by Role Flipped Machine Reading Comprehension | Code | 0 |
| Can NLI Models Verify QA Systems’ Predictions? | Code | 1 |
| A Pretraining Numerical Reasoning Model for Ordinal Constrained Question Answering on Knowledge Base | | 0 |
| Unseen Entity Handling in Complex Question Answering over Knowledge Base via Language Generation | | 0 |
| MIRTT: Learning Multimodal Interaction Representations from Trilinear Transformers for Visual Question Answering | Code | 0 |
| Diversity and Consistency: Exploring Visual Question-Answer Pair Generation | | 0 |
| KERS: A Knowledge-Enhanced Framework for Recommendation Dialog Systems with Multiple Subgoals | Code | 0 |
| AutoEQA: Auto-Encoding Questions for Extractive Question Answering | | 0 |
| A Divide-And-Conquer Approach for Multi-label Multi-hop Relation Detection in Knowledge Base Question Answering | | 0 |
| Using Question Answering Rewards to Improve Abstractive Summarization | Code | 0 |
| A Multi-label Multi-hop Relation Detection Model based on Relation-aware Sequence Generation | | 0 |
| Textual Time Travel: A Temporally Informed Approach to Theory of Mind | | 0 |
| Aspect-based Sentiment Analysis in Question Answering Forums | Code | 0 |

Benchmark Results

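| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |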