SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 75767600 of 10817 papers

TitleStatusHype
Unsupervised Multiple Choices Question Answering: Start Learning from Basic KnowledgeCode0
Knowledge Distillation for Improved Accuracy in Spoken Question Answering0
Is Retriever Merely an Approximator of Reader?0
Contextualized Attention-based Knowledge Transfer for Spoken Conversational Question Answering0
S_Covid: An Engine to Explore COVID-19 Scientific Literature0
Extracting Procedural Knowledge from Technical Documents0
SOrT-ing VQA Models : Contrastive Gradient Learning for Improved ConsistencyCode0
Pushing the Limits of AMR Parsing with Self-Learning0
Technical Question Answering across Tasks and DomainsCode0
Better Distractions: Transformer-based Distractor Generation and Multiple Choice Question Filtering0
BERTnesia: Investigating the capture and forgetting of knowledge in BERTCode0
Understanding Unnatural Questions Improves Reasoning over Text0
Summary-Oriented Question Generation for Informational Queries0
QuesBELM: A BERT based Ensemble Language Model for Natural Questions0
Towards Data Distillation for End-to-end Spoken Conversational Question Answering0
Towards Interpreting BERT for Reading Comprehension Based QACode0
Hierarchical Conditional Relation Networks for Multimodal Video Question Answering0
Question Answering over Knowledge Base using Language Model Embeddings0
Answer-checking in Context: A Multi-modal FullyAttention Network for Visual Question Answering0
Hierarchical Multitask Learning Approach for BERT0
New Ideas and Trends in Deep Multimodal Content Understanding: A Review0
QA2Explanation: Generating and Evaluating Explanations for Question Answering Systems over Knowledge Graph0
Human Adversarial QA: Did the Model Understand the Paragraph?0
RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering0
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks0
Show:102550
← PrevPage 304 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified