SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotpotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
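As an illustration of the two metrics named above, here is a minimal sketch of EM and token-level F1 for a single prediction. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles, collapsing whitespace); the function names `normalize_answer`, `exact_match`, and `f1_score` are chosen for this example and are not taken from any particular benchmark's official scorer, which may differ in detail.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the normalization commonly used for
    SQuAD-style scoring; other benchmarks may normalize differently.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token precision and recall after normalization."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower!"))
    print(f1_score("in Paris France", "Paris"))
```

When a question has several reference answers, a common convention is to take the maximum score over the references for each question and then average over the dataset, reporting the result as a percentage.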


Papers

Showing 8001–8025 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| Representing Movie Characters in Dialogues | | 0 |
| YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension | | 0 |
| Incorporating Contextual and Syntactic Structures Improves Semantic Similarity Modeling | | 0 |
| IFlyLegal: A Chinese Legal System for Consultation, Law Searching, and Document Analysis | | 0 |
| TextGraphs 2019 Shared Task on Multi-Hop Inference for Explanation Regeneration | | 0 |
| Question Answering Using Hierarchical Attention on Top of BERT Features | | 0 |
| Proceedings of the 2nd Workshop on Machine Reading for Question Answering | | 0 |
| Extractive NarrativeQA with Heuristic Pre-Training | | 0 |
| A Recurrent BERT-based Model for Question Generation | Code | 0 |
| Team SVMrank: Leveraging Feature-rich Support Vector Machines for Ranking Explanations to Elementary Science Questions | | 0 |
| On Making Reading Comprehension More Comprehensive | | 0 |
| Evaluating Question Answering Evaluation | | 0 |
| CALOR-QUEST : generating a training corpus for Machine Reading Comprehension models from shallow semantic annotations | | 0 |
| Samvaadhana: A Telugu Dialogue System in Hospital Domain | | 0 |
| Answering Naturally: Factoid to Full length Answer Generation | Code | 0 |
| D-NET: A Pre-Training and Fine-Tuning Framework for Improving the Generalization of Machine Reading Comprehension | Code | 0 |
| Bend but Don't Break? Multi-Challenge Stress Test for QA Models | | 0 |
| Generalizing Question Answering System with Pre-trained Language Model Fine-tuning | | 0 |
| What Question Answering can Learn from Trivia Nerds | | 0 |
| TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning Baselines | Code | 0 |
| Do Multi-hop Readers Dream of Reasoning Chains? | Code | 0 |
| A neural document language modeling framework for spoken document retrieval | | 0 |
| Ensembling Strategies for Answering Natural Questions | | 0 |
| Towards Generalizable Neuro-Symbolic Systems for Commonsense Question Answering | | 0 |
| Let Me Know What to Ask: Interrogative-Word-Aware Question Generation | | 0 |
Page 321 of 433

Benchmark Results

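| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |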