SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 80768100 of 10817 papers

TitleStatusHype
A Spoken Dialogue System for Spatial Question Answering in a Physical Blocks World0
Learning from Explanations with Neural Execution TreeCode0
BAS: An Answer Selection Method Using BERT Language Model0
Question Answering for Privacy Policies: Combining Computational and Legal PerspectivesCode0
MRNN: A Multi-Resolution Neural Network with Duplex Attention for Document Retrieval in the Context of Question Answering0
Scene Graph based Image Retrieval -- A case study on the CLEVR Dataset0
How to Pre-Train Your Model? Comparison of Different Pre-Training Models for Biomedical Question Answering0
Asking Clarification Questions in Knowledge-Based Question Answering0
Ranking and Sampling in Open-Domain Question Answering0
Question Answering Using Hierarchical Attention on Top of BERT Features0
Fine-tune BERT with Sparse Self-Attention Mechanism0
Finding Generalizable Evidence by Learning to Convince Q\&A Models0
Proceedings of the 2nd Workshop on Machine Reading for Question Answering0
Extractive NarrativeQA with Heuristic Pre-Training0
Exploring Diverse Expressions for Paraphrase Generation0
A Recurrent BERT-based Model for Question GenerationCode0
On Making Reading Comprehension More Comprehensive0
Can You Unpack That? Learning to Rewrite Questions-in-Context0
Evaluating Question Answering Evaluation0
Revisiting the Evaluation of Theory of Mind through Question Answering0
CALOR-QUEST : generating a training corpus for Machine Reading Comprehension models from shallow semantic annotations0
Multi-View Domain Adapted Sentence Embeddings for Low-Resource Unsupervised Duplicate Question Detection0
Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling0
YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension0
Answering Naturally: Factoid to Full length Answer GenerationCode0
Show:102550
← PrevPage 324 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified