SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 79768000 of 10817 papers

TitleStatusHype
Towards Domain Adaptation from Limited Data for Question Answering Using Deep Neural Networks0
Unsupervised Domain Adaptation of Contextual Embeddings for Low-Resource Duplicate Question Detection0
Multi-Paragraph Reasoning with Knowledge-enhanced Graph Neural Network0
Learning to Answer by Learning to Ask: Getting the Best of GPT-2 and BERT Worlds0
A Spoken Dialogue System for Spatial Question Answering in a Physical Blocks World0
BAS: An Answer Selection Method Using BERT Language Model0
Learning from Explanations with Neural Execution TreeCode0
Question Answering for Privacy Policies: Combining Computational and Legal PerspectivesCode0
MRNN: A Multi-Resolution Neural Network with Duplex Attention for Document Retrieval in the Context of Question Answering0
Scene Graph based Image Retrieval -- A case study on the CLEVR Dataset0
How to Pre-Train Your Model? Comparison of Different Pre-Training Models for Biomedical Question Answering0
Asking Clarification Questions in Knowledge-Based Question Answering0
Ranking and Sampling in Open-Domain Question Answering0
Fine-tune BERT with Sparse Self-Attention Mechanism0
Finding Generalizable Evidence by Learning to Convince Q\&A Models0
Social IQa: Commonsense Reasoning about Social Interactions0
Exploring Diverse Expressions for Paraphrase Generation0
Revisiting the Evaluation of Theory of Mind through Question Answering0
Can You Unpack That? Learning to Rewrite Questions-in-Context0
Multi-View Domain Adapted Sentence Embeddings for Low-Resource Unsupervised Duplicate Question Detection0
YouMakeup: A Large-Scale Domain-Specific Multimodal Dataset for Fine-Grained Semantic Comprehension0
Bridging the Gap between Relevance Matching and Semantic Matching for Short Text Similarity Modeling0
MICRON: Multigranular Interaction for Contextualizing RepresentatiON in Non-factoid Question Answering0
Memory Graph Networks for Explainable Memory-grounded Question Answering0
Video Dialog via Progressive Inference and Cross-Transformer0
Show:102550
← PrevPage 320 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified