SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 93519375 of 10817 papers

TitleStatusHype
Learning to Disambiguate by Asking Discriminative Questions0
Robust Computer Algebra, Theorem Proving, and Oracle AI0
ISS-MULT: Intelligent Sample Selection for Multi-Task Learning in Question Answering0
Asking Too Much? The Rhetorical Role of Questions in Political Discourse0
Automatic Question-Answering Using A Deep Similarity Neural Network0
Multi-modal Factorized Bilinear Pooling with Co-Attention Learning for Visual Question AnsweringCode0
MemexQA: Visual Memex Question AnsweringCode0
A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering ModelsCode0
Query expansion techniques for information retrieval: A survey0
Evaluating Natural Language Understanding Services for Conversational Question Answering SystemsCode0
Redundancy Localization for the Conversationalization of Unstructured Responses0
Results of the fifth edition of the BioASQ Challenge0
Macquarie University at BioASQ 5b -- Query-based Summarisation Techniques for Selecting the Ideal Answers0
Modeling Large-Scale Structured Relationships with Shared Memory for Knowledge Base Completion0
Assessing the performance of Olelo, a real-time biomedical question answering application0
Evaluating Feature Extraction Methods for Knowledge-based Biomedical Word Sense Disambiguation0
Mining Social Science Publications for Survey Variables0
A Biomedical Question Answering System in BioASQ 20170
Automatic classification of doctor-patient questions for a virtual patient record query task0
A Multi-strategy Query Processing Approach for Biomedical Question Answering: USTB\_PRIR at BioASQ 2017 Task 5B0
Tackling Biomedical Text Summarization: OAQA at BioASQ 5B0
The Event StoryLine Corpus: A New Benchmark for Causal and Temporal Relation Extraction0
Spectral Graph-Based Method of Multimodal Word Embedding0
Towards Harnessing Memory Networks for Coreference Resolution0
Sentence Alignment using Unfolding Recursive Autoencoders0
Show:102550
← PrevPage 375 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified