SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, and WikiQA, among many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
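As a rough illustration of the two metrics named above, here is a minimal sketch of how EM and token-level F1 are commonly computed for extractive question answering. The normalization steps (lowercasing, stripping punctuation and English articles) follow the convention popularized by SQuAD-style evaluation; the function names are hypothetical and this is not any benchmark's official scorer.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall after normalization."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        # If either side is empty, score 1.0 only when both are empty.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower!"))
    print(f1_score("in Paris France", "Paris"))
```

Benchmarks that provide several reference answers per question typically take the maximum score over the references and then average across questions; that step is omitted here for brevity.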


Papers

Showing 7826-7850 of 10817 papers

Title | Status | Hype
Interactive Natural Language-based Person Search | Code | 0
SentenceMIM: A Latent Variable Language Model | Code | 1
Neural Relation Prediction for Simple Question Answering over Knowledge Graph | | 0
CQ-VQA: Visual Question Answering on Categorized Questions | | 0
Text-based Question Answering from Information Retrieval and Deep Neural Network Perspectives: A Survey | | 0
Enriching Knowledge Bases with Interesting Negative Statements | | 0
Learning Relation Entailment with Structured and Textual Information | Code | 0
FQuAD: French Question Answering Dataset | | 0
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding Tasks | | 0
Transformers as Soft Reasoners over Language | Code | 1
Sparse and Structured Visual Attention | Code | 0
Component Analysis for Visual Question Answering Architectures | | 0
ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning | Code | 1
REALM: Retrieval-Augmented Language Model Pre-Training | Code | 1
Self-Attentive Associative Memory | Code | 1
Description Based Text Classification with Reinforcement Learning | | 0
Translating Web Search Queries into Natural Language Questions | | 0
Generating Biomedical Question Answering Corpora from Q&A forums | Code | 0
Multimodal fusion of imaging and genomics for lung cancer recurrence prediction | Code | 1
UNCC Biomedical Semantic Question Answering Systems. BioASQ: Task-7B, Phase-B | | 0
Goal-Oriented Multi-Task BERT-Based Dialogue State Tracker | | 0
K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters | Code | 1
Web Table Extraction, Retrieval and Augmentation: A Survey | | 0
Teaching Machines to Converse | | 0
Break It Down: A Question Understanding Benchmark | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 | | Unverified
2 | FPNet (ensemble) | EM | 90.87 | | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified
6 | FPNet (ensemble) | EM | 90.6 | | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified