SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 79267950 of 10817 papers

TitleStatusHype
Knowledge Graphs and Knowledge Networks: The Story in Brief0
Practical Annotation Strategies for Question Answering Datasets0
Noise Estimation Using Density Estimation for Self-Supervised Multimodal LearningCode0
Natural Language QA Approaches using Reasoning with External Knowledge0
Uncovering Hidden Semantics of Set Information in Knowledge BasesCode0
A Study on Efficiency, Accuracy and Document Structure for Answer Sentence Selection0
A Question-Centric Model for Visual Question Answering in Medical ImagingCode0
A Study on Multimodal and Interactive Explanations for Visual Question Answering0
DC-BERT: Decoupling Question and Document for Efficient Contextual Encoding0
Unshuffling Data for Improved Generalization0
Masking Orchestration: Multi-task Pretraining for Multi-role Dialogue Representation LearningCode0
Adv-BERT: BERT is not robust on misspellings! Generating nature adversarial samples on BERT0
Generating Followup Questions for Interpretable Multi-hop Question Answering0
Exploring BERT Parameter Efficiency on the Stanford Question Answering Dataset v2.00
Multimodal Transformer with Pointer Network for the DSTC8 AVSD Challenge0
End-to-End Entity Linking and Disambiguation leveraging Word and Knowledge Graph Embeddings0
FONDUE: A Framework for Node Disambiguation Using Network Embeddings0
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering0
Do Multi-Hop Question Answering Systems Know How to Answer the Single-Hop Sub-Questions?0
Training Question Answering Models From Synthetic Data0
Is Aligning Embedding Spaces a Challenging Task? A Study on Heterogeneous Embedding Alignment Methods0
VQA-LOL: Visual Question Answering under the Lens of Logic0
Interactive Natural Language-based Person SearchCode0
Neural Relation Prediction for Simple Question Answering over Knowledge Graph0
CQ-VQA: Visual Question Answering on Categorized Questions0
Show:102550
← PrevPage 318 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified