SOTAVerified

Question Answering

Question answering can be divided into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
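As a rough illustration of the two metrics named above, here is a minimal sketch of SQuAD-style exact match and token-level F1. The function names (`normalize`, `exact_match`, `token_f1`) and the normalization steps (lowercasing, stripping punctuation and English articles) are assumptions chosen for this example; individual benchmarks may normalize answers differently.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def token_f1(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall after normalization."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often as it
    # appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Leaderboard figures such as an EM of 90.94 are typically these per-example scores averaged over the evaluation set and expressed as a percentage. When a question has several reference answers, the usual convention is to take the best score across them.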


Papers

Showing 8051-8075 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| Using Local Knowledge Graph Construction to Scale Seq2Seq Models to Multi-Document Inputs | Code | 0 |
| PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable | Code | 0 |
| Universal Text Representation from BERT: An Empirical Study | | 0 |
| SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble | Code | 0 |
| Bridging the Knowledge Gap: Enhancing Question Answering with World and Domain Knowledge | | 0 |
| MLQA: Evaluating Cross-lingual Extractive Question Answering | Code | 1 |
| Unsupervised Question Answering for Fact-Checking | | 0 |
| Why can't memory networks read effectively? | | 0 |
| Efficiently Embedding Dynamic Knowledge Graphs | | 0 |
| Answering Complex Open-domain Questions Through Iterative Query Generation | Code | 0 |
| NumNet: Machine Reading Comprehension with Numerical Reasoning | Code | 0 |
| Enhancing the Transformer with Explicit Relational Encoding for Math Problem Solving | Code | 0 |
| Structured Pruning of a BERT-based Question Answering Model | | 0 |
| Whatcha lookin' at? DeepLIFTing BERT's Attention in Question Answering | Code | 0 |
| Neural Memory Plasticity for Anomaly Detection | | 0 |
| Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base | Code | 0 |
| Multi-modal Deep Analysis for Multimedia | | 0 |
| Learning Analogy-Preserving Sentence Embeddings for Answer Selection | | 0 |
| Multilingual Question Answering from Formatted Text applied to Conversational Agents | | 0 |
| Domain-Relevant Embeddings for Medical Question Similarity | | 0 |
| Modulated Self-attention Convolutional Network for VQA | | 0 |
| Look before you Hop: Conversational Question Answering over Knowledge Graphs Using Judicious Context Expansion | Code | 0 |
| Multi-hop Question Answering via Reasoning Chains | Code | 0 |
| Measuring Sentences Similarity: A Survey | | 0 |
| REMIND Your Neural Network to Prevent Catastrophic Forgetting | Code | 0 |
Page 323 of 433

Benchmark Results

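| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |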