SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
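As a rough illustration of the EM and F1 metrics mentioned above, the following is a minimal sketch of how they are commonly computed for extractive question answering. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles, collapsing whitespace); the function names are hypothetical, and individual benchmarks may normalize or aggregate differently.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the normalization commonly used for SQuAD-style
    evaluation; other benchmarks may apply different rules.
    """
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, otherwise 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

When a question has several reference answers, a common convention is to take the maximum score over the references and then average over all questions; that aggregation step is omitted here.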


Papers

Showing 8051-8100 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| Using Local Knowledge Graph Construction to Scale Seq2Seq Models to Multi-Document Inputs | Code | 0 |
| PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable | Code | 0 |
| SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble | Code | 0 |
| Universal Text Representation from BERT: An Empirical Study | | 0 |
| Bridging the Knowledge Gap: Enhancing Question Answering with World and Domain Knowledge | | 0 |
| MLQA: Evaluating Cross-lingual Extractive Question Answering | Code | 1 |
| Unsupervised Question Answering for Fact-Checking | | 0 |
| Why can't memory networks read effectively? | | 0 |
| Efficiently Embedding Dynamic Knowledge Graphs | | 0 |
| Answering Complex Open-domain Questions Through Iterative Query Generation | Code | 0 |
| NumNet: Machine Reading Comprehension with Numerical Reasoning | Code | 0 |
| Enhancing the Transformer with Explicit Relational Encoding for Math Problem Solving | Code | 0 |
| Structured Pruning of a BERT-based Question Answering Model | | 0 |
| Whatcha lookin' at? DeepLIFTing BERT's Attention in Question Answering | Code | 0 |
| Neural Memory Plasticity for Anomaly Detection | | 0 |
| Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base | Code | 0 |
| Multi-modal Deep Analysis for Multimedia | | 0 |
| Learning Analogy-Preserving Sentence Embeddings for Answer Selection | | 0 |
| Multilingual Question Answering from Formatted Text applied to Conversational Agents | | 0 |
| Domain-Relevant Embeddings for Medical Question Similarity | | 0 |
| Modulated Self-attention Convolutional Network for VQA | | 0 |
| Look before you Hop: Conversational Question Answering over Knowledge Graphs Using Judicious Context Expansion | Code | 0 |
| Multi-hop Question Answering via Reasoning Chains | Code | 0 |
| Measuring Sentences Similarity: A Survey | | 0 |
| REMIND Your Neural Network to Prevent Catastrophic Forgetting | Code | 0 |
| Multi-level Gated Recurrent Neural Network for Dialog Act Classification | | 0 |
| DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter | Code | 1 |
| 基於特徵粒度之訓練策略於中文口語問答系統之應用 (A Feature-granularity Training Strategy for Chinese Spoken Question Answering) | | 0 |
| 基於深度學習之簡答題問答系統初步探討 (A Preliminary Study on Deep Learning-based Short Answer Question Answering System) | | 0 |
| Neural Question Generation using Interrogative Phrases | | 0 |
| BERT for Question Generation | | 0 |
| SegEQA: Video Segmentation Based Visual Attention for Embodied Question Answering | | 0 |
| From Strings to Things: Knowledge-Enabled VQA Model That Can Read and Reason | | 0 |
| MMM: Multi-stage Multi-task Learning for Multi-choice Reading Comprehension | Code | 0 |
| Improved Word Sense Disambiguation Using Pre-Trained Contextualized Word Representations | Code | 0 |
| Identifying Supporting Facts for Multi-hop Question Answering with Document Graph Networks | | 0 |
| On Incorporating Semantic Prior Knowledge in Deep Learning Through Embedding-Space Constraints | | 0 |
| A Simple and Effective Model for Answering Multi-span Questions | Code | 0 |
| A Pilot Study for Chinese SQL Semantic Parsing | Code | 2 |
| Named Entity Recognition System for Sindhi Language | | 0 |
| OpenNRE: An Open and Extensible Toolkit for Neural Relation Extraction | Code | 2 |
| Distantly Supervised Question Parsing | Code | 0 |
| ALBERT: A Lite BERT for Self-supervised Learning of Language Representations | Code | 2 |
| Overcoming Data Limitation in Medical Visual Question Answering | Code | 1 |
| Spoken Conversational Search for General Knowledge | | 0 |
| Compact Trilinear Interaction for Visual Question Answering | Code | 0 |
| Why Does the VQA Model Answer No?: Improving Reasoning through Visual and Linguistic Inference | | 0 |
| AlignNet: Self-supervised Alignment Module | | 0 |
| UNITER: Learning UNiversal Image-TExt Representations | | 0 |
| Linguistic Embeddings as a Common-Sense Knowledge Repository: Challenges and Opportunities | | 0 |

Benchmark Results

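| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |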