SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 86018650 of 10817 papers

TitleStatusHype
Answering Science Exam Questions Using Query Reformulation with Background Knowledge0
On Generality and Knowledge Transferability in Cross-Domain Duplicate Question Detection for Heterogeneous Community Question Answering0
Exploiting Sentence Embedding for Medical Question Answering0
End-to-End Learning for Answering Structured Queries Directly over Text0
Improving Skin Condition Classification with a Question Answering Model0
Cross-lingual Short-text Matching with Deep Learning0
Translating Natural Language to SQL using Pointer-Generator Networks and How Decoding Order Matters0
Blindfold Baselines for Embodied QACode0
An Introductory Survey on Attention Mechanisms in NLP Problems0
Holistic Multi-modal Memory Network for Movie Question Answering0
CQASUMM: Building References for Community Question Answering Summarization CorporaCode0
ReDecode Framework for Iterative Improvement in Paraphrase Generation0
Densely Connected Attention Propagation for Reading ComprehensionCode1
Adversarial Sampling and Training for Semi-Supervised Information Retrieval0
Encoding Implicit Relation Requirements for Relation Extraction: A Joint Inference Approach0
Towards Compositional Distributional Discourse Analysis0
Compositional Language Understanding with Text-based Relational ReasoningCode0
Parser Extraction of Triples in Unstructured TextCode0
Improving Span-based Question Answering Systems with Coarsely Labeled Data0
ATP: Directed Graph Embedding with Asymmetric Transitivity PreservationCode0
CommonsenseQA: A Question Answering Challenge Targeting Commonsense KnowledgeCode1
Learning to Rank Query Graphs for Complex Question Answering over Knowledge GraphsCode0
Zero-Shot Transfer VQA Dataset0
Automatic Opinion Question GenerationCode0
Exploiting Attention to Reveal Shortcomings in Memory Models0
MindLab Neural Network Approach at BioASQ 6B0
Frame- and Entity-Based Knowledge for Common-Sense Argumentative ReasoningCode0
Extraction Meets Abstraction: Ideal Answer Generation for Biomedical Questions0
An Adaption of BIOASQ Question Answering dataset for Machine Reading systems by Manual Annotations of Answer Spans.0
Proceedings of the 6th BioASQ Workshop A challenge on large-scale biomedical semantic indexing and question answering0
Macquarie University at BioASQ 6b: Deep learning and deep reinforcement learning for query-based summarisation0
How much should you ask? On the question structure in QA systems.0
Preferred Answer Selection in Stack Overflow: Better Text Representations ... and Metadata, Metadata, Metadata0
AttentionMeSH: Simple, Effective and Interpretable Automatic MeSH Indexer0
Ontology-Based Retrieval \& Neural Approaches for BioASQ Ideal Answer Generation0
Results of the sixth edition of the BioASQ Challenge0
Expletives in Universal Dependency TreebanksCode0
Explaining non-linear Classifier Decisions within Kernel-based Deep Architectures0
Team SWEEPer: Joint Sentence Extraction and Fact Checking with Pointer Networks0
Zero-shot Relation Classification as Textual Entailment0
Semantic role labeling tools for biomedical question answering: a study of selected tools on the BioASQ datasets0
SIRIUS-LTG: An Entity Linking Approach to Fact Extraction and Verification0
Retrieve and Re-rank: A Simple and Effective IR Approach to Simple Question Answering over Knowledge Graphs0
UNCC QA: Biomedical Question Answering system0
OpenKE: An Open Toolkit for Knowledge EmbeddingCode0
An Interactive Web-Interface for Visualizing the Inner Workings of the Question Answering LSTM0
CogCompTime: A Tool for Understanding Time in Natural Language0
Interactive Instance-based Evaluation of Knowledge Base Question AnsweringCode0
On the Generation of Medical Question-Answer Pairs0
Shifting the Baseline: Single Modality Performance on Visual Navigation & QA0
Show:102550
← PrevPage 173 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified