SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 62016225 of 10817 papers

TitleStatusHype
Aspect-based Sentiment Analysis in Question Answering ForumsCode0
A Free Format Legal Question Answering System0
Learning from Limited Labels for Long Legal Dialogue0
Relation-aware Bidirectional Path Reasoning for Commonsense Question Answering0
Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed DomainsCode0
Understanding the Extent to which Content Quality Metrics Measure the Information Quality of Summaries0
Can predicate-argument relationships be extracted from UD trees?0
A Transformer Based Approach towards Identification of Discourse Unit Segments and Connectives0
Enhanced Language Representation with Label Knowledge for Span ExtractionCode1
Discourse Comprehension: A Question Answering Framework to Represent Sentence ConnectionsCode0
Introspective Distillation for Robust Question AnsweringCode1
Text Classification for Task-based Source Code Related Questions0
DSC-IITISM at FinCausal 2021: Combining POS tagging with Attention-based Contextual Representations for Identifying Causal Relationships in Financial Documents0
Learning Representations for Zero-Shot Retrieval over Structured Data0
On the Feasibility of Predicting Questions being Forgotten in Stack Overflow0
MetaICL: Learning to Learn In ContextCode1
Path-Enhanced Multi-Relational Question Answering with Knowledge Graph Embeddings0
Multi-stage Clarification in Conversational AI: The case of Question-Answering Dialogue Systems0
Dense Hierarchical Retrieval for Open-Domain Question AnsweringCode1
What makes us curious? analysis of a corpus of open-domain questions0
Ask me in your own words: paraphrasing for multitask question answeringCode0
Perceptual Score: What Data Modalities Does Your Model Perceive?Code0
SQALER: Scaling Question Answering by Decoupling Multi-Hop and Logical Reasoning0
Meta-Knowledge Transfer for Inductive Knowledge Graph EmbeddingCode1
Transferring Domain-Agnostic Knowledge in Video Question Answering0
Show:102550
← PrevPage 249 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified