SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 67766800 of 10817 papers

TitleStatusHype
Recent Advances in Automated Question Answering In Biomedical Domain0
MNet-Sim: A Multi-layered Semantic Similarity Network to Evaluate Sentence Similarity0
Ontology-based question answering over corporate structured data0
Visual Question Answering based on Formal Logic0
Grounded Graph Decoding Improves Compositional Generalization in Question AnsweringCode0
Medicines Question Answering System, MeQA0
Reducing the impact of out of vocabulary words in the translation of natural language questions into SPARQL queries0
SERC: Syntactic and Semantic Sequence based Event Relation Classification0
UQuAD1.0: Development of an Urdu Question Answering Training Data for Machine Reading Comprehension0
Clustering Monolingual Vocabularies to Improve Cross-Lingual Generalization0
Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed DomainsCode0
Adapting Entities across Languages and Cultures0
Q^2: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering0
Study of Similarity Measures as Features in Classification for Answer Sentence Selection Task in Hindi Question Answering: Language-Specific v/s Other Measures0
Self Question-answering: Aspect-based Sentiment Analysis by Role Flipped Machine Reading ComprehensionCode0
Textual Time Travel: A Temporally Informed Approach to Theory of Mind0
Relation-aware Bidirectional Path Reasoning for Commonsense Question Answering0
Evaluation Paradigms in Question Answering0
Can Question Generation Debias Question Answering Models? A Case Study on Question–Context Lexical Overlap0
Can predicate-argument relationships be extracted from UD trees?0
A Pretraining Numerical Reasoning Model for Ordinal Constrained Question Answering on Knowledge Base0
Using Question Answering Rewards to Improve Abstractive SummarizationCode0
Winnowing Knowledge for Multi-choice Question Answering0
Narrative Embedding: Re-Contextualization Through Attention0
Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question AnsweringCode0
Show:102550
← PrevPage 272 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified