SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 61516175 of 10817 papers

TitleStatusHype
Grounded Graph Decoding Improves Compositional Generalization in Question AnsweringCode0
Medicines Question Answering System, MeQA0
Reducing the impact of out of vocabulary words in the translation of natural language questions into SPARQL queries0
SERC: Syntactic and Semantic Sequence based Event Relation Classification0
UQuAD1.0: Development of an Urdu Question Answering Training Data for Machine Reading Comprehension0
Callee: Recovering Call Graphs for Binaries with Transfer and Contrastive LearningCode1
ViVQA: Vietnamese Visual Question AnsweringCode1
Study of Similarity Measures as Features in Classification for Answer Sentence Selection Task in Hindi Question Answering: Language-Specific v/s Other Measures0
Multi-Domain Multilingual Question AnsweringCode1
ConQuest: Contextual Question Paraphrasing through Answer-Aware Synthetic Question Generation0
Clustering Monolingual Vocabularies to Improve Cross-Lingual Generalization0
The Global Banking Standards QA Dataset (GBS-QA)0
A Fact Checking and Verification System for FEVEROUS Using a Zero-Shot Learning Approach0
What’s in a Name? Answer Equivalence For Open-Domain Question AnsweringCode0
Evaluation Paradigms in Question Answering0
Distantly-Supervised Dense Retrieval Enables Open-Domain Question Answering without Evidence AnnotationCode1
MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering DatasetCode1
Q^2: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering0
Have You Seen That Number? Investigating Extrapolation in Question Answering Models0
What’s in Your Head? Emergent Behaviour in Multi-Task Transformer Models0
Can Question Generation Debias Question Answering Models? A Case Study on Question–Context Lexical Overlap0
Eliciting Bias in Question Answering Models through AmbiguityCode0
Toward Deconfounding the Effect of Entity Demographics for Question Answering Accuracy0
Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval0
SOM-NCSCM : An Efficient Neural Chinese Sentence Compression Model Enhanced with Self-Organizing Map0
Show:102550
← PrevPage 247 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified