SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 62516275 of 10817 papers

TitleStatusHype
HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation0
Pro-KD: Progressive Distillation by Following the Footsteps of the Teacher0
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation0
Open Domain Question Answering with A Unified Knowledge InterfaceCode1
Towards Transparent Interactive Semantic Parsing via Step-by-Step CorrectionCode0
BBQ: A Hand-Built Bias Benchmark for Question AnsweringCode1
Tracing Origins: Coreference-aware Machine Reading ComprehensionCode1
Attacking Open-domain Question Answering by Injecting MisinformationCode0
A Survey on State-of-the-art Techniques for Knowledge Graphs Construction and Challenges ahead0
MixQG: Neural Question Generation with Mixed Answer TypesCode1
CCQA: A New Web-Scale Question Answering Dataset for Model Pre-TrainingCode1
Can Explanations Be Useful for Calibrating Black Box Models?Code1
Retrieval-guided Counterfactual Generation for QA0
Cross-Lingual Open-Domain Question Answering with Answer Sentence Generation0
Open-Domain Question-Answering for COVID-19 and Other Emergent DomainsCode0
MMIU: Dataset for Visual Intent Understanding in Multimodal Assistants0
Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?Code1
Improving Users' Mental Model with Attention-directed Counterfactual Edits0
Systematic Inequalities in Language Technology Performance across the World's LanguagesCode0
ConditionalQA: A Complex Reading Comprehension Dataset with Conditional AnswersCode1
A Survey on Legal Question Answering Systems0
Mention Memory: incorporating textual knowledge into Transformers through entity mention attentionCode0
Attention-guided Generative Models for Extractive Question Answering0
Explainable Fact-checking through Question Answering0
Pano-AVQA: Grounded Audio-Visual Question Answering on 360^ VideosCode1
Show:102550
← PrevPage 251 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified