SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 62516300 of 10817 papers

TitleStatusHype
HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation0
Open Domain Question Answering with A Unified Knowledge InterfaceCode1
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation0
Pro-KD: Progressive Distillation by Following the Footsteps of the Teacher0
Towards Transparent Interactive Semantic Parsing via Step-by-Step CorrectionCode0
Tracing Origins: Coreference-aware Machine Reading ComprehensionCode1
MixQG: Neural Question Generation with Mixed Answer TypesCode1
BBQ: A Hand-Built Bias Benchmark for Question AnsweringCode1
Attacking Open-domain Question Answering by Injecting MisinformationCode0
A Survey on State-of-the-art Techniques for Knowledge Graphs Construction and Challenges ahead0
CCQA: A New Web-Scale Question Answering Dataset for Model Pre-TrainingCode1
Can Explanations Be Useful for Calibrating Black Box Models?Code1
Retrieval-guided Counterfactual Generation for QA0
Cross-Lingual Open-Domain Question Answering with Answer Sentence Generation0
Open-Domain Question-Answering for COVID-19 and Other Emergent DomainsCode0
Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?Code1
Systematic Inequalities in Language Technology Performance across the World's LanguagesCode0
MMIU: Dataset for Visual Intent Understanding in Multimodal Assistants0
Improving Users' Mental Model with Attention-directed Counterfactual Edits0
ConditionalQA: A Complex Reading Comprehension Dataset with Conditional AnswersCode1
A Survey on Legal Question Answering Systems0
Mention Memory: incorporating textual knowledge into Transformers through entity mention attentionCode0
Attention-guided Generative Models for Extractive Question Answering0
Explainable Fact-checking through Question Answering0
Beyond Accuracy: A Consolidated Tool for Visual Question Answering BenchmarkingCode0
Pano-AVQA: Grounded Audio-Visual Question Answering on 360^ VideosCode1
AskMe: Joint Individual-level and Community-level Behavior Interaction for Question Recommendation0
What Makes Sentences Semantically Related: A Textual Relatedness Dataset and Empirical StudyCode1
Enhance Long Text Understanding via Distilled Gist Detector from Abstractive Summarization0
Distantly-Supervised Evidence Retrieval Enables Question Answering without Evidence AnnotationCode1
The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design0
A Framework for Rationale Extraction for Deep QA models0
A Few More Examples May Be Worth Billions of ParametersCode1
KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question Answering0
Multi-tasking Dialogue Comprehension with Discourse ParsingCode0
A Comparative Study of Transformer-Based Language Models on Extractive Question Answering0
Towards Continual Knowledge Learning of Language ModelsCode1
Noisy Text Data: Achilles' Heel of popular transformer based NLP models0
GNN is a Counter? Revisiting GNN for Question Answering0
Coarse-to-Fine Reasoning for Visual Question AnsweringCode1
COVIDRead: A Large-scale Question Answering Dataset on COVID-190
EntQA: Entity Linking as Question AnsweringCode1
Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering0
Perhaps PTLMs Should Go to School -- A Task to Assess Open Book and Closed Book QA0
Counterfactual Samples Synthesizing and Training for Robust Visual Question AnsweringCode1
Asking questions on handwritten document collections0
TopiOCQA: Open-domain Conversational Question Answering with Topic SwitchingCode1
Generating User-Centred Explanations via Illocutionary Question Answering: From Philosophy to InterfacesCode0
Perhaps PTLMs Should Go to School – A Task to Assess Open Book and Closed Book QA0
The Spoon Is in the Sink: Assisting Visually Impaired People in the KitchenCode1
Show:102550
← PrevPage 126 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified