SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 68516900 of 10817 papers

TitleStatusHype
C-MORE: Pretraining to Answer Open-Domain Questions by Consulting Millions of References0
Challenges in Generalization in Open Domain Question Answering0
Explore before Moving: A Feasible Path Estimation and Memory Recalling Framework for Embodied Navigation0
Explainable Assessment of Healthcare Articles with QA0
Evaluation of Transfer Learning for Polish with a text-to-text model0
Echo-Attention: Attend Once and Get N Attentions for Free0
A Copy-Augmented Generative Model for Open-Domain Question Answering0
Ditch the Gold Standard: Re-evaluating Conversational Question Answering0
Learning Rich Representation of Keyphrases from Text0
xGQA: Cross-Lingual Visual Question Answering0
Cross-lingual Inference with A Chinese Entailment Graph0
Cooperative Semi-Supervised Transfer Learning of Machine Reading Comprehension0
Pro-KD: Progressive Distillation by Following the Footsteps of the Teacher0
HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation0
Semantic Search as Extractive Paraphrase Span Detection0
Towards Transparent Interactive Semantic Parsing via Step-by-Step CorrectionCode0
Attacking Open-domain Question Answering by Injecting MisinformationCode0
A Survey on State-of-the-art Techniques for Knowledge Graphs Construction and Challenges ahead0
Retrieval-guided Counterfactual Generation for QA0
Cross-Lingual Open-Domain Question Answering with Answer Sentence Generation0
Open-Domain Question-Answering for COVID-19 and Other Emergent DomainsCode0
MMIU: Dataset for Visual Intent Understanding in Multimodal Assistants0
Systematic Inequalities in Language Technology Performance across the World's LanguagesCode0
Improving Users' Mental Model with Attention-directed Counterfactual Edits0
Mention Memory: incorporating textual knowledge into Transformers through entity mention attentionCode0
Attention-guided Generative Models for Extractive Question Answering0
A Survey on Legal Question Answering Systems0
Explainable Fact-checking through Question Answering0
Beyond Accuracy: A Consolidated Tool for Visual Question Answering BenchmarkingCode0
AskMe: Joint Individual-level and Community-level Behavior Interaction for Question Recommendation0
Enhance Long Text Understanding via Distilled Gist Detector from Abstractive Summarization0
A Framework for Rationale Extraction for Deep QA models0
The Inductive Bias of In-Context Learning: Rethinking Pretraining Example Design0
KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question Answering0
Multi-tasking Dialogue Comprehension with Discourse ParsingCode0
A Comparative Study of Transformer-Based Language Models on Extractive Question Answering0
Noisy Text Data: Achilles' Heel of popular transformer based NLP models0
GNN is a Counter? Revisiting GNN for Question Answering0
COVIDRead: A Large-scale Question Answering Dataset on COVID-190
Perhaps PTLMs Should Go to School -- A Task to Assess Open Book and Closed Book QA0
Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering0
Asking questions on handwritten document collections0
Generating User-Centred Explanations via Illocutionary Question Answering: From Philosophy to InterfacesCode0
Perhaps PTLMs Should Go to School – A Task to Assess Open Book and Closed Book QA0
A Flexible and Extensible Framework for Multiple Answer Modes Question Answering0
Self-Attentive Constituency Parsing for UCCA-based Semantic Parsing0
Generating Justifications in a Spatial Question-Answering Dialogue System for a Blocks World0
Breaking Down Questions for Outside-Knowledge VQA0
Variational Disentangled Attention for Regularized Visual Dialog0
Measuring CLEVRness: Black-box Testing of Visual Reasoning Models0
Show:102550
← PrevPage 138 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified