SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
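
The page names EM and F1 but does not define them. The sketch below shows one common way to compute them for extractive question answering, loosely in the style of SQuAD-type evaluation. The normalization steps (lowercasing, stripping punctuation and English articles) and the function names `normalize_answer`, `exact_match`, and `f1_score` are assumptions for illustration; individual benchmarks may define the metrics differently.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors a common SQuAD-style normalization; other
    benchmarks may normalize differently.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower"))
    print(f1_score("Eiffel Tower in Paris", "the Eiffel Tower"))
```

When a question has several acceptable reference answers, a typical convention is to take the maximum score over the references and then average over all questions; leaderboard values such as EM 90.94 are usually that average expressed as a percentage.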


Papers

Showing 1901–1950 of 10817 papers

Title | Status | Hype
TRAR: Routing the Attention Spans in Transformer for Visual Question Answering | Code | 1
Pano-AVQA: Grounded Audio-Visual Question Answering on 360° Videos | Code | 1
Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images | Code | 1
BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla | Code | 1
AraGPT2: Pre-Trained Transformer for Arabic Language Generation | Code | 1
ECONET: Effective Continual Pretraining of Language Models for Event Temporal Reasoning | Code | 1
Improving BERT with Syntax-aware Local Attention | Code | 1
ERICA: Improving Entity and Relation Understanding for Pre-trained Language Models via Contrastive Learning | Code | 1
A Memory Efficient Baseline for Open Domain Question Answering | Code | 1
UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering | Code | 1
Detecting Hate Speech in Multi-modal Memes | Code | 1
Syntax-Enhanced Pre-trained Model | Code | 1
Explaining NLP Models via Minimal Contrastive Editing (MiCE) | Code | 1
LOREN: Logic-Regularized Reasoning for Interpretable Fact Verification | Code | 1
Overcoming Language Priors with Self-supervised Learning for Visual Question Answering | Code | 1
Clinical Temporal Relation Extraction with Probabilistic Soft Logic Regularization and Global Inference | Code | 1
Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation Embedding | Code | 1
ParsiNLU: A Suite of Language Understanding Challenges for Persian | Code | 1
Fusing Context Into Knowledge Graph for Commonsense Question Answering | Code | 1
Distilling Knowledge from Reader to Retriever for Question Answering | Code | 1
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption | Code | 1
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions | Code | 1
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding | Code | 1
How Can We Know When Language Models Know? On the Calibration of Language Models for Question Answering | Code | 1
SeMantic AnsweR Type prediction task (SMART) at ISWC 2020 Semantic Web Challenge | Code | 1
H-Mem: Harnessing synaptic plasticity with Hebbian Memory Networks | Code | 1
Just Ask: Learning to Answer Questions from Millions of Narrated Videos | Code | 1
Point and Ask: Incorporating Pointing into Visual Question Answering | Code | 1
Answering Ambiguous Questions through Generative Evidence Fusion and Round-Trip Prediction | Code | 1
XTQA: Span-Level Explanations of the Textbook Question Answering | Code | 1
Large Scale Multimodal Classification Using an Ensemble of Transformer Models and Co-Attention | Code | 1
LRTA: A Transparent Neural-Symbolic Reasoning Framework with Modular Supervision for Visual Question Answering | Code | 1
EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications | Code | 1
Learning Associative Inference Using Fast Weight Memory | Code | 1
Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases | Code | 1
NLPGym -- A toolkit for evaluating RL agents on Natural Language Processing Tasks | Code | 1
Utilizing Bidirectional Encoder Representations from Transformers for Answer Selection | Code | 1
VisBERT: Hidden-State Visualizations for Transformers | Code | 1
Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering | Code | 1
Disentangling 3D Prototypical Networks For Few-Shot Concept Learning | Code | 1
Context-Aware Answer Extraction in Question Answering | Code | 1
EXAMS: A Multi-Subject High School Examinations Dataset for Cross-Lingual and Multilingual Question Answering | Code | 1
CharBERT: Character-aware Pre-trained Language Model | Code | 1
Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps | Code | 1
Learning to Contrast the Counterfactual Samples for Robust Visual Question Answering | Code | 1
The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification | Code | 1
ConceptBert: Concept-Aware Representation for Visual Question Answering | Code | 1
Question Answering with Long Multiple-Span Answers | Code | 1
CliniQG4QA: Generating Diverse Questions for Domain Adaptation of Clinical Question Answering | Code | 1
Few-Shot Complex Knowledge Base Question Answering via Meta Reinforcement Learning | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 | | Unverified
2 | FPNet (ensemble) | EM | 90.87 | | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified
6 | FPNet (ensemble) | EM | 90.6 | | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified