SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 68016850 of 10817 papers

TitleStatusHype
Are Multilingual BERT models robust? A Case Study on Adversarial Attacks for Multilingual Question Answering0
Time-Stamped Language Model: Teaching Language Models to Understand the Flow of EventsCode1
UIT-E10dot3 at SemEval-2021 Task 5: Toxic Spans Detection with Named Entity Recognition and Question-Answering Approaches0
Toward Deconfounding the Influence of Entity Demographics for Question Answering Accuracy0
Designing a Minimal Retrieve-and-Read System for Open-Domain Question AnsweringCode1
TransferNet: An Effective and Transparent Framework for Multi-hop Question Answering over Relation GraphCode1
How to Train BERT with an Academic BudgetCode1
Sequence tagging for biomedical extractive question answeringCode1
TWEAC: Transformer with Extendable QA Agent ClassifiersCode1
Event Detection as Question Answering with Entity InformationCode0
Jointly Learning Truth-Conditional Denotations and Groundings using Parallel Attention0
What's in your Head? Emergent Behaviour in Multi-Task Transformer Models0
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over ImagesCode0
QA-GNN: Reasoning with Language Models and Knowledge Graphs for Question AnsweringCode1
Multi-Step Reasoning Over Unstructured Text with Beam Dense RetrievalCode1
Structural analysis of an all-purpose question answering model0
MultiModalQA: Complex Question Answering over Text, Tables and Images0
Neuro-Symbolic VQA: A review from the perspective of AGI desiderata0
A Replication Study of Dense Passage RetrieverCode2
SpartQA: : A Textual Question Answering Benchmark for Spatial ReasoningCode1
Contextualized Knowledge-aware Attentive Neural Network: Enhancing Answer Selection with Knowledge0
Object-Centric Representation Learning for Video Question Answering0
Innovative Bert-based Reranking Language Models for Speech Recognition0
Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt CollectionsCode1
TransWiC at SemEval-2021 Task 2: Transformer-based Multilingual and Cross-lingual Word-in-Context Disambiguation0
UPB at SemEval-2021 Task 8: Extracting Semantic Information on Measurements as Multi-Turn Question Answering0
FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation FrameworkCode0
PQA: Perceptual Question AnsweringCode0
Video Question Answering with Phrases via Semantic Roles0
How Transferable are Reasoning Patterns in VQA?0
Multimodal Continuous Visual Attention Mechanisms0
Distantly Supervised Transformers For E-Commerce Product QA0
A Question-answering Based Framework for Relation Extraction Validation0
Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question AnsweringCode1
CBench: Towards Better Evaluation of Question Answering Over Knowledge GraphsCode1
Paired Examples as Indirect Supervision in Latent Decision Models0
Compressing Visual-linguistic Model via Knowledge Distillation0
Conversational Question Answering over Knowledge Graphs with Transformer and Graph Attention NetworksCode1
ASPER: Attention-based Approach to Extract Syntactic Patterns denoting Semantic Relations in Sentential Context0
MMBERT: Multimodal BERT Pretraining for Improved Medical VQACode1
Type Prediction Systems0
VisQA: X-raying Vision and Language Reasoning in TransformersCode1
A View From the Crowd: Evaluation Challenges for Time-Offset Interaction Applications0
Exploring Neural Language Models via Analysis of Local and Global Self-Attention SpacesCode0
Automatically Generating Cause-and-Effect Questions from PassagesCode1
MultiReQA: A Cross-Domain Evaluation forRetrieval Question Answering ModelsCode1
NLQuAD: A Non-Factoid Long Question Answering Data SetCode1
Retrieval, Re-ranking and Multi-task Learning for Knowledge-Base Question Answering0
Structural Encoding and Pre-training Matter: Adapting BERT for Table-Based Fact Verification0
Knowledge Base Question Answering through Recursive Hypergraphs0
Show:102550
← PrevPage 137 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified