SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 82018250 of 10817 papers

TitleStatusHype
CFO: A Framework for Building Production NLP Systems0
Learning Representations and Agents for Information Retrieval0
Abductive Commonsense ReasoningCode0
Multi-class Hierarchical Question Classification for Multiple Choice Science ExamsCode0
Integrating Multimodal Information in Large Pretrained TransformersCode0
A Multi-Type Multi-Span Network for Reading Comprehension that Requires Discrete ReasoningCode0
Reasoning-Driven Question-Answering for Natural Language Understanding0
Reactive Multi-Stage Feature Fusion for Multimodal Dialogue Modeling0
VideoNavQA: Bridging the Gap between Visual and Embodied Question AnsweringCode1
Towards Optimisation of Collaborative Question Answering over Knowledge Graphs0
SG-Net: Syntax-Guided Machine Reading ComprehensionCode0
Fusion of Detected Objects in Text for Visual Question Answering0
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding0
Variational Fusion for Multimodal Sentiment Analysis0
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA SystemCode0
Why Does a Visual Question Have Different Answers?0
AmazonQA: A Review-Based Question Answering TaskCode0
Multimodal Unified Attention Networks for Vision-and-Language Interactions0
Multi-modality Latent Interaction Network for Visual Question Answering0
A Generate-Validate Approach to Answering Questions about Qualitative Relationships0
TEQUILA: Temporal Question Answering over Knowledge Bases0
Question-Agnostic Attention for Visual Question Answering0
CRIC: A VQA Dataset for Compositional Reasoning on Vision and Commonsense0
Mitigating Noisy Inputs for Question Answering0
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language TasksCode1
Answering Questions about Data Visualizations using Efficient Bimodal FusionCode0
Exploring Neural Net Augmentation to BERT for Question Answering on SQUAD 2.00
Extending Neural Question Answering with Linguistic Input Features0
Simple and Effective Text Matching with Richer Alignment FeaturesCode0
ARS\_NITK at MEDIQA 2019:Analysing Various Methods for Natural Language Inference, Recognising Question Entailment and Medical Question Answering System0
ChiMed: A Chinese Medical Corpus for Question AnsweringCode0
Overview of the MEDIQA 2019 Shared Task on Textual Inference, Question Entailment and Question AnsweringCode1
A Paraphrase Generation System for EHR Question Answering0
ANU-CSIRO at MEDIQA 2019: Question Answering Using Deep Contextual Knowledge0
DUT-NLP at MEDIQA 2019: An Adversarial Multi-Task Network to Jointly Model Recognizing Question Entailment and Question Answering0
DUT-BIM at MEDIQA 2019: Utilizing Transformer Network and Medical Domain-Specific Contextualized Representations for Question Answering0
LasigeBioTM at MEDIQA 2019: Biomedical Question Answering using Bidirectional Transformers and Named Entity Recognition0
The Meaning of ``Most'' for Visual Question Answering Models0
Curio SmartChat : A system for Natural Language Question Answering for Self-Paced K-12 Learning0
IIT-KGP at MEDIQA 2019: Recognizing Question Entailment using Sci-BERT stacked with a Gradient Boosting Classifier0
UU\_TAILS at MEDIQA 2019: Learning Textual Entailment in the Medical Domain0
Predicting the Difficulty of Multiple Choice Questions in a High-stakes Medical Exam0
Amharic Question Answering for Biography, Definition, and Description Questions0
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering0
An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation0
LEAF-QA: Locate, Encode & Attend for Figure Question Answering0
Reinforced Dynamic Reasoning for Conversational Question GenerationCode0
ERNIE 2.0: A Continual Pre-training Framework for Language UnderstandingCode3
A mathematical model for universal semanticsCode0
Text-to-SQL Generation for Question Answering on Electronic Medical RecordsCode0
Show:102550
← PrevPage 165 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified