SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 51015150 of 10817 papers

TitleStatusHype
CAVE: Correcting Attribute Values in E-commerce ProfilesCode0
PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in FinanceCode1
Plug-and-Play VQA: Zero-shot VQA by Conjoining Large Pretrained Models with Zero TrainingCode0
Adversarial and Safely Scaled Question Generation0
Deep Bidirectional Language-Knowledge Graph PretrainingCode2
ReasonChainQA: Text-based Complex Question Answering with Explainable Evidence Chains0
Vision-Language Pre-training: Basics, Recent Advances, and Future TrendsCode3
RARR: Researching and Revising What Language Models Say, Using Language ModelsCode1
Answer ranking in Community Question Answering: a deep learning approach0
Video in 10 Bits: Few-Bit VideoQA for Efficiency and PrivacyCode2
UniRPG: Unified Discrete Reasoning over Table and Text as Program GenerationCode1
Can Language Representation Models Think in Bets?0
MICO: A Multi-alternative Contrastive Learning Framework for Commonsense Knowledge RepresentationCode1
SQA3D: Situated Question Answering in 3D ScenesCode1
"John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of FeasibilityCode0
TweetNERD -- End to End Entity Linking Benchmark for TweetsCode0
ConEntail: An Entailment-based Framework for Universal Zero and Few Shot Classification with Supervised Contrastive PretrainingCode0
SODAPOP: Open-Ended Discovery of Social Biases in Social Commonsense Reasoning ModelsCode0
Towards End-to-End Open Conversational Machine ReadingCode0
Closed-book Question Generation via Contrastive LearningCode0
MAPL: Parameter-Efficient Adaptation of Unimodal Pre-Trained Models for Vision-Language Few-Shot PromptingCode1
Benchmarking Long-tail Generalization with Likelihood SplitsCode0
Overview of BioASQ 2022: The tenth BioASQ challenge on Large-Scale Biomedical Semantic Indexing and Question Answering0
Shortcomings of Question Answering Based Factuality Frameworks for Error LocalizationCode0
Challenges in Explanation Quality Evaluation0
Towards a Unified Multi-Dimensional Evaluator for Text GenerationCode2
OpenCQA: Open-ended Question Answering with ChartsCode1
Improving Question Answering with Generation of NQ-like Questions0
Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive LearningCode2
CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm0
Are Sample-Efficient NLP Models More Robust?0
SlotFormer: Unsupervised Visual Dynamics Simulation with Object-Centric ModelsCode1
Task Compass: Scaling Multi-task Pre-training with Task PrefixCode1
Relational Graph Convolutional Neural Networks for Multihop Reasoning: A Comparative Study0
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document UnderstandingCode1
Question Answering Over Biological Knowledge Graph via Amazon Alexa0
Discourse Analysis via Questions and Answers: Parsing Dependency Structures of Questions Under DiscussionCode1
Probing Commonsense Knowledge in Pre-trained Language Models with Sense-level Precision and Expanded VocabularyCode0
TwiRGCN: Temporally Weighted Graph Convolution for Question Answering over Temporal Knowledge Graphs0
Context Generation Improves Open Domain Question Answering0
Decoupled Context Processing for Context Augmented Language Modeling0
CSS: Combining Self-training and Self-supervised Learning for Few-shot Dialogue State Tracking0
Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question AnsweringCode1
Capturing Global Structural Information in Long Document Question Answering with Compressive Graph Selector NetworkCode0
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training ModelCode1
Mixed-modality Representation Learning and Pre-training for Joint Table-and-Text Retrieval in OpenQACode1
How Well Do Multi-hop Reading Comprehension Models Understand Date Information?Code0
Contrastive Video-Language Learning with Fine-grained Frame Sampling0
Semantic Framework based Query Generation for Temporal Question Answering over Knowledge Graphs0
Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQACode1
Show:102550
← PrevPage 103 of 217Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified