SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 48264850 of 10817 papers

TitleStatusHype
Towards Answering Climate Questionnaires from Unstructured Climate ReportsCode0
Recommending Root-Cause and Mitigation Steps for Cloud Incidents using Large Language Models0
Language Models sounds the Death Knell of Knowledge Graphs0
There is No Big Brother or Small Brother: Knowledge Infusion in Language Models for Link Prediction and Question AnsweringCode0
MAQA: A Multimodal QA Benchmark for Negation0
Mind Reasoning Manners: Enhancing Type Perception for Generalized Zero-shot Logical Reasoning over TextCode1
A Brain-inspired Memory Transformation based Differentiable Neural Computer for Reasoning-based Question Answering0
Knowledge Reasoning via Jointly Modeling Knowledge Graphs and Soft Rules0
RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm0
Adaptively Clustering Neighbor Elements for Image-Text GenerationCode0
Topic Segmentation Model Focusing on Local Context0
Emotion-Cause Pair Extraction as Question Answering0
SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout GraphCode1
Learning Trajectory-Word Alignments for Video-Language Tasks0
PIE-QG: Paraphrased Information Extraction for Unsupervised Question Generation from Small Corpora0
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-ShotCode4
Exploring Temporal Concurrency for Video-Language Representation LearningCode0
Variational Causal Inference Network for Explanatory Visual Question AnsweringCode1
PromptCap: Prompt-Guided Image Captioning for VQA with GPT-30
Knowledge Proxy Intervention for Deconfounded Video Question Answering0
Toward Multi-Granularity Decision-Making: Explicit Visual Reasoning with Hierarchical KnowledgeCode0
Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question Answering0
IS-GGT: Iterative Scene Graph Generation With Generative Transformers0
From Images to Textual Prompts: Zero-Shot Visual Question Answering With Frozen Large Language Models0
Exploring the Effect of Primitives for Compositional Generalization in Vision-and-LanguageCode0
Show:102550
← PrevPage 194 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified