Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3571–3580 of 10817 papers

Title	Date	Tasks	Status	Score
Improving Retrieval Augmented Open-Domain Question-Answering with Vectorized Contexts	Apr 2, 2024	In-Context LearningLanguage Modeling	CodeCode Available	5
Improving the Cross-Lingual Generalisation in Visual Question Answering	Sep 7, 2022	Cross-Lingual TransferQuestion Answering	CodeCode Available	5
Selective Question Answering under Domain Shift	Jun 16, 2020	Question Answering	CodeCode Available	5
Selective Token Generation for Few-shot Natural Language Generation	Sep 17, 2022	Data-to-Text GenerationLanguage Modeling	CodeCode Available	5
Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts	Nov 15, 2023	Question AnsweringSentence	CodeCode Available	5
Improving Quality of a Post's Set of Answers in Stack Overflow	May 30, 2020	Community Question AnsweringQuestion Answering	CodeCode Available	5
Attend What You Need: Motion-Appearance Synergistic Networks for Video Question Answering	Jun 19, 2021	AI AgentQuestion Answering	CodeCode Available	5
Expeditious Generation of Knowledge Graph Embeddings	Mar 21, 2018	Graph EmbeddingKnowledge Graph Completion	CodeCode Available	5
Improving Question Answering over Incomplete KBs with Knowledge-Aware Reader	May 17, 2019	Question Answering	CodeCode Available	5
Core Tokensets for Data-efficient Sequential Training of Transformers	Oct 8, 2024	Image Captioningimage-classification	CodeCode Available	5

Show:10 25 50

← PrevPage 358 of 1082Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified