SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotpotQA, bAbI, TriviaQA, and WikiQA, among others. Question answering models are typically evaluated with exact match (EM) and F1 metrics. Recent top-performing models include T5 and XLNet.
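To make the EM and F1 metrics concrete, here is a minimal sketch of how they are commonly computed for extractive QA in the SQuAD style: answers are normalized (lowercased, punctuation and articles stripped), EM checks for an exact string match, and F1 measures token overlap between the predicted and gold answers. Function names here are illustrative, not from any particular library.

```python
import re
import string
from collections import Counter

def normalize(text):
    """Lowercase, strip punctuation and articles, collapse whitespace (SQuAD convention)."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(prediction, gold):
    """EM: 1 if the normalized strings are identical, else 0."""
    return int(normalize(prediction) == normalize(gold))

def f1_score(prediction, gold):
    """Token-level F1 between the normalized prediction and gold answer."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    common = Counter(pred_tokens) & Counter(gold_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)
```

In benchmark practice, both scores are averaged over the dataset (taking the maximum over multiple gold answers per question) and reported as percentages, which is the scale used in the leaderboard below.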

( Image credit: SQuAD )

Papers

Showing 8551–8575 of 10817 papers

Title | Status | Hype
Exploring the BERT Cross-Lingual Transferability: a Case Study in Reading Comprehension | — | 0
Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering | — | 0
Exploring the Effectiveness of GPT Models in Test-Taking: A Case Study of the Driver's License Knowledge Test | — | 0
Exploring the Effectiveness of Object-Centric Representations in Visual Question Answering: Comparative Insights with Foundation Models | — | 0
Exploring the Feasibility of ChatGPT for Event Extraction | — | 0
Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions | — | 0
Exploring the Ideal Depth of Neural Network when Predicting Question Deletion on Community Question Answering | — | 0
Exploring the Impact of Occupational Personas on Domain-Specific QA | — | 0
Exploring the Impact of Table-to-Text Methods on Augmenting LLM-based Question Answering with Domain Hybrid Data | — | 0
Exploring the In-context Learning Ability of Large Language Model for Biomedical Concept Linking | — | 0
Modeling Uncertainty and Using Post-fusion as Fallback Improves Retrieval Augmented Generation with LLMs | — | 0
Exploring The Landscape of Distributional Robustness for Question Answering Models | — | 0
Do Large Language Models have Shared Weaknesses in Medical Question Answering? | — | 0
Exploring the Role of Knowledge Graph-Based RAG in Japanese Medical Question Answering with Small-Scale LLMs | — | 0
Exploring The Spatial Reasoning Ability of Neural Models in Human IQ Tests | — | 0
Exploring the Viability of Synthetic Query Generation for Relevance Prediction | — | 0
How Additional Knowledge can Improve Natural Language Commonsense Question Answering? | — | 0
Exploring Weaknesses of VQA Models through Attribution Driven Insights | — | 0
EXPR at SemEval-2018 Task 9: A Combined Approach for Hypernym Discovery | — | 0
Extended Named Entity Recognition API and Its Applications in Language Education | — | 0
Extending Neural Question Answering with Linguistic Input Features | — | 0
Extending the Frontier of ChatGPT: Code Generation and Debugging | — | 0
EKTVQA: Generalized use of External Knowledge to empower Scene Text in Text-VQA | — | 0
Extracting Context-Rich Entailment Rules from Wikipedia Revision History | — | 0
Extracting Definitions and Hypernym Relations relying on Syntactic Dependencies and Support Vector Machines | — | 0
Page 343 of 433

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 | — | Unverified
2 | FPNet (ensemble) | EM | 90.87 | — | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 | — | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 | — | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 | — | Unverified
6 | FPNet (ensemble) | EM | 90.6 | — | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 | — | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | — | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | — | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | — | Unverified