Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotpotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
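As a rough illustration of the EM and F1 metrics mentioned above, the sketch below scores a predicted answer string against a single reference answer. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles) and whitespace tokenization; the function names are illustrative and not taken from any particular evaluation script.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the normalization commonly used for
    SQuAD-style evaluation; other benchmarks may normalize differently.
    """
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("the Eiffel Tower", "Eiffel Tower"))  # 1.0
    print(f1_score("in Paris, France", "Paris"))            # 0.5
```

When a question has several reference answers, a common convention is to take the maximum score over the references and then average over all questions; leaderboard figures are usually that average expressed as a percentage.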


Papers


Showing 8526–8550 of 10817 papers

Title	Date	Tasks	Status
Explore the Hallucination on Low-level Perception for MLLMs	Sep 15, 2024	Hallucination, Question Answering	Unverified
Explore then Determine: A GNN-LLM Synergy Framework for Reasoning over Knowledge Graph	Jun 3, 2024	Knowledge Graphs, Multiple-choice	Unverified
Explore until Confident: Efficient Exploration for Embodied Question Answering	Mar 23, 2024	Conformal Prediction, Efficient Exploration	Unverified
Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison	Feb 20, 2025	Diversity, Language Modeling	Unverified
Exploring and Evaluating Personalized Models for Code Generation	Aug 29, 2022	Code Generation, Natural Language Understanding	Unverified
Exploring BERT Parameter Efficiency on the Stanford Question Answering Dataset v2.0	Feb 25, 2020	Question Answering	Unverified
Exploring Diverse Expressions for Paraphrase Generation	Nov 1, 2019	Diversity, Information Retrieval	Unverified
Exploring Diverse Methods in Visual Question Answering	Apr 21, 2024	Question Answering, Visual Question Answering	Unverified
Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control	Dec 24, 2024	Question Answering	Unverified
Exploring Fine-grained Entity Type Constraints for Distantly Supervised Relation Extraction	Aug 1, 2014	Natural Language Inference, Question Answering	Unverified
Exploring Fluent Query Reformulations with Text-to-Text Transformers and Reinforcement Learning	Dec 18, 2020	Intent Classification	Unverified
Exploring Graph-structured Passage Representation for Multi-hop Reading Comprehension with Graph Neural Networks	Sep 6, 2018	Multi-Hop Reading Comprehension, Question Answering	Unverified
Exploring Human-like Attention Supervision in Visual Question Answering	Sep 19, 2017	Question Answering, Visual Question Answering	Unverified
Exploring Hybrid Question Answering via Program-based Prompting	Feb 16, 2024	Code Generation, Question Answering	Unverified
Exploring In-Context Learning Capabilities of Foundation Models for Generating Knowledge Graphs from Text	May 15, 2023	Graph Construction, In-Context Learning	Unverified
Exploring Jailbreak Attacks on LLMs through Intent Concealment and Diversion	May 20, 2025	Question Answering, Text Generation	Unverified
Exploring Knowledge Boundaries in Large Language Models for Retrieval Judgment	Nov 9, 2024	Question Answering, RAG	Unverified
Explicit Utilization of General Knowledge in Machine Reading Comprehension	Sep 10, 2018	General Knowledge, Machine Reading Comprehension	Unverified
Exploring Markov Logic Networks for Question Answering	Sep 1, 2015	Question Answering	Unverified
Exploring Neural Net Augmentation to BERT for Question Answering on SQUAD 2.0	Aug 4, 2019	Language Modeling	Unverified
Exploring Question Decomposition for Zero-Shot VQA	Oct 25, 2023	Question Answering, Visual Question Answering	Unverified
Exploring Question Understanding and Adaptation in Neural-Network-Based Question Answering	Mar 14, 2017	Question Answering, Reading Comprehension	Unverified
Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA	Oct 13, 2023	Graph Learning, Object	Unverified
Exploring Spatial Language Grounding Through Referring Expressions	Feb 4, 2025	Image Captioning, Negation	Unverified
Exploring Temporal Vagueness with Mechanical Turk	Jul 1, 2012	Question Answering, Text Summarization	Unverified




Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.60	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified