Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10076–10100 of 10817 papers

Title	Date	Tasks	Status
Similarity-Based Reconstruction Loss for Meaning Representation	Oct 1, 2018	Dialogue GenerationDiversity	—Unverified
Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction	Jun 28, 2022	AttributeAttribute Extraction	—Unverified
Simple and Effective Semi-Supervised Question Answering	Apr 2, 2018	Extractive Question-AnsweringQuestion Answering	—Unverified
Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval	Nov 1, 2021	Open-Domain Question AnsweringPassage Retrieval	—Unverified
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps	Dec 9, 2020	DecoderImage Captioning	—Unverified
Simple Large-scale Relation Extraction from Unstructured Text	Mar 24, 2018	Question AnsweringRelation	—Unverified
SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving	Jul 31, 2024	Autonomous DrivingLanguage Modeling	—Unverified
Simple or Complex? Classifying Questions by Answering Complexity	Dec 1, 2012	Question Answering	—Unverified
Simple Question Answering by Attentive Convolutional Neural Network	Jun 10, 2016	Entity LinkingFact Selection	—Unverified
Simple Question Answering with Subgraph Ranking and Joint-Scoring	Apr 4, 2019	Fact SelectionQuestion Answering	—Unverified
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models	Feb 18, 2025	Image ComprehensionQuestion Answering	—Unverified
Simple yet Effective Bridge Reasoning for Open-Domain Multi-Hop Question Answering	Sep 17, 2019	Information RetrievalMulti-hop Question Answering	—Unverified
Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases	Apr 8, 2025	Data IntegrationLanguage Modeling	—Unverified
Simplifying Sparse Expert Recommendation by Revisiting Graph Diffusion	Aug 4, 2022	Community Question AnsweringQuestion Answering	—Unverified
SimpsonsVQA: Enhancing Inquiry-Based Learning with a Tailored Dataset	Oct 30, 2024	Question AnsweringVisual Question Answering	—Unverified
SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains	Oct 23, 2024	Domain AdaptationInstruction Following	—Unverified
SimsterQ: A Similarity based Clustering Approach to Opinion Question Answering	Jul 1, 2020	ClusteringQuestion Answering	—Unverified
Simulating Bandit Learning from User Feedback for Extractive Question Answering	Nov 16, 2021	Extractive Question-AnsweringQuestion Answering	—Unverified
SimVQA: Exploring Simulated Environments for Visual Question Answering	Mar 31, 2022	Data AugmentationDiversity	—Unverified
Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization	May 21, 2025	Open-Domain Question AnsweringQuestion Answering	—Unverified
Single-Modal Entropy based Active Learning for Visual Question Answering	Oct 21, 2021	Active LearningQuestion Answering	—Unverified
Single Training Dimension Selection for Word Embedding with PCA	Aug 30, 2019	Question AnsweringSentiment Analysis	—Unverified
SIPDO: Closed-Loop Prompt Optimization via Synthetic Data Feedback	May 26, 2025	Prompt LearningQuestion Answering	—Unverified
SIRIUS-LTG: An Entity Linking Approach to Fact Extraction and Verification	Nov 1, 2018	Entity LinkingInformation Retrieval	—Unverified
SITE: towards Spatial Intelligence Thorough Evaluation	May 8, 2025	Question AnsweringSpatial Reasoning	—Unverified

Show:10 25 50

← PrevPage 404 of 433Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified