Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
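As a rough illustration of the EM and F1 metrics mentioned above, here is a minimal sketch in the style of SQuAD scoring. The function names and the normalization steps (lowercasing, dropping punctuation and English articles) are assumptions made for this example, not the official evaluation script of any dataset.

```python
import re
import string
from collections import Counter

# Illustrative helpers; the names and normalization rules are assumptions.
_PUNCTUATION = set(string.punctuation)


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in _PUNCTUATION)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    if not pred_tokens or not ref_tokens:
        # If either side is empty, score 1.0 only when both are empty.
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

Leaderboard figures are usually these per-question scores averaged over the evaluation set and expressed as percentages; when a question has several reference answers, a common convention is to take the maximum score over the references.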


Papers


Showing 10026–10050 of 10817 papers

Title	Date	Tasks	Status
Semi-Supervised Answer Extraction from Discussion Forums	Oct 1, 2013	Community Question Answering, Question Answering	Unverified
Semi-Supervised Disfluency Detection	Aug 1, 2018	Generative Adversarial Network, Machine Translation	Unverified
Semi-Supervised QA with Generative Domain-Adaptive Nets	Feb 7, 2017	Domain Adaptation, Question Answering	Unverified
Semi-supervised Training Data Generation for Multilingual Question Answering	May 1, 2018	Machine Translation, Named Entity Recognition (NER)	Unverified
SemPool: Simple, robust, and interpretable KG pooling for enhancing language models	Feb 3, 2024	Question Answering	Unverified
SemR-11: A Multi-Lingual Gold-Standard for Semantic Similarity and Relatedness for Eleven Languages	May 1, 2018	Information Retrieval, Machine Translation	Unverified
Sense and Similarity: A Study of Sense-level Similarity Measures	Aug 1, 2014	Information Retrieval, Question Answering	Unverified
Sensi-BERT: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient BERT	Jul 14, 2023	QNLI, QQP	Unverified
Sensor2Text: Enabling Natural Language Interactions for Daily Activity Tracking Using Wearable Sensors	Oct 26, 2024	Question Answering, Transfer Learning	Unverified
SensorChat: Answering Qualitative and Quantitative Questions during Long-Term Multimodal Sensor Interactions	Feb 5, 2025	Quantization, Question Answering	Unverified
Sentence Alignment using Unfolding Recursive Autoencoders	Aug 1, 2017	Information Retrieval, Machine Translation	Unverified
Sentence Attention Blocks for Answer Grounding	Sep 20, 2023	Question Answering, Sentence	Unverified
Sentence Dependency Tagging in Online Question Answering Forums	Jul 1, 2012	Question Answering, Sentence	Unverified
Sentence Extraction-Based Machine Reading Comprehension for Vietnamese	May 19, 2021	Articles, Machine Reading Comprehension	Unverified
Sentences as connection paths: A neural language architecture of sentence structure in the brain	May 19, 2022	Question Answering, Sentence	Unverified
Sentential Paraphrase Generation for Agglutinative Languages Using SVM with a String Kernel	Dec 1, 2014	Document Summarization, Machine Translation	Unverified
SentiHood: Targeted Aspect Based Sentiment Analysis Dataset for Urban Neighbourhoods	Oct 12, 2016	Aspect-Based Sentiment Analysis, Aspect-Based Sentiment Analysis (ABSA)	Unverified
Sentiment and Belief: How to Think about, Represent, and Annotate Private States	Jul 1, 2015	Opinion Mining, Question Answering	Unverified
Sentiment Classification towards Question-Answering with Hierarchical Matching Network	Oct 1, 2018	Classification, General Classification	Unverified
senti.ue-en: an approach for informally written short texts in SemEval-2013 Sentiment Analysis task	Jun 1, 2013	Question Answering, Sentiment Analysis	Unverified
Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering	Jan 1, 2025	Multiple-choice, Question Answering	Unverified
Seq2seq for Morphological Reinflection: When Deep Learning Fails	Aug 1, 2017	Deep Learning, Machine Translation	Unverified
Is the House Ready For Sleeptime? Generating and Evaluating Situational Queries for Embodied Question Answering	May 8, 2024	2k, Embodied Question Answering	Unverified
Sequence-to-Sequence Knowledge Graph Completion and Question Answering	Nov 16, 2021	Decoder, Graph Embedding	Unverified
Sequence-to-Sequence Learning on Keywords for Efficient FAQ Retrieval	Aug 23, 2021	Keyword Extraction, Question Answering	Unverified



Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified