Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5176–5200 of 10817 papers

Title	Date	Tasks	Status	Hype
Emotion Twenty Questions Dialog System for Lexical Emotional Intelligence	Oct 5, 2022	Emotional IntelligenceQuestion Answering	CodeCode Available	0
Honest Students from Untrusted Teachers: Learning an Interpretable Question-Answering Pipeline from a Pretrained Language Model	Oct 5, 2022	In-Context LearningLanguage Modeling	—Unverified	0
Locate before Answering: Answer Guided Question Localization for Video Question Answering	Oct 5, 2022	Question AnsweringVideo Question Answering	—Unverified	0
Large Language Models are Pretty Good Zero-Shot Video Game Bug Detectors	Oct 5, 2022	Common Sense ReasoningLanguage Modelling	CodeCode Available	1
Towards Improving Faithfulness in Abstractive Summarization	Oct 4, 2022	Abstractive Text SummarizationDecoder	CodeCode Available	1
Detect, Retrieve, Comprehend: A Flexible Framework for Zero-Shot Document-Level Question Answering	Oct 4, 2022	Question AnsweringRetrieval	—Unverified	0
Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering	Oct 4, 2022	Question Answering	CodeCode Available	1
Mining Duplicate Questions of Stack Overflow	Oct 4, 2022	Community Question AnsweringQuestion Answering	—Unverified	0
Recitation-Augmented Language Models	Oct 4, 2022	Natural QuestionsQuestion Answering	CodeCode Available	1
Transformer-based Subject Entity Detection in Wikipedia Listings	Oct 4, 2022	Knowledge GraphsQuestion Answering	—Unverified	0
When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment	Oct 4, 2022	Language ModellingLarge Language Model	CodeCode Available	1
Russian Web Tables: A Public Corpus of Web Tables for Russian Language Based on Wikipedia	Oct 3, 2022	Knowledge Base ConstructionManagement	CodeCode Available	0
Extending Compositional Attention Networks for Social Reasoning in Videos	Oct 3, 2022	Question AnsweringVideo Question Answering	CodeCode Available	0
Understanding Prior Bias and Choice Paralysis in Transformer-based Language Representation Models through Four Experimental Probes	Oct 3, 2022	Decision MakingMultiple-choice	—Unverified	0
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought	Oct 3, 2022	Mathematical ReasoningQuestion Answering	CodeCode Available	3
How Relevant is Selective Memory Population in Lifelong Language Learning?	Oct 3, 2022	Lifelong learningQuestion Answering	—Unverified	0
Findings of the VarDial Evaluation Campaign 2022	Oct 1, 2022	Dialect IdentificationExtractive Question-Answering	CodeCode Available	0
On the Effects of Video Grounding on Language Models	Oct 1, 2022	Image CaptioningQuestion Answering	—Unverified	0
Evaluating Coreference Resolvers on Community-based Question Answering: From Rule-based to State of the Art	Oct 1, 2022	Answer Selectioncoreference-resolution	CodeCode Available	0
HaleLab_NITK@SMM4H’22: Adaptive Learning Model for Effective Detection, Extraction and Normalization of Adverse Drug Events from Social Media Data	Oct 1, 2022	Question Answering	CodeCode Available	0
CMQA: A Dataset of Conditional Question Answering with Multiple-Span Answers	Oct 1, 2022	Question Answering	CodeCode Available	0
Focus on FoCus: Is FoCus focused on Context, Knowledge and Persona?	Oct 1, 2022	Dialogue GenerationQuestion Answering	—Unverified	0
ArT: All-round Thinker for Unsupervised Commonsense Question Answering	Oct 1, 2022	AllQuestion Answering	CodeCode Available	0
Aligning Multilingual Embeddings for Improved Code-switched Natural Language Understanding	Oct 1, 2022	named-entity-recognitionNamed Entity Recognition	CodeCode Available	0
Are Visual-Linguistic Models Commonsense Knowledge Bases?	Oct 1, 2022	Natural Language UnderstandingQuestion Answering	CodeCode Available	0

Show:10 25 50

← PrevPage 208 of 433Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified