SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models are typically evaluated with metrics such as exact match (EM) and F1. Recent top-performing models include T5 and XLNet.

(Image credit: SQuAD)
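As a rough illustration of these metrics, below is a minimal Python sketch of SQuAD-style EM and F1 scoring. The normalization steps and function names are assumptions that mirror the commonly used evaluation recipe; the official SQuAD script additionally takes the maximum score over multiple reference answers.

import re
import string
from collections import Counter

def normalize(text):
    # SQuAD-style normalization: lowercase, drop punctuation, drop articles, collapse whitespace.
    text = text.lower()
    text = "".join(ch for ch in text if ch not in set(string.punctuation))
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())

def exact_match(prediction, gold):
    # EM: 1 if the normalized prediction equals the normalized gold answer, else 0.
    return int(normalize(prediction) == normalize(gold))

def f1_score(prediction, gold):
    # Token-level F1 over the normalized prediction and gold answer.
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    common = Counter(pred_tokens) & Counter(gold_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)

print(exact_match("the Eiffel Tower", "Eiffel Tower"))              # 1 (articles are stripped)
print(round(f1_score("Eiffel Tower in Paris", "Eiffel Tower"), 2))  # 0.67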

Papers

Showing 3221–3230 of 10817 papers

Title | Status | Hype
Empirical Methods for the Study of Denotation in Nominalizations in Spanish | | 0
Empirical Risk Minimization for Probabilistic Grammars: Sample Complexity and Hardness of Learning | | 0
A Novel Computational and Modeling Foundation for Automatic Coherence Assessment | | 0
Empirical Study on Deep Learning Models for Question Answering | | 0
Employing Text Matching Network to Recognise Nuclearity in Chinese Discourse | | 0
A Frustratingly Easy Improvement for Position Embeddings via Random Padding | | 0
DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning | | 0
Empowering Language Models with Active Inquiry for Deeper Understanding | | 0
Empowering Language Models with Knowledge Graph Reasoning for Question Answering | | 0
Bilingually-constrained Synthetic Data for Implicit Discourse Relation Recognition | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 | - | Unverified
2 | FPNet (ensemble) | EM | 90.87 | - | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 | - | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 | - | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 | - | Unverified
6 | FPNet (ensemble) | EM | 90.6 | - | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 | - | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | - | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | - | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | - | Unverified