Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1351–1400 of 10817 papers

Title	Date	Tasks	Status	Hype
Multimodality Representation Learning: A Survey on Evolution, Pretraining and Its Applications	Feb 1, 2023	Question AnsweringRepresentation Learning	CodeCode Available	1
PADL: Language-Directed Physics-Based Character Control	Jan 31, 2023	Image GenerationImitation Learning	CodeCode Available	1
Can an AI Win Ghana's National Science and Maths Quiz? An AI Grand Challenge for Education	Jan 30, 2023	MathPosition	CodeCode Available	1
Semantic Parsing for Conversational Question Answering over Knowledge Graphs	Jan 28, 2023	Conversational Question AnsweringKnowledge Graphs	CodeCode Available	1
A Comparative Study of Pretrained Language Models for Long Clinical Text	Jan 27, 2023	Clinical KnowledgeDocument Classification	CodeCode Available	1
ViDeBERTa: A powerful pre-trained language model for Vietnamese	Jan 25, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images	Jan 12, 2023	Evidence SelectionQuestion Answering	CodeCode Available	1
Multimodal Inverse Cloze Task for Knowledge-based Visual Question Answering	Jan 11, 2023	Question AnsweringReading Comprehension	CodeCode Available	1
Mind Reasoning Manners: Enhancing Type Perception for Generalized Zero-shot Logical Reasoning over Text	Jan 8, 2023	Contrastive LearningLogical Reasoning	CodeCode Available	1
SPRING: Situated Conversation Agent Pretrained with Multimodal Questions from Incremental Layout Graph	Jan 5, 2023	Question Answering	CodeCode Available	1
Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training	Jan 1, 2023	3D dense captioning3D visual grounding	CodeCode Available	1
VQACL: A Novel Visual Question Answering Continual Learning Setting	Jan 1, 2023	Continual LearningQuestion Answering	CodeCode Available	1
Variational Causal Inference Network for Explanatory Visual Question Answering	Jan 1, 2023	Explanation GenerationExplanatory Visual Question Answering	CodeCode Available	1
Rethinking with Retrieval: Faithful Large Language Model Inference	Dec 31, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
Large Language Models Encode Clinical Knowledge	Dec 26, 2022	Clinical KnowledgeMedQA	CodeCode Available	1
OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization	Dec 22, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
Parallel Context Windows for Large Language Models	Dec 21, 2022	In-Context LearningPlaying the Game of 2048	CodeCode Available	1
Are Deep Neural Networks SMARTer than Second Graders?	Dec 20, 2022	Language ModellingMeta-Learning	CodeCode Available	1
Optimization Techniques for Unsupervised Complex Table Reasoning via Self-Training Framework	Dec 20, 2022	Data AugmentationFact Verification	CodeCode Available	1
Evaluating Human-Language Model Interaction	Dec 19, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments	Dec 19, 2022	In-Context LearningKnowledge Base Question Answering	CodeCode Available	1
MIST: Multi-modal Iterative Spatial-Temporal Transformer for Long-form Video Question Answering	Dec 19, 2022	FormQuestion Answering	CodeCode Available	1
Visconde: Multi-document QA with GPT-3 and Neural Reranking	Dec 19, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model	Dec 18, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
Self-Prompting Large Language Models for Zero-Shot Open-Domain QA	Dec 16, 2022	In-Context LearningOpen-Domain Question Answering	CodeCode Available	1
Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation	Dec 16, 2022	Answer GenerationDecoder	CodeCode Available	1
Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models	Dec 15, 2022	AttributeQuestion Answering	CodeCode Available	1
APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning	Dec 14, 2022	Conversational Question AnsweringDiversity	CodeCode Available	1
VindLU: A Recipe for Effective Video-and-Language Pretraining	Dec 9, 2022	Question AnsweringRetrieval	CodeCode Available	1
Hierarchical multimodal transformers for Multi-Page DocVQA	Dec 7, 2022	DecoderQuestion Answering	CodeCode Available	1
Retrieval as Attention: End-to-end Learning of Retrieval and Reading within a Single Transformer	Dec 5, 2022	Open-Domain Question AnsweringPassage Retrieval	CodeCode Available	1
UniKGQA: Unified Retrieval and Reasoning for Solving Multi-hop Question Answering Over Knowledge Graph	Dec 2, 2022	Language ModellingMulti-hop Question Answering	CodeCode Available	1
Nonparametric Masked Language Modeling	Dec 2, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
Relation-Aware Language-Graph Transformer for Question Answering	Dec 2, 2022	Medical Question AnsweringMedQA	CodeCode Available	1
A Sequential Flow Control Framework for Multi-hop Knowledge Base Question Answering	Dec 1, 2022	Knowledge Base Question AnsweringQuestion Answering	CodeCode Available	1
Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning	Dec 1, 2022	Domain GeneralizationQuestion Answering	CodeCode Available	1
AIONER: All-in-one scheme-based biomedical named entity recognition using deep learning	Nov 30, 2022	AllMulti-Task Learning	CodeCode Available	1
CREPE: Open-Domain Question Answering with False Presuppositions	Nov 30, 2022	Open-Domain Question AnsweringQuestion Answering	CodeCode Available	1
Frustratingly Easy Label Projection for Cross-lingual Transfer	Nov 28, 2022	Cross-Lingual NERCross-Lingual Transfer	CodeCode Available	1
Self-supervised vision-language pretraining for Medical visual question answering	Nov 24, 2022	Contrastive LearningImage-text matching	CodeCode Available	1
Seeing What You Miss: Vision-Language Pre-training with Semantic Completion Learning	Nov 24, 2022	cross-modal alignmentImage-text Retrieval	CodeCode Available	1
Hengam: An Adversarially Trained Transformer for Persian Temporal Tagging	Nov 20, 2022	Information RetrievalNamed Entity Recognition (NER)	CodeCode Available	1
Visual Commonsense-aware Representation Network for Video Captioning	Nov 17, 2022	Caption GenerationQuestion Answering	CodeCode Available	1
I Can't Believe There's No Images! Learning Visual Tasks Using only Language Supervision	Nov 17, 2022	Image CaptioningQuestion Answering	CodeCode Available	1
MapQA: A Dataset for Question Answering on Choropleth Maps	Nov 15, 2022	ArticlesQuestion Answering	CodeCode Available	1
QAmeleon: Multilingual QA with Only 5 Examples	Nov 15, 2022	Few-Shot LearningQuestion Answering	CodeCode Available	1
PromptCap: Prompt-Guided Task-Aware Image Captioning	Nov 15, 2022	Image CaptioningLanguage Modelling	CodeCode Available	1
Large Language Models Struggle to Learn Long-Tail Knowledge	Nov 15, 2022	Entity LinkingQuestion Answering	CodeCode Available	1
Retrieval-Augmented Generative Question Answering for Event Argument Extraction	Nov 14, 2022	Event Argument ExtractionFew-Shot Learning	CodeCode Available	1
Mining Mathematical Documents for Question Answering via Unsupervised Formula Labeling	Nov 12, 2022	Entity LinkingKnowledge Graphs	CodeCode Available	1

Show:10 25 50

← PrevPage 28 of 217Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified