Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Recent top-performing models include T5 and XLNet.
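As a rough illustration of the EM and F1 metrics mentioned above, the sketch below scores one predicted answer against one reference answer. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles) and token-level overlap for F1; the function names are illustrative and are not taken from any particular benchmark's official scorer.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the commonly used SQuAD-style normalization.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    prediction = "the Eiffel Tower in Paris"
    reference = "Eiffel Tower"
    print("EM:", exact_match(prediction, reference))
    print("F1:", round(f1_score(prediction, reference), 4))
```

When a question has several reference answers, a common convention is to take the maximum score over the references and then average over all questions; leaderboards usually report both numbers as percentages.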


Papers


Showing 5601–5650 of 10817 papers

Title	Date	Tasks	Status
Semantic-aware Dynamic Retrospective-Prospective Reasoning for Event-level Video Question Answering	May 14, 2023	Question Answering, Semantic Role Labeling	Unverified
Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering	May 14, 2023	Explanation Generation, Question Answering	Unverified
Learning to Generalize for Cross-domain QA	May 14, 2023	Data Augmentation, Domain Generalization	Code Available
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples	May 13, 2023	Data Augmentation, Natural Language Inference	Code Available
HPE: Answering Complex Questions over Text by Hybrid Question Parsing and Execution	May 12, 2023	Knowledge Graphs, Question Answering	Unverified
Improving Small Language Models on PubMedQA via Generative Data Augmentation	May 12, 2023	Data Augmentation, Question Answering	Unverified
Implications of Deep Circuits in Improving Quality of Quantum Question Answering	May 12, 2023	Quantum Machine Learning, Question Answering	Unverified
When Giant Language Brains Just Aren't Enough! Domain Pizzazz with Knowledge Sparkle Dust	May 12, 2023	Domain Adaptation, Question Answering	Unverified
A Memory Model for Question Answering from Streaming Data Supported by Rehearsal and Anticipation of Coreference Information	May 12, 2023	Memorization, Question Answering	Unverified
Overinformative Question Answering by Humans and Machines	May 11, 2023	Question Answering	Unverified
Think Twice: Measuring the Efficiency of Eliminating Prediction Shortcuts of Question Answering Models	May 11, 2023	Question Answering	Code Available
Long-Tailed Question Answering in an Open World	May 11, 2023	Knowledge Distillation, Language Modelling	Unverified
Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks	May 10, 2023	Binary Classification, named-entity-recognition	Unverified
Multi-hop Commonsense Knowledge Injection Framework for Zero-Shot Commonsense Question Answering	May 10, 2023	Contrastive Learning, Knowledge Graphs	Unverified
A Glimpse in ChatGPT Capabilities and its impact for AI research	May 10, 2023	Question Answering, Text Generation	Unverified
Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense Knowledge	May 10, 2023	Language Modeling, Language Modelling	Code Available
Unsupervised Dense Retrieval Training with Web Anchors	May 10, 2023	Contrastive Learning, Question Answering	Code Available
MAUPQA: Massive Automatically-created Polish Question Answering Dataset	May 9, 2023	Open-Domain Question Answering, Passage Retrieval	Unverified
Large Language Models Need Holistically Thought in Medical Conversational QA	May 9, 2023	Conversational Question Answering, Question Answering	Code Available
Large Language Model Programs	May 9, 2023	Language Modeling, Language Modelling	Unverified
Event Knowledge Incorporation with Posterior Regularization for Event-Centric Question Answering	May 8, 2023	Language Modelling, Question Answering	Code Available
A Frustratingly Easy Improvement for Position Embeddings via Random Padding	May 8, 2023	Extractive Question-Answering, Position	Unverified
Knowledge-enhanced Agents for Interactive Text Games	May 8, 2023	Instruction Following, Knowledge Graphs	Unverified
SkillQG: Learning to Generate Question for Reading Comprehension Assessment	May 8, 2023	Machine Reading Comprehension, Question Answering	Unverified
FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering	May 7, 2023	Fact Checking, Fact Verification	Unverified
OpenViVQA: Task, Dataset, and Multimodal Fusion Models for Visual Question Answering in Vietnamese	May 7, 2023	Information Retrieval, Question Answering	Code Available
Adaptive loose optimization for robust question answering	May 6, 2023	Extractive Question-Answering, Machine Reading Comprehension	Code Available
From Parse-Execute to Parse-Execute-Refine: Improving Semantic Parser for Complex Question Answering over Knowledge Base	May 5, 2023	Knowledge Base Question Answering, Question Answering	Unverified
NewsQuote: A Dataset Built on Quote Extraction and Attribution for Expert Recommendation in Fact-Checking	May 5, 2023	Articles, Fact Checking	Code Available
Multi-View Graph Representation Learning for Answering Hybrid Numerical Reasoning Question	May 5, 2023	Decoder, Graph Representation Learning	Code Available
Faithful Question Answering with Monte-Carlo Planning	May 4, 2023	Decision Making, Question Answering	Code Available
DomainInv: Domain Invariant Fine Tuning and Adversarial Label Correction For QA Domain Adaptation	May 4, 2023	Domain Adaptation, Question Answering	Unverified
ANetQA: A Large-scale Benchmark for Fine-grained Compositional Reasoning over Untrimmed Videos	May 4, 2023	Question Answering, Spatio-temporal Scene Graphs	Code Available
An automatically discovered chain-of-thought prompt generalizes to novel models and datasets	May 4, 2023	Question Answering	Unverified
VideoOFA: Two-Stage Pre-Training for Video-to-Text Generation	May 4, 2023	Decoder, Question Answering	Unverified
Analysis of Visual Question Answering Algorithms with attention model	May 4, 2023	Question Answering, Visual Question Answering	Unverified
Gpt-4: A Review on Advancements and Opportunities in Natural Language Processing	May 4, 2023	Language Modeling, Language Modelling	Unverified
Pay More Attention to Relation Exploration for Knowledge Base Question Answering	May 3, 2023	Knowledge Base Question Answering, Question Answering	Unverified
AttenWalker: Unsupervised Long-Document Question Answering via Attention-based Graph Walking	May 3, 2023	Few-Shot Learning, Question Answering	Code Available
Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings	May 3, 2023	Data Augmentation, Question Answering	Unverified
Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime	May 3, 2023	Image Captioning, Question Answering	Unverified
MAFiD: Moving Average Equipped Fusion-in-Decoder for Question Answering over Tabular and Textual Data	May 2, 2023	Decoder, Question Answering	Code Available
CHIC: Corporate Document for Visual question Answering	May 1, 2023	Information Retrieval, Question Answering	Unverified
Multimodal Graph Transformer for Multimodal Question Answering	Apr 30, 2023	Question Answering	Unverified
Prompt Engineering for Healthcare: Methodologies and Applications	Apr 28, 2023	Machine Translation, Prompt Engineering	Unverified
ChatGPT in the Classroom: An Analysis of Its Strengths and Weaknesses for Solving Undergraduate Computer Science Questions	Apr 28, 2023	Chatbot, Language Modeling	Unverified
Analyzing Vietnamese Legal Questions Using Deep Neural Networks with Biaffine Classifiers	Apr 27, 2023	Dependency Parsing, POS	Code Available
q2d: Turning Questions into Dialogs to Teach Models How to Search	Apr 27, 2023	Language Modelling, Large Language Model	Unverified
HeySQuAD: A Spoken Question Answering Dataset	Apr 26, 2023	Question Answering	Code Available
Semantic Compression With Large Language Models	Apr 25, 2023	Code Generation, Information Retrieval	Unverified



Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified