Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3551–3600 of 10817 papers

Title	Date	Tasks	Status
A Concrete Chinese NLP Pipeline	Jun 1, 2015	Coreference ResolutionEntity Linking	—Unverified
EvidenceMap: Learning Evidence Analysis to Unleash the Power of Small Language Models for Biomedical Question Answering	Jan 22, 2025	Answer GenerationGenerative Question Answering	—Unverified
Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform	Jan 1, 2025	Code GenerationImage Generation	—Unverified
Do We Need to Differentiate Negative Candidates Before Training a Neural Ranker?	Nov 16, 2021	Data AugmentationQuestion Answering	—Unverified
An Online Question Answering System based on Sub-graph Searching	Jul 29, 2021	Answer GenerationKnowledge Graphs	—Unverified
E-ViLM: Efficient Video-Language Model via Masked Video Modeling with Semantic Vector-Quantized Tokenizer	Nov 28, 2023	Language ModelingLanguage Modelling	—Unverified
EviNets: Neural Networks for Combining Evidence Signals for Factoid Question Answering	Jul 1, 2017	Answer SelectionFeature Engineering	—Unverified
EvolveSearch: An Iterative Self-Evolving Search Agent	May 28, 2025	Multi-hop Question AnsweringQuestion Answering	—Unverified
Entity Retrieval for Answering Entity-Centric Questions	Aug 5, 2024	Entity RetrievalQuestion Answering	—Unverified
FoRAG: Factuality-optimized Retrieval Augmented Generation for Web-enhanced Long-form Question Answering	Jun 19, 2024	Answer GenerationForm	—Unverified
EVQAScore: Efficient Video Question Answering Data Evaluation	Nov 11, 2024	Keyword ExtractionQuestion Answering	—Unverified
EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems	Jun 14, 2024	Question AnsweringRetrieval	—Unverified
Forewords	Dec 1, 2017	Emotion RecognitionIntent Classification	—Unverified
Examining the Commitments and Difficulties Inherent in Multimodal Foundation Models for Street View Imagery	Aug 23, 2024	Question AnsweringZero-Shot Learning	—Unverified
On the Need of Cross Validation for Discourse Relation Classification	Apr 1, 2017	ClassificationGeneral Classification	—Unverified
Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR	May 27, 2024	Question AnsweringTAG	—Unverified
Double Visual Defense: Adversarial Pre-training and Instruction Tuning for Improving Vision-Language Model Robustness	Jan 16, 2025	Adversarial DefenseAdversarial Robustness	—Unverified
A Java Framework for Multilingual Definition and Hypernym Extraction	Aug 1, 2013	Question AnsweringRelation Extraction	—Unverified
Excitatory or Inhibitory: A New Semantic Orientation Extracts Contradiction and Causality from the Web	Jul 1, 2012	Natural Language InferenceQuestion Answering	—Unverified
Double Topic Shifts in Open Domain Conversations: Natural Language Interface for a Wikipedia-based Robot Application	Dec 1, 2016	ArticlesChatbot	—Unverified
Beyond Sentential Semantic Parsing: Tackling the Math SAT with a Cascade of Tree Transducers	Sep 1, 2017	coreference-resolutionCoreference Resolution	—Unverified
CATCH: Complementary Adaptive Token-level Contrastive Decoding to Mitigate Hallucinations in LVLMs	Nov 19, 2024	HallucinationLanguage Modeling	—Unverified
Expanding Frozen Vision-Language Models without Retraining: Towards Improved Robot Perception	Aug 31, 2023	Activity RecognitionHuman Activity Recognition	—Unverified
Expanding the Boundaries of Vision Prior Knowledge in Multi-modal Large Language Models	Mar 23, 2025	Question AnsweringVisual Question Answering	—Unverified
Annotation Scheme for Constructing Sentiment Corpus in Korean	Nov 1, 2012	Document ClassificationQuestion Answering	—Unverified
Double Retrieval and Ranking for Accurate Question Answering	Jan 16, 2022	Answer SelectionQuestion Answering	—Unverified
Expediting and Elevating Large Language Model Reasoning via Hidden Chain-of-Thought Decoding	Sep 13, 2024	Contrastive LearningLanguage Modeling	—Unverified
Categorizing Concepts With Basic Level for Vision-to-Language	Jun 1, 2018	ClusteringImage Captioning	—Unverified
Do Transformers Dream of Inference, or Can Pretrained Generative Models Learn Implicit Inferential Rules?	Nov 1, 2020	Multi-hop Question AnsweringQuestion Answering	—Unverified
Experiments on Hybrid Corpus-Based Sentiment Lexicon Acquisition	Apr 1, 2012	Document ClassificationQuestion Answering	—Unverified
Experiments with Easy-first nonprojective constituent parsing	Aug 1, 2014	Dependency ParsingMachine Translation	—Unverified
Expert Finding in Community Question Answering: A Review	Apr 21, 2018	Community Question AnsweringEnsemble Learning	—Unverified
Annotation Methodologies for Vision and Language Dataset Creation	Jul 10, 2016	Action RecognitionImage Description	—Unverified
Do Transformer Networks Improve the Discovery of Rules from Text?	Jun 1, 2022	Language ModelingLanguage Modelling	—Unverified
Beyond Retrieval: Joint Supervision and Multimodal Document Ranking for Textbook Question Answering	May 17, 2025	Document RankingLarge Language Model	—Unverified
A Framework for the Classification and Annotation of Multiword Expressions in Dialectal Arabic	Oct 1, 2014	Entity Extraction using GANGeneral Classification	—Unverified
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey	Dec 3, 2024	Cross-Modal RetrievalNatural Language Understanding	—Unverified
Explainable Artificial Intelligence Recommendation System by Leveraging the Semantics of Adverse Childhood Experiences: Proof-of-Concept Prototype Development	Nov 6, 2020	Explainable artificial intelligenceGraph Generation	—Unverified
Font-Agent: Enhancing Font Understanding with Large Language Models	Jan 1, 2025	Font GenerationQuestion Answering	—Unverified
Explainable Assessment of Healthcare Articles with QA	May 1, 2022	ArticlesExplanation Generation	—Unverified
DoT: An efficient Double Transformer for NLP tasks with tables	Jun 1, 2021	Question Answering	—Unverified
Do Smaller Language Models Answer Contextualised Questions Through Memorisation Or Generalisation?	Nov 21, 2023	Question AnsweringSemantic Similarity	—Unverified
Explainable Fact-checking through Question Answering	Oct 11, 2021	Decision MakingFact Checking	—Unverified
A RAG-based Question Answering System Proposal for Understanding Islam: MufassirQAS LLM	Jan 27, 2024	ArticlesChatbot	—Unverified
Annotation and Analysis of Discourse Relations, Temporal Relations and Multi-Layered Situational Relations in Japanese Texts	Dec 1, 2016	ArticlesNatural Language Inference	—Unverified
Case-Based Abductive Natural Language Inference	Sep 30, 2020	Natural Language InferenceQuestion Answering	—Unverified
Do Sentence Transformers Learn Quasi-Geospatial Concepts from General Text?	Apr 5, 2024	Question AnsweringRecommendation Systems	—Unverified
DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures	Feb 23, 2024	Question AnsweringText Generation	—Unverified
A Concept-Centric Approach to Multi-Modality Learning	Dec 18, 2024	Image-text matchingQuestion Answering	—Unverified
DoReMi: Grounding Language Model by Detecting and Recovering from Plan-Execution Misalignment	Jul 1, 2023	Language ModelingLanguage Modelling	—Unverified

Show:10 25 50

← PrevPage 72 of 217Next →

All datasets SQuAD2.0 SQuAD1.1 HotpotQA PIQA BoolQ COPA TriviaQA SQuAD1.1 dev Natural Questions OpenBookQA TruthfulQA MultiRC

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	IE-Net (ensemble)	EM	90.94	—	Unverified
2	FPNet (ensemble)	EM	90.87	—	Unverified
3	IE-NetV2 (ensemble)	EM	90.86	—	Unverified
4	SA-Net on Albert (ensemble)	EM	90.72	—	Unverified
5	SA-Net-V2 (ensemble)	EM	90.68	—	Unverified
6	FPNet (ensemble)	EM	90.6	—	Unverified
7	Retro-Reader (ensemble)	EM	90.58	—	Unverified
8	EntitySpanFocusV2 (ensemble)	EM	90.52	—	Unverified
9	TransNets + SFVerifier + SFEnsembler (ensemble)	EM	90.49	—	Unverified
10	EntitySpanFocus+AT (ensemble)	EM	90.45	—	Unverified