SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
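As a rough illustration of the two metrics named above, the sketch below computes exact match and token-level F1 for a single prediction against a single reference answer. It assumes the normalization commonly used for SQuAD-style evaluation (lowercasing, stripping punctuation and English articles, collapsing whitespace). The function names are illustrative and are not taken from any particular benchmark's official scorer.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall after normalization."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # Assumption: if either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    print(exact_match("The Eiffel Tower", "eiffel tower"))
    print(f1_score("Eiffel Tower in Paris", "the Eiffel Tower"))
```

Benchmarks that provide several reference answers per question usually take the maximum score over the references and then average over all questions. That aggregation step is left out here to keep the sketch short.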


Papers

Showing 7801–7825 of 10817 papers

Title | Status | Hype
Question Answering based Clinical Text Structuring Using Pre-trained Language Model | | 0
Question-Answering Based Summarization of Electronic Health Records using Retrieval Augmented Generation | | 0
Gradient-guided Attention Map Editing: Towards Efficient Contextual Hallucination Mitigation | | 0
GRADE: Quantifying Sample Diversity in Text-to-Image Models | | 0
FINET: Context-Aware Fine-Grained Named Entity Typing | | 0
Question Answering for Complex Electronic Health Records Database using Unified Encoder-Decoder Architecture | | 0
Component Analysis for Visual Question Answering Architectures | | 0
ComPO: Community Preferences for Language Model Personalization | | 0
A State-transition Framework to Answer Complex Questions over Knowledge Base | | 0
Question Answering from Unstructured Text by Retrieval and Comprehension | | 0
A Memory Model for Question Answering from Streaming Data Supported by Rehearsal and Anticipation of Coreference Information | | 0
Question Relatedness on Stack Overflow: The Task, Dataset, and Corpus-inspired Models | | 0
Question Answering in Natural Language: the Special Case of Temporal Expressions | | 0
Question Answering in the Biomedical Domain | | 0
GPTs and Language Barrier: A Cross-Lingual Legal QA Examination | | 0
Question-Answering Model for Schizophrenia Symptoms and Their Impact on Daily Life using Mental Health Forums Data | | 0
Finetuning Language Models to Emit Linguistic Expressions of Uncertainty | | 0
Question Answering on Knowledge Bases and Text using Universal Schema and Memory Networks | | 0
Question Answering on Linked Data: Challenges and Future Directions | | 0
Question Answering on Patient Medical Records with Private Fine-Tuned LLMs | | 0
Question Answering on Scholarly Knowledge Graphs | | 0
Question Answering Over Biological Knowledge Graph via Amazon Alexa | | 0
Question Answering over Curated and Open Web Sources | | 0
GPTR: Gestalt-Perception Transformer for Diagram Object Detection | | 0
CompMix: A Benchmark for Heterogeneous Question Answering | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 | | Unverified
2 | FPNet (ensemble) | EM | 90.87 | | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified
6 | FPNet (ensemble) | EM | 90.6 | | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified