SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
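For readers unfamiliar with these metrics, here is a minimal sketch of how EM and token-level F1 are commonly computed for extractive question answering. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles); the function names are illustrative and are not taken from any benchmark's official scorer.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the SQuAD-style convention; other benchmarks
    may normalize answers differently.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Harmonic mean of token-level precision and recall."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

When a question has several reference answers, scorers typically take the maximum over references and then average over all questions; that aggregation step is omitted here.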


Papers

Showing 10226-10250 of 10817 papers

Title | Status | Hype
Structured Learning for Taxonomy Induction with Belief Propagation | | 0
Structured List-Grounded Question Answering | | 0
Structured Memory Mechanisms for Stable Context Representation in Large Language Models | | 0
Structured Outputs Enable General-Purpose LLMs to be Medical Experts | | 0
Structured Two-stream Attention Network for Video Question Answering | | 0
Structure Guided Multi-modal Pre-trained Transformer for Knowledge Graph Reasoning | | 0
Structure Learning for Neural Module Networks | | 0
Structuring an unordered text document | | 0
STRUDEL: Structured Dialogue Summarization for Dialogue Comprehension | | 0
StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models | | 0
Studio Ousia's Quiz Bowl Question Answering System | | 0
Studying Strategically: Learning to Mask for Closed-book QA | | 0
Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency | | 0
Study of Similarity Measures as Features in Classification for Answer Sentence Selection Task in Hindi Question Answering: Language-Specific v/s Other Measures | | 0
SubGen: Token Generation in Sublinear Time and Memory | | 0
Subgraph Retrieval Enhanced by Graph-Text Alignment for Commonsense Question Answering | | 0
Submodular Minimax Optimization: Finding Effective Sets | | 0
Subspace Approximation for Approximate Nearest Neighbor Search in NLP | | 0
Subtopic Annotation in a Corpus of News Texts: Steps Towards Automatic Subtopic Segmentation | | 0
Successive Prompting for Decomposing Complex Questions | | 0
SUGAR: Leveraging Contextual Confidence for Smarter Retrieval | | 0
SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India | | 0
Summarizing Community-based Question-Answer Pairs | | 0
Summarizing Lengthy Questions | | 0
SUNAR: Semantic Uncertainty based Neighborhood Aware Retrieval for Complex QA | | 0
Page 410 of 433

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | IE-Net (ensemble) | EM | 90.94 | | Unverified
2 | FPNet (ensemble) | EM | 90.87 | | Unverified
3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified
4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified
5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified
6 | FPNet (ensemble) | EM | 90.6 | | Unverified
7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified
8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified
9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified
10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified