SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Papers

Showing 1020110225 of 10817 papers

TitleStatusHype
Stochastic RAG: End-to-End Retrieval-Augmented Generation through Expected Utility Maximization0
STOC-TOT: Stochastic Tree-of-Thought with Constrained Decoding for Complex Reasoning in Multi-Hop Question Answering0
Story Comprehension for Predicting What Happens Next0
Story Generation from Visual Inputs: Techniques, Related Tasks, and Challenges0
StoryQA: Story Grounded Question Answering Dataset0
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering0
Streaming Long Video Understanding with Large Language Models0
Streamlining Biomedical Research with Specialized LLMs0
STREET: A Multi-Task Structured Reasoning and Explanation Benchmark0
StressTest: Can YOUR Speech LM Handle the Stress?0
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding Tasks0
Strong and Efficient Baselines for Open Domain Conversational Question Answering0
Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks0
StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding0
Structural analysis of an all-purpose question answering model0
Structural Embedding of Syntactic Trees for Machine Comprehension0
Structural Encoding and Pre-training Matter: Adapting BERT for Table-Based Fact Verification0
StructuralLM: Structural Pre-training for Form Understanding0
Structural Representations for Learning Relations between Pairs of Texts0
Structure Causal Models and LLMs Integration in Medical Visual Question Answering0
Structured Alignment Networks for Matching Sentences0
Structured and Logical Representations of Assamese Text for Question-Answering System0
Structured Attention Networks0
Structured Context and High-Coverage Grammar for Conversational Question Answering over Knowledge Graphs0
Structured Event Retrieval over Microblog Archives0
Show:102550
← PrevPage 409 of 433Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IE-Net (ensemble)EM90.94Unverified
2FPNet (ensemble)EM90.87Unverified
3IE-NetV2 (ensemble)EM90.86Unverified
4SA-Net on Albert (ensemble)EM90.72Unverified
5SA-Net-V2 (ensemble)EM90.68Unverified
6FPNet (ensemble)EM90.6Unverified
7Retro-Reader (ensemble)EM90.58Unverified
8EntitySpanFocusV2 (ensemble)EM90.52Unverified
9TransNets + SFVerifier + SFEnsembler (ensemble)EM90.49Unverified
10EntitySpanFocus+AT (ensemble)EM90.45Unverified