SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks such as community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics such as exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
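As a rough illustration of the EM and F1 metrics mentioned above, the following minimal sketch follows the SQuAD-style convention of normalizing answers before comparing them. The function names (`normalize`, `exact_match`, `f1_score`) and the exact normalization steps are assumptions made for this example; individual benchmarks may define the metrics differently.

```python
import re
import string
from collections import Counter


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize(prediction).split()
    ref_tokens = normalize(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

When a question has several acceptable reference answers, a common choice is to score the prediction against each one and keep the maximum, then average over all questions.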


Papers

Showing 10201-10250 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| STOC-TOT: Stochastic Tree-of-Thought with Constrained Decoding for Complex Reasoning in Multi-Hop Question Answering | | 0 |
| Story Comprehension for Predicting What Happens Next | | 0 |
| Story Generation from Visual Inputs: Techniques, Related Tasks, and Challenges | | 0 |
| StoryQA: Story Grounded Question Answering Dataset | | 0 |
| Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering | | 0 |
| Streaming Long Video Understanding with Large Language Models | | 0 |
| Streamlining Biomedical Research with Specialized LLMs | | 0 |
| STREET: A Multi-Task Structured Reasoning and Explanation Benchmark | | 0 |
| StressTest: Can YOUR Speech LM Handle the Stress? | | 0 |
| Stress Test Evaluation of Transformer-based Models in Natural Language Understanding Tasks | | 0 |
| Strong and Efficient Baselines for Open Domain Conversational Question Answering | | 0 |
| Strong Baselines for Simple Question Answering over Knowledge Graphs with and without Neural Networks | | 0 |
| StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding | | 0 |
| Structural analysis of an all-purpose question answering model | | 0 |
| Structural Embedding of Syntactic Trees for Machine Comprehension | | 0 |
| Structural Encoding and Pre-training Matter: Adapting BERT for Table-Based Fact Verification | | 0 |
| StructuralLM: Structural Pre-training for Form Understanding | | 0 |
| Structural Representations for Learning Relations between Pairs of Texts | | 0 |
| Structure Causal Models and LLMs Integration in Medical Visual Question Answering | | 0 |
| Structured Alignment Networks for Matching Sentences | | 0 |
| Structured and Logical Representations of Assamese Text for Question-Answering System | | 0 |
| Structured Attention Networks | | 0 |
| Structured Context and High-Coverage Grammar for Conversational Question Answering over Knowledge Graphs | | 0 |
| Structured Event Retrieval over Microblog Archives | | 0 |
| Structured Learning for Taxonomy Induction with Belief Propagation | | 0 |
| Structured List-Grounded Question Answering | | 0 |
| Structured Memory Mechanisms for Stable Context Representation in Large Language Models | | 0 |
| Structured Outputs Enable General-Purpose LLMs to be Medical Experts | | 0 |
| Structured Two-stream Attention Network for Video Question Answering | | 0 |
| Structure Guided Multi-modal Pre-trained Transformer for Knowledge Graph Reasoning | | 0 |
| Structure Learning for Neural Module Networks | | 0 |
| Structuring an unordered text document | | 0 |
| STRUDEL: Structured Dialogue Summarization for Dialogue Comprehension | | 0 |
| StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models | | 0 |
| Studio Ousia's Quiz Bowl Question Answering System | | 0 |
| Studying Strategically: Learning to Mask for Closed-book QA | | 0 |
| Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency | | 0 |
| Study of Similarity Measures as Features in Classification for Answer Sentence Selection Task in Hindi Question Answering: Language-Specific v/s Other Measures | | 0 |
| SubGen: Token Generation in Sublinear Time and Memory | | 0 |
| Subgraph Retrieval Enhanced by Graph-Text Alignment for Commonsense Question Answering | | 0 |
| Submodular Minimax Optimization: Finding Effective Sets | | 0 |
| Subspace Approximation for Approximate Nearest Neighbor Search in NLP | | 0 |
| Subtopic Annotation in a Corpus of News Texts: Steps Towards Automatic Subtopic Segmentation | | 0 |
| Successive Prompting for Decomposing Complex Questions | | 0 |
| SUGAR: Leveraging Contextual Confidence for Smarter Retrieval | | 0 |
| SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India | | 0 |
| Summarizing Community-based Question-Answer Pairs | | 0 |
| Summarizing Lengthy Questions | | 0 |
| SUNAR: Semantic Uncertainty based Neighborhood Aware Retrieval for Complex QA | | 0 |
| Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation | | 0 |
Page 205 of 217

Benchmark Results

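| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |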