SOTAVerified

Question Answering

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluating question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like exact match (EM) and F1. Some recent top-performing models are T5 and XLNet.
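As a rough illustration of the EM and F1 metrics mentioned above, the sketch below scores a single predicted answer against a single reference answer. It assumes SQuAD-style answer normalization (lowercasing, stripping punctuation and English articles, collapsing whitespace) and token-level F1. The function names are illustrative, not taken from any official evaluation script, and other benchmarks may normalize or aggregate differently.

```python
import re
import string
from collections import Counter

_PUNCTUATION = set(string.punctuation)


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace.

    Assumption: this mirrors the normalization commonly used for SQuAD-style
    evaluation; a given benchmark may define it differently.
    """
    text = text.lower()
    text = "".join(ch for ch in text if ch not in _PUNCTUATION)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> float:
    """Return 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # If either side is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

When a question has several reference answers, a common convention is to take the maximum score over the references and then average over all questions. That aggregation step is left out here.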


Papers

Showing 10076-10100 of 10817 papers

| Title | Status | Hype |
| --- | --- | --- |
| Similarity-Based Reconstruction Loss for Meaning Representation | | 0 |
| Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction | | 0 |
| Simple and Effective Semi-Supervised Question Answering | | 0 |
| Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval | | 0 |
| Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps | | 0 |
| Simple Large-scale Relation Extraction from Unstructured Text | | 0 |
| SimpleLLM4AD: An End-to-End Vision-Language Model with Graph Visual Question Answering for Autonomous Driving | | 0 |
| Simple or Complex? Classifying Questions by Answering Complexity | | 0 |
| Simple Question Answering by Attentive Convolutional Neural Network | | 0 |
| Simple Question Answering with Subgraph Ranking and Joint-Scoring | | 0 |
| SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models | | 0 |
| Simple yet Effective Bridge Reasoning for Open-Domain Multi-Hop Question Answering | | 0 |
| Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases | | 0 |
| Simplifying Sparse Expert Recommendation by Revisiting Graph Diffusion | | 0 |
| SimpsonsVQA: Enhancing Inquiry-Based Learning with a Tailored Dataset | | 0 |
| SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains | | 0 |
| SimsterQ: A Similarity based Clustering Approach to Opinion Question Answering | | 0 |
| Simulating Bandit Learning from User Feedback for Extractive Question Answering | | 0 |
| SimVQA: Exploring Simulated Environments for Visual Question Answering | | 0 |
| Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization | | 0 |
| Single-Modal Entropy based Active Learning for Visual Question Answering | | 0 |
| Single Training Dimension Selection for Word Embedding with PCA | | 0 |
| SIPDO: Closed-Loop Prompt Optimization via Synthetic Data Feedback | | 0 |
| SIRIUS-LTG: An Entity Linking Approach to Fact Extraction and Verification | | 0 |
| SITE: towards Spatial Intelligence Thorough Evaluation | | 0 |

Benchmark Results

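| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | IE-Net (ensemble) | EM | 90.94 | | Unverified |
| 2 | FPNet (ensemble) | EM | 90.87 | | Unverified |
| 3 | IE-NetV2 (ensemble) | EM | 90.86 | | Unverified |
| 4 | SA-Net on Albert (ensemble) | EM | 90.72 | | Unverified |
| 5 | SA-Net-V2 (ensemble) | EM | 90.68 | | Unverified |
| 6 | FPNet (ensemble) | EM | 90.6 | | Unverified |
| 7 | Retro-Reader (ensemble) | EM | 90.58 | | Unverified |
| 8 | EntitySpanFocusV2 (ensemble) | EM | 90.52 | | Unverified |
| 9 | TransNets + SFVerifier + SFEnsembler (ensemble) | EM | 90.49 | | Unverified |
| 10 | EntitySpanFocus+AT (ensemble) | EM | 90.45 | | Unverified |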